Introduction


R. E. Asher and Christopher Moseley 


When the original edition of this atlas, the first 
attempt at a comprehensive mapping of all of the 
world’s languages in a single volume, was published 
in 1994, the possible need in due course for a revised 
edition, resulting for example from the discovery of 
new languages, was envisaged. The fact that the print 
run of the original edition has been exhausted 
provides an opportunity to make such a revision. 
Research undertaken during the closing years of the 
twentieth century and beyond has added to the list of 
known languages and has provided more information 
about the location of some of the languages mapped 
earlier, about their genetic affiliations and about 
numbers of speakers. Every attempt has been made to 
ensure that the work is as up to date as possible. 
Two sections, for instance, incorporate the results of 
fieldwork undertaken during the first quarter of 2006. 


Arrangement of the atlas 


As was the case in the first edition, the atlas is divided 
into eight regions, namely: 


The Americas 

Australasia and the Pacific 

East and South-East Asia 
Southern Asia 

Northern Asia and Eastern Europe 
Western Europe 

Middle East and North Africa 
Sub-Saharan Africa 


The first of these regions is once again subdivided 
into three sections: North America, followed by 
Meso-America and South America. As in the first 
edition, the blocks of maps for each of the ten 
sections are preceded in each case by an explanatory 
text, which outlines the general linguistic history 
of the area shown, the genetic relations of the lan- 
guages spoken within it and some of their character- 
istic structural features, together with statistical and 
sociolinguistic information. The texts are written 
with the general reader rather than the specialist in 
mind. 

Further detailed information about individual 
languages, where available, will be found in the 
more specialized works listed in the further reading 
sections, found at the end of the text for each region. 
For reasons of space, the atlas does not aim at 
detailed descriptions of individual languages: for 
these the reader might consult a work such as George 
Campbell’s Compendium of the World’s Languages, or 
encyclopedias such as the International Encyclopedia 
of Linguistics (2nd edition, ed. W. J. Frawley) and 
The Encyclopedia of Language and Linguistics (ed. 
R.E. Asher). There is similarly no detailed informa- 
tion about the world’s numerous writing systems, of 
which one of a number of comprehensive accounts 
currently available is The World’s Writing Systems 
(ed. P.T. Daniels and W. Bright). 


Aims of the atlas 


The principal concern of the atlas is the contemporary
distribution of the world’s languages. There are,
however, as in the first edition, two major exceptions 
to this. These are the first two of the regions covered. 
The colonization of distant parts of the world by 
maritime nations of Europe from the seventeenth 
century onwards led both to the displacement of 
indigenous populations and to the extinction of some 
of their languages. Because these events are relatively 
recent in the world’s history and because the language 
names involved, even when the language is extinct, 
are still associated with many of the displaced com- 
munities, the view was taken in the planning stages of 
the first edition that it would be appropriate to depart 
from the contemporary theme as far as the Americas 
and Australasia were concerned, and this approach 
has been maintained in the second edition. For this 
reason, the maps in the first four sections of the atlas 
are of two kinds: ‘time of contact’ maps and ‘contemporary’
maps, the first of these indicating as
accurately as possible the areas occupied by speakers 
of the languages in question when the European col- 
onists first appeared on the scene. The presentation of 
these two major regions nevertheless differs in an 
important respect. In the case of Australia, the abo- 
riginal populations are so diminished that the time of 
contact map is the only detailed one given: the areas 
where the few surviving languages are spoken are 
indicated in a general way in the following inset map. 
The number of languages spoken worldwide is 
generally estimated to be between 6,000 and 7,000; 
the number of names associated with them certainly 
exceeds 40,000. Not all of these names are to be found 
in this volume. What has been provided as a mini- 
mum is the name by which a language is most com- 
monly referred to in English and, where this differs, 
the name used by scholars writing about a language 
or language family. Commonly used variant spellings, 
too, are provided. A very comprehensive list of alter- 
native names is given in Ethnologue (ed. R. G. Gordon). 
While the basic aim in this atlas, as in its predeces- 
sor, is to map as far as current knowledge allows the 
location of all of these living languages, no matter 
how small the number of speakers, it will already be 
clear that some of the maps, in the sections devoted 
to the Americas and to Australasia, also present lan- 
guages that are already extinct. Other languages in 
these and other regions are seriously endangered and 
some of these will within a few years have no surviv- 
ing speakers. Introductions to the various sections 
provide some information about this developing situ- 
ation; for a fuller account, readers may consult the 
Encyclopedia of the World’s Endangered Languages 
(ed. C. Moseley). As the combined forces of ease of 
communication, education and centralization work 
together to put more and more languages at risk, lin- 
guistic fieldwork in areas that are remote from indus- 
trialized and urban regions is increasingly important, 
both for the purpose of making possible full descriptions
and records of threatened languages and for
that of ensuring that some few languages that may
have remained undiscovered are found and studied. 


Language boundaries 


The information provided by the maps can be looked 
at from two points of view: it shows the locations in
which a given language is spoken and also which lan- 
guages are spoken in a particular area. A reader look- 
ing for the first type of information will begin by 
searching the index of languages at the end of the 
volume; a reader looking for the second type will look 
at the list of maps that precedes this introduction. 
The provision of this information in map form 
involves a number of problems, among which is that 
of determining whether to regard a speech variety as 
a language or as a dialect of a language. The criteria 
for deciding are varied and to some extent contra- 
dictory. They include that of mutual intelligibility — 
the degree to which in a pair of languages each is 
intelligible to speakers of the other — and that of the 
percentage of basic vocabulary shared by a pair or a 
group of speech varieties. Cutting across these are 
political or sociocultural criteria. 

An examination based on the amount of shared 
vocabulary or on the extent of mutual intelligibility 
could lead only to the conclusion that Hindi and
Urdu, for instance, represent a single language. Their 
place in South Asian societies and the use of entirely 
different writing systems, among other factors, 
determine their classification as different languages. 
There are other special cases that illustrate differences 
in the way that the words ‘language’ and ‘dialect’ are 
used as contrasting terms, among them those of Chi- 
nese and Arabic. Thus the seven groups of languages 
spoken in China that are conventionally referred to 
as dialects of Chinese fail the test of mutual intelligi- 
bility when it is applied to them. Though Arabic is 
often thought of as a single language, there are several 
regional contemporary standards that any objective 
criterion based on shared linguistic features will rec- 
ognize as separate languages. Within these regional 
variants there are various widely spoken subvariants 
that are readily identifiable, even though the boun- 
daries between them are not always sharp. 

Account has been taken of these special cases in 
the mapping process. Nevertheless, this atlas does not 
undertake the task, impossible within the confines of 
a single volume, of mapping all recognized dialects of 
all languages. The areas where major ones are spoken, 
particularly if they are sometimes held to merit the 
label of language, are indicated. Where helpful, the 
two categories of language and dialect are indicated 
in keys to maps by the fact of dialects being grouped 
under a single reference number but distinguished by 
a following letter. 

The areas where different languages are spoken 
are delineated with a boundary line, and the space 
encompassed by the line is numbered. If a language 
boundary corresponds to a political boundary, inter- 
national or internal, the line marking this political 
boundary also marks the linguistic boundary. The 
first edition of the atlas used a hierarchical system of 
lines to differentiate phyla, stocks, groups, subgroups, 
languages and dialects. Given the lack of agreement 
on a strict definition of these terms and the problem 
of consistent application of them from one region to 
another, the second edition has simplified the situ- 
ation by employing only one type of line to delimit 
languages. The place of a language within its genetic 
classification can be deduced from the structure of 
the legend and from further information found in the 
extensive text which accompanies each section of the 
atlas. A dialect area is indicated in
two different ways. Where it has seemed appropriate
to the contributor, boundary lines are drawn in — as is 
the norm for languages. In other cases, the names of 
dialects are written directly on the maps. 

Section editors have used informed judgement to 
determine where the boundaries between languages 
should be placed, but such boundaries are rarely 
definitive. This judgement is based on a range of 
sources, including national censuses, national atlases, 
published descriptions of languages and other schol- 
arly sources, including a contributor’s own extensive 
fieldwork. Source material varies considerably in its 
reliability. National censuses of many countries are 
infrequently held. Where they are held, the informa- 
tion gathered varies in both quantity and quality. 
Some censuses that provide data on language 
distribution limit this to languages that have more 
than a specified number of speakers. Information 
on minority languages may be withheld where a 
government wishes to present the view of a nation 
unified by one or two official languages. 

Such factors introduce an element of uncertainty 
in the drawing of lines to indicate the boundaries of 
a language area. Even apart from this, simple facts 
about language distribution mean that there is con- 
siderable variability in the degree of confidence with 
which boundary lines can be drawn. Except in cases 
of certain natural features that may limit the ease of 
population movement — such as a wide river, a high 
mountain range or the sea — there is almost always a 
degree of overlap between areas where a language or 
languages predominate. National borders may limit 
the amount of overlap, but commonly areas on both 
sides of an international frontier are bilingual. How- 
ever, the alternative to drawing boundary lines in this 
way, namely the insertion of language numbers or 
names on a base map, is usually far from satisfactory 
since no idea of the true spread of a language may be 
gleaned unless the map user is skilled in interpolat- 
ing boundaries — a skill that cannot reasonably be 
assumed. It is therefore necessary for the reader to 
keep in mind that the language boundaries shown are 
rarely definite or ‘hard’, but merely indicative. They 
are, moreover, subject to change over time. 

The scale of the maps in the atlas is determined 
primarily by the density of the information shown, 
but consideration has also been given to the ease of 
use of the atlas, so that a map may show a region or 
country at a scale that allows the whole area to be 
seen at a glance, even if some regions contain areas of 
smaller language polygons than other maps. Where it 
is practicable to use a scale that allows it, the language 
polygons are shown as monolingual. However, given 
the extent to which many parts of the earth’s surface 
are peopled by societies that are bilingual or multi- 
lingual, some demarcated areas have unavoidably to 
be shown with two, three, or more, language num- 
bers. Yet even this sort of mapping cannot indicate 
the language situation in many population centres 
across the world that are linguistically very mixed. To 
illustrate this point, a number of diagrams have been
included in the introductions to some sections
showing the situation in some major cities, as well as 
in some non-metropolitan areas where there is a 
mingling of different groups that cannot readily be 
shown on a map. The unavailability of relevant stat- 
istics for many countries has meant that it has not 
been possible to do this uniformly for all regions. 

Areas denoted as ‘uninhabited’ are either com- 
pletely uninhabited or so sparsely inhabited that the 
section editor deems it unreasonable to assign a lan- 
guage to the area. Clearly some areas with very few or 
no permanent inhabitants have nomadic or otherwise 
transitory populations (such as the Sahara) and lan- 
guages assigned to such areas should not imply per- 
manent settlement. 


Colour on the maps 


Colours have been assigned to the maps to show 
different sets of languages. Sometimes such a set
will represent a whole language family; elsewhere, 
depending on the size of the family, it will represent 
a subfamily, a group or a subgroup within the 
family. That is to say that the aim in the selection 
of colours is to enhance the maps by showing lan- 
guages with a common classification in the same 
colour. The colours assigned to each map are usually 
unique to that map, but where appropriate we have 
employed the same colour for the same language 
where it appears on facing pages and direct com- 
parison is appropriate (for instance comparing two 
maps of South America showing the situation at the 
time of European contact and the contemporary 
situation). 

There may superficially appear to be a case for 
using the same colour for the same language through- 
out the atlas. Unfortunately, there are good practical 
reasons why this is not possible. To do so would mean 
employing about 6,000 unique colours. Even if it were 
possible to mix and print 6,000 different colours (and 
it is not), the human eye is limited in the number of 
different colours it can distinguish. If, for instance,
shades of blue were employed to denote the various 
Niger-Congo languages, it would mean using hun- 
dreds of different blues if no language were to be rep- 
resented by the same colour as any other. The average 
human eye can only distinguish about seven different 
shades of any hue, and that only when they are shown 
in a sequence of increasing or decreasing value. It 
would therefore be of no use to show fifty different 
shades of blue on a single map, since the reader could 
not distinguish between them. Accordingly, colours 
have been chosen which provide contrast. In order to 
avoid the idea of a hierarchy of languages, similar 
chromas (saturation or ‘strength’ of colour) of a 
variety of hues have been used. 


Changes to the cartography in the 2nd edition


The design of the atlas has been changed for the 
second edition, with a reconsideration of the colours 
used and clearer title information. Because of the 
relevance of geographical features to the peopling of 
the Earth’s surface, to the distribution of languages 
and, in some cases, to the distinctness of boundaries 
between languages, two further general maps have 
been added at the head of the maps in each section — 
one showing the natural topography of the region 
and one indicating population density. What was the 
opening map in each section in the first edition — the 
one indicating the way in which the region is divided 
to produce the set of language maps — immediately 
follows these. 

The front endpapers provide, on a map of the 
whole world, a general index to the 108 full-page or 
double-page maps. The back endpapers consist of 
two related parts: first is a fictional map to which are 
attached explanations of the various methods used to 
show where different languages (and in some cases 
major dialects) are located; second is an equally fic- 
tional legend showing how the genetic structure of 
language families is presented and how numbers, 
repeating those on a map, are used to identify particu- 
lar languages. To allow quicker access to the adjoining 
maps, continuation arrows have been introduced 
on each map. Base topographic detail has also been 
revised, with additional cities added, new country 
names and international borders shown, and the 
revision of changed natural features (for example, the 
diminishing Aral Sea). 

Where, in the first edition, some peripheral 
areas of maps were left unfilled, with a note refer- 
ring the reader to the appropriate map, maps now 
show as far as possible continuous language infor- 
mation. Where limited space means that the inclu- 
sion of all languages on the edge of the main area of 
interest would mean a legend too large for the page, 
we have included outlines of the language boundar- 
ies of those areas, so that the reader will be able to 
locate quickly the same language pattern on the map
showing that area. Notes still refer the reader to the 
appropriate map. For more detailed information 
on the areas peripheral to the featured region, the 
reader will still need to consult the principal map 
showing the area in question. For example, Map 65 
(Germany, Austria, Liechtenstein and Switzerland) 
shows part of Italy with its principal languages, but 
for dialects and minor languages of Italy, the reader 
will need to consult Map 67 where Italy is the focus 
of the map. 

As well as a thorough revision of all maps in the 
atlas, publication of the second edition has presented 
the opportunity to recast completely some of the 
maps which appeared in the first edition. Revision 
has also allowed the showing of more details of the 
languages of an area — and often more languages — as 
new research has identified their existence or location. 
For example, Map 50 (Japan, Korea and Taiwan) has
been redesigned to include a much larger-scale map 
of Taiwan, showing eighteen languages compared 
to the four shown in the first edition. Languages have 
been reclassified where scholarship indicates a 
revision is necessary. The order of some of the maps 
has been revised allowing, for example, on facing 
pages the comparison of the situation at the time of 
European contact with the distribution of languages 
today. 

North America is shown with entirely new maps, 
seven depicting the probable situation at the time 
of contact and two the contemporary distribution of 
languages. Meso-American and South American 
maps have been updated to include lingua francas 
and creoles and national (European) languages, as well 
as incorporating variant spellings of indigenous lan- 
guages. The section on Australasia and the Pacific 
includes a recast double-page map of Australia at the 
time of contact and one showing contemporary lan- 
guages of Australia. Extensive revision of the section 
on South-East Asia includes a large-scale map of 
China’s Yunnan Province with completely revised 
information. A new map of North-East India, Bang- 
ladesh, Bhutan and Nepal (Map 51) complements a 
complete revision of all maps in the Southern Asia 
section. The scale of the map of Central Asia (Map 
57) has been increased, and maps of Eastern Europe 
fully revised. Maps of Western Europe now depict 
continuous language information, without the 
unfilled areas on some maps in the first edition. All 
maps in Section 9 (Middle East and North Africa) 
have been updated and a new double-page map 
shows Sudan and the Horn of Africa. Maps of 
sub-Saharan Africa show continuous language 
information and have been re-ordered as well as 
updated. Finally, maps have been introduced into the 
introductory texts of some sections to provide add- 
itional information — such as the spread of languages 
of intercultural communication — that would not 
match the system of presentation adopted in the main 
maps and, moreover, would not justify the allocation 
of a full page. 
1.1 Introduction


At the time when Europeans first arrived in North 
America, more than 300 native languages were spoken 
in what is now the USA and Canada, classified in 
some 58 independent language families. All of the
surviving languages are endangered, some more so than others;
only a handful are spoken by more than 10,000
people.


1.2 Origins 


Incontrovertible evidence of human activity in the 
western hemisphere can be dated no earlier than 
12,500 years before the present, though opinions vary 
about earlier possible entry of humans to the New 
World. In any event, the Americas were the last major 
land mass to be reached by humans. This not- 
withstanding, there is extensive linguistic diversity 
among American Indian languages, both in terms of 
numbers of languages and language families, and in 
their structural properties. 


1.3 Classification 


The classification presented here reflects the ‘con- 
sensus classification’ (see Campbell 1997, Goddard 
1996b, and Mithun 1999). It indicates how the 322 
languages of North America are classified in 58 units 
of classification (language families and isolates — iso- 
lates are languages with no known relatives; that is, 
language families with only one member): 


• 14 major language families, made up of several
languages with significant subgrouping: (1)
Eskimo-Aleut, (2) Algic, (3) Na-Dene, (5)
Wakashan, (7) Salishan, (17) Utian, (19) Plateau,
(27) Cochimi-Yuman, (33) Uto-Aztecan, (34)
Kiowa-Tanoan, (45) Siouan-Catawban, (47) Caddoan,
(52) Muskogean and (57) Iroquoian.

• 19 minor language families, made up of a small
number of relatively closely related (usually 
contiguous) languages: (6) Chimakuan, (8) Tsim- 
shianic, (9) Chinookan, (12) Coosan, (14) Kala- 
puyan, (15) Wintuan, (16) Maiduan, (18) Yokuts, 
(22) Shastan, (23) Palaihnihan, (24) Pomoan, 
(26) Salinan, (28) Guaicurian, (30) Yukian, 
(32) Chumashan, (35) Keresan, (38) Comecru- 
dan, (49) Atakapan and (55) Timucuan. 

• 25 isolates: (4) Haida, (10) Alsean, (11) Siuslaw,
(13) Takelman, (20) Karuk, (21) Chimariko, 
(25) Yana, (29) Washo, (31) Esselen, (36) Zuni, 
(37) Coahuilteco, (39) Cotoname, (40) Aranama, 
(41) Solano, (42) Karankawa, (43) Kootenai, 
(44) Cayuse, (46) Tonkawa, (48) Adai, (50) Chiti- 
macha, (51) Tunica, (53) Natchez, (54) Yuchi, 
(56) Calusa and (58) Beothuk. 


Contributor: J. Lachlan Mackenzie 


1.3.1 The languages classified 


The languages and classificatory units (language fam- 
ilies and isolates) that were extinct before 1930 are 
indicated by italics; those that have become extinct 
since 1930 are indicated by an asterisk (*). The maps 
of North American Indian languages contain names 
for some languages which are scarcely known; these 
names are not included directly in the classification 
given here due to lack of data. Thus not all names on 
the maps appear in the classification chart. The clas- 
sification is as follows. 


(1) Eskimo-Aleut 
Eskimoan 
Western Eskimo (Yupik) 
1. *Sirenik 
2. East Cape (Naukan) Yupik 
3. Central Siberian Yupik 
4. Central Alaskan Yup'ik 
5. Pacific Yupik (Alutiiq)
Eastern Eskimo (Inuit) 
6. Seward Peninsula Inupiaq 
7. North Alaska Inupiaq 
8. Western Canadian Inuktitut 
9. Eastern Canadian Inuktitut 
10. West Greenlandic 
11. East Greenlandic 




12. Aleut 
Aleut-Russian mixed language 
13. Copper Island Aleut 


(2) Algic 
Algonquian 
Blackfoot 
14. Blackfoot 
Cree-Montagnais 
Cree 
15. Plains Cree 
16. Michif (Plains Cree- 
French Mixed Language) 
17. Woods Cree 
18. Western Swampy Cree 
19. Eastern Swampy Cree 
[includes Moose Cree] 
20. Attikamek 
Montagnais-Naskapi 
East Cree 
21. Southern East Cree 
22. Northern East Cree 
Naskapi 
23. Western Naskapi 
(Naskapi) 
24. Eastern Naskapi 
(Mushuaau Innu) 
Montagnais (Innu) 
25. Western Montagnais
26. Central Montagnais
27. Eastern Montagnais
Arapahoan 
28. Gros Ventre (Atsina) 


29. Arapaho 
30. Nawathinehena 
Cheyenne 
31. Cheyenne 
Menominee 
32. Menominee 
Ojibwayan 
Northern Ojibwe 
33. Severn Ojibwe 
(Oji-Cree) 
34. Northern Algonquin 
Southern Ojibwe 
35. Saulteaux 
36. Central Southern Ojibwe 
(Anishinaabemowin) 
37. Eastern Ojibwe 
[includes ‘Southern 
Algonquin’] 
38. Ottawa (Odawa) 
Old Algonquin 
39. Old Algonquin 
Potawatomi 
40. Potawatomi 
Sauk-Fox-Kickapoo 
41. Sauk-Fox (Meskwaki) 
42. Kickapoo 
43. (Mascouten) 
Shawnee 
44. Shawnee 
*Miami-Illinois 
45. *Miami 
46. *Illinois 
Eastern Algonquian 
Micmac 
47. Micmac 
Abenakian 
48. Maliseet- 
Passamaquoddy 
49. *Eastern Abenaki 
50. Western Abenaki 
Etchemin 
51. Etchemin 
Southern New England Algonquian 
52. Massachusett- 
Narragansett 
53. Loup 
54. Mohegan-Pequot 
55. Quiripi-Unquachog
*Mahican 
56. *Mahican 
Delaware 
57. Munsee 
58. *Unami 
Nanticoke-Conoy 
59. Nanticoke-Conoy 


Virginia Algonquian 
60. Virginia Algonquian 
Carolina Algonquian 
61. Carolina Algonquian 
Ritwan 
*Wiyot 


62. *Wiyot 
Yurok
63. Yurok 


(3) Na-Dene (Narrow Sense) 


Tlingit 
64. Tlingit 
Eyak-Athabaskan 
Eyak 
65. Eyak 
Athabaskan 
Ahtna 
66. Ahtna 
Tanaina 
67. Tanaina 
Koyukon-Ingalik 
68. Ingalik (Deg Hit'an) 
69. Holikachuk 
70. Koyukon 
Tanana 
71. Upper Kuskokwim 
(Kolchan) 
72. Lower Tanana 
73. Tanacross 
74. Upper Tanana 
Tutchone 
75. Northern Tutchone 
76. Southern Tutchone 
Kutchin-Han 
77. Kutchin (Gwich'in) 
78. Han 
Central Cordillera 
79. Tagish 
80. Tahltan 
81. Kaska 
Southeastern Cordillera 
82. Sekani 
83. Beaver 
Dene 
84. Slave (South Slavey) 
85. Mountain Slavey 
86. Bearlake (North Slavey) 
87. Hare (North Slavey) 
88. Dogrib 
89. Chipewyan (Dene 
Soun'line) 
Babine-Carrier 
90. Babine 
91. Carrier 
Chilcotin 
92. Chilcotin (Tsilhqot'in)
Nicola 
93. Nicola 
Tsetsaut 
94. Tsetsaut 
Sarcee 
95. Sarcee (Tsutina) 
Kwalhioqua-Clatskanie 
96. Kwalhioqua-Clatskanie 
Oregon Athabaskan 
97. Upper Umpqua 
98. *Tututni 
99. *Galice-Applegate 
100. Tolowa 
California Athabaskan 
101. Hupa 
102. *Mattole 
103. *Eel River 
104. *Kato (Cahto) 
Southern Athabaskan (Apachean) 
105. Navajo 
106. Western Apache 
Mescalero-Chiricahua 
107. Mescalero 
108. Chiricahua
109. Jicarilla 
110. *Lipan 
111. Kiowa Apache (Plains 
Apache) 


(4) Haida 


112. Haida 


(5) Wakashan 


Northern Wakashan 
113. Haisla 


114. Heiltsuk-Oowekyala 

115. Kwakiutl (Kwak'wala) 
Nootkan 

116. Nootka (Nuuchahnulth) 

117. Nitinaht (Ditidaht) 

118. Makah 


(6) *Chimakuan 
119. Chemakum 
120. *Quileute 


(7) Salishan 
Bella Coola 
121. Bella Coola (Nuxalk) 
Central Salish 
122. Comox 
123. Pentlatch 
124. Sechelt 
125. Squamish 
126. Halkomelem 
127. *Nooksack 
128. Northern Straits Salish 
129. Clallam 
130. Lushootseed 
131. *Twana 
*Tsamosan 
132. *Quinault 
133. *Lower Chehalis 
134. *Upper Chehalis 
135. Cowlitz 
*Tillamook 
136. *Tillamook 
Interior Salish 
137. Lillooet 
138. Thompson 
139. Shuswap (Secwepemc) 
140. Okanagan 
141. Spokane-Kalispel-Flathead 
142. Coeur d’Alene 
143. Columbian 


(8) Tsimshianic 
Nass-Gitksan 
144. Nisga'a 
145. Gitksan 
Maritime Tsimshian 
146. Coast Tsimshian (Sm'algyax) 
147. Southern Tsimshian (Sküüxs)


(9) Chinookan 
148. *Lower Chinook 
149. Cathlamet 
150. Kiksht 


(10) *Alsean 
151. *Alsea 
152. Yaquina 


(11) *Siuslaw 
153. *Siuslaw 


(12) *Coosan 
154. *Hanis 
155. *Miluk 


(13) Takelman 
156. Takelma 


(14) *Kalapuyan 
157. *Tualatin-Yamhill 
158. *Central Kalapuyan 
159. *Yoncalla 


(15) Wintuan 
160. Wintu-Nomlaki 
161. Patwin 


(16) Maiduan 
162. *Maidu 
163. Konkow 
164. *Nisenan 


(17) Utian 
Miwok 

Western Miwok 
165. *Coast Miwok 
166. Lake Miwok 

Eastern Miwok 
167. Bay Miwok 
168. *Plains Miwok 


169. Northern Sierra Miwok 
170. Central Sierra Miwok 
171. Southern Sierra Miwok 
Costanoan 

172. Karkin 

173. Ramaytush 

174. Chochenyo 

175. Tamyen 

176. Awaswas 

177. Chalon 

178. Mutsun 

179. Rumsen 


(18) Yokuts 
180. Palewyami 
181. Buena Vista 
182. Tule-Kaweah 
183. Kings River 
184. *Gashowu 
185. Valley Yokuts 


(19) Plateau (also called Plateau Penutian) 

Klamath 

186. Klamath-Modoc 
Sahaptian 

187. Sahaptin 

188. Nez Perce 
*Molala 

189. *Molala 


(20) Karuk 
190. Karuk 


(21) *Chimariko 
191. *Chimariko 


(22) *Shastan 
192. *Shasta 
193. New River Shasta 
194. Okwanuchu 
195. Konomihu 


(23) Palaihnihan 
196. Achumawi 
197. *Atsugewi 


(24) Pomoan 
Northeastern Pomo 
198. *Northeastern Pomo 
Southeastern Pomo 
199. *Southeastern Pomo 
Eastern Pomo 
200. Eastern Pomo 
Western Pomoan 
Northern Group 
201. Northern Pomo 
Southern Group 
202. Central Pomo 
203. Southern Pomo 
204. Kashaya 


(25) Yana 
205. Yana 


(26) *Salinan 
206. *Antoniano 
207. *Migueleño


(27) Cochimi-Yuman 
Yuman 
Pai 
208. Paipai 
209. Upland Yuman 
River Yuman 
210. Mohave 
211. Quechan (Yuma) 
212. Maricopa 
Diegueño-Cocopa
213. Ipai 
214. Kumeyaay 
215. Tipai 
216. Cocopa 
Kiliwa 
217. Kiliwa 
Cochimi 
218. Northern Cochimi 
219. Southern Cochimi 


(28) Guaicurian 
220. Guaicura (Waykuri) 
221. Pericú (?)
222. Monqui-Didiú (?)


(29) Washo 
223. Washo 


(30) *Yukian 
224. *Yuki 
225. *Wappo 


(31) Esselen 
226. Esselen 


(32) *Chumashan 
Northern Chumash 
227. Obispeño
Southern Chumash
Central Chumash
228. Purisimeño
229. Ineseño
230. *Barbareño
231. Ventureño
Island Chumash
232. Island Chumash (Cruzeño)


(33) Uto-Aztecan 
Numic 
Western Numic 
233. Mono 
234. Northern Paiute 
Central Numic 
235. Panamint 
236. Shoshone (Shoshoni) 
237. Comanche 
Southern Numic 
238. Kawaiisu 
239. Ute-Southern Paiute-Chemehuevi
Tubatulabal 
240. Tubatulabal 
Takic 


Serrano-Kitanemuk
241. Serrano 
242. *Kitanemuk 
Gabrielino 
243. *Gabrielino (Tongva) 
Cupan 
244. Cahuilla 
245. *Cupeño
246. Luiseño
Hopi 
247. Hopi 
Tepiman 
248. Upper Piman (O'odham) 
Taracahitan 
249. Yaqui (Yoeme) 


[The Uto-Aztecan languages of Mexico, 
including other languages in the Tepiman 
and Taracahitic subfamilies, are treated in 
Section 2, p. 54] 


(34) Kiowa-Tanoan 


Kiowa 
250. Kiowa 
Jemez 
251. Jemez 
Tiwa 
Northern Tiwa 
252. Taos 


253. Picuris 
Southern Tiwa 
254. Isleta-Sandia 
Tewa 
255. Tewa 
256. Arizona Tewa (Hano) 
Piro 
257. Piro 


(35) Keresan 
258. Acoma-Laguna 
259. Rio Grande Keresan 


(36) Zuni 
260. Zuni 


(37) Coahuilteco 
261. Coahuilteco 


(38) Comecrudan 
262. Comecrudo 
263. Mamulique 
264. Garza 


(39) Cotoname 
265. Cotoname 


(40) Aranama 
266. Aranama 


(41) Solano 
267. Solano 


(42) Karankawa 
268. Karankawa 


(43) Kootenai 
269. Kutenai (Kootenai, Ktunaxa) 


(44) Cayuse 
270. Cayuse 


(45) Siouan-Catawban 
Siouan 
Missouri River 
271. Hidatsa 
272. Crow 
Mandan 
273. Mandan 
Dakotan 
274. Sioux 
275. Assiniboine 
276. Stoney 
Dhegiha 
277. Omaha-Ponca 
278. Osage 
279. *Kansa 
280. *Quapaw 
Chiwere-Winnebago 
281. Chiwere 
282. Winnebago (Ho-Chunk, 
Hochank) 
Southeastern 
283. Ofo 
284. Biloxi 
285. Tutelo 
*Catawban 
286. *Catawba 
287. Woccon 


(46) *Tonkawa 
288. *Tonkawa 


(47) Caddoan 
Northern Caddoan 
289. Arikara 
290. Pawnee 
291. *Kitsai 
292. Wichita 
Southern Caddoan 
293. Caddo 


(48) Adai 
294. Adai 


(49) Atakapan 
295. Western Atakapa 
296. Eastern Atakapa 


(50) *Chitimacha 
297. *Chitimacha 


(51) *Tunica 
298. *Tunica 


(52) Muskogean 
Western Muskogean 
299. Choctaw 
300. Chickasaw 
Eastern Muskogean 
301. Hitchiti-Mikasuki 
302. Apalachee 
303. Alabama 
304. Koasati 
305. Creek (Muskogee) 


(53) *Natchez 
306. *Natchez 


(54) Yuchi 
307. Yuchi (Euchee) 


(55) Timucuan 
308. Timucua 
309. Tawasa 


(56) Calusa 
310. Calusa 


(57) Iroquoian 
Northern Iroquoian 
Tuscarora-Nottoway 
311. Tuscarora 
312. Nottoway 
Lake Iroquoian 
313. *Huron-Wyandot 
314. Laurentian 
Iroquois Proper 
315. Seneca 
316. Cayuga 
317. Onondaga 
318. Susquehannock 
319. Oneida 
320. Mohawk 
Southern Iroquoian 
321. Cherokee 


(58) Beothuk 
322. Beothuk 


1.3.2 Other classifications 


The first important classification of North American 
languages was that of J. W. Powell (1891), in which 
58 families were distinguished (not all of which 
coincide with the 58 families of the classification pre- 
sented here). In subsequent years, as additional data 
were obtained and research methods were refined, 
many linguists attempted to combine language fam- 
ilies into larger groupings, sometimes called stocks or 
phyla, representing possible, but unconfirmed, more 
remote historical relationships. This was done early 
for a number of California languages, where Dixon 
and Kroeber (1913) hypothesised the now controver- 
sial stocks of Hokan and Penutian. Edward Sapir (1929) 
proposed that all attested North American languages 
could be grouped into six phyla: Eskimo-Aleut, Na- 
Dene (expanded to include Haida), Algonkin-Waka- 
shan, Hokan-Siouan, Penutian and Aztec-Tanoan. 

Subsequent research tended to continue seeking 
broader relationships among groups of American 
Indian languages. Mary R. Haas was the most active 
researcher in this area in the 1950s and 1960s (see 
Haas 1969), and some of her proposals, together with 
others, were brought together in a revised version of 
Sapir’s scheme published by Voegelin and Voegelin 
(1965). This classification recognised seven phyla: 
American Arctic-Paleosiberian (connecting Eskimo- 
Aleut with Chukchi-Kamchatkan), Na-Dene, Macro- 
Algonquian, Macro-Siouan, Hokan, Penutian and 
Aztec-Tanoan, with nine smaller units ranging from 
the Salish and Wakashan families to the Beothuk and 
Kutenai language isolates. 

In later years, however, scholars came increas- 
ingly to recognise that many of the hypotheses could 
not be validated on the basis of the evidence available, 
and the pendulum began to swing back toward the 
recognition of a larger number of distinct families. 

In spite of mainstream scholars’ increasing 
demands for adequate evidence of proposed remote 
linguistic relationships, Joseph Greenberg (1987) 
went far beyond Sapir’s ‘superstocks’ in his proposal 
that all Native American language families, except 
‘Na-Dene’ and Eskimo-Aleut, belong to a single 
macro-family, which he called ‘Amerind’ based on his 
method of multilateral comparison. Amerind is 
rejected by nearly all specialists in Native American 
languages. They maintain that valid methods do not 
at present permit reduction of North American 
Indian languages to fewer than about 58 independent 
language families and isolates. Amerind has been 
criticised on various grounds. Where Greenberg stops 
— after assembling superficial similarities and declar- 
ing them due to common ancestry — is where other 
linguists begin. Since such similarities can be due to 
chance, borrowing, onomatopoeia, sound symbolism, 
nursery words [the mama, papa, nana, dada, caca 
sort] and misanalysis, for a plausible proposal of lin- 
guistic relationship one must attempt to eliminate all 
these other possible explanations, leaving a shared 
common ancestor as the most likely. Greenberg made 
no attempt to eliminate these other explanations, and 
the similarities he amassed appear to be due mostly to 
accident and a combination of these other factors. In 
various instances, Greenberg compared arbitrary 
segments of words, equated words with very different 
meanings (for example, ‘excrement/night/grass’), 
mis-identified many languages, failed to analyse the 
morphology of some words and falsely analysed that 
of others, neglected regular sound correspondences, 
failed to eliminate loanwords, and ignored or misinterpreted 
well-established findings. There are, moreover, exceedingly 
many errors in Greenberg’s data. In short, it is 
with good reason that Amerind has been rejected. 

Many similarities among languages result not 
because they are inherited from a common parent 
language, but because features have been borrowed 
among languages in contact — not only vocabulary, 
but also elements of phonology and grammar. This 
has taken place especially in certain geographically 
and culturally defined areas, such as the Northwest 
Pacific Coast; thus we speak of ‘areal’ groupings of 
languages as something distinct from genetic (family) 
groupings. Such groupings are not represented in this 
atlas, but are discussed in Campbell (1997: 330-52) 
and Mithun (1999: 314-17). 

Many proposals for more inclusive groupings 
have been made, summarised in Goddard (1996b: 
308-23), Campbell (1997: 66-80) and Mithun (1999: 
301-10). 


1.4 Loss of diversity 


Of the 58 classificatory units, exactly half (29) are 
now extinct — 16 having become extinct before 1930 
and 13 since then. Much of this loss is due to the 
extinction of isolates, only six of which survive, 
although half of the 18 minor families are also 
extinct. In addition, several branches of major 
families have been lost: Southern New England 
Algonquian within Algonquian; Tsamosan within 
Salishan; Costanoan within Utian; Cochimi within 
Cochimi-Yuman; and Catawban within Siouan- 
Catawba. This loss of diversity will almost certainly 
accelerate in the near future. A major subdivision of 
Na-Dene will be extinguished with the death of the 
one remaining speaker of Eyak, and five of the nine 
remaining small language families (Chinookan, 
Maiduan, Palaihnihan, Wintuan, and Yukian) survive 
in the speech of fewer than a score of very elderly 
people. 


1.5 Catalogue of languages 


Accurate information on the current state of lan- 
guages and the number of speakers in North Ameri- 
can communities is difficult to come by. Most of the 
information in the list of languages below was 
gathered by Victor Golla between 1999 and 2001 
directly from the linguists and educators best placed 
to know the relevant facts about the various speech 
communities. 


Abenaki. See Eastern Abenaki, Western Abenaki 


Achumawi [196] is one of the two languages of the 
Palaihnihan family of northeastern California; the 
other is Atsugewi, which has been extinct since 1988. 
Both Achumawi and Atsugewi are heritage languages 
of the Pit River Tribe, which is organised into 11 
bands representing the tribal groups with traditional 
territory along Pit River. Achumawi, the language 
of nine of these bands, is spoken by fewer than ten 
elderly people, most of them semi-speakers or passive 
speakers. There are noticeable differences among local 
varieties. Instructional materials were prepared for a 
language programme in the 1980s but this programme 
is apparently no longer in operation. 


Acoma-Laguna [258] is the Keresan language spoken 
by members of the Pueblos of Acoma and Laguna in 
northwestern New Mexico, west of Albuquerque. 
Acoma-Laguna is partially intelligible to speakers 
of most Rio Grande Keresan varieties, but is usually 
considered a separate language. At Acoma about half 
of the total population of approximately 4,000 are 
speakers; most are over 30. No children are acquiring 
the language. A bilingual education programme 
existed for a number of years at Sky City Elementary 
School, but was discontinued around 1990. Retention 
efforts were renewed in 1997 and a summer immer- 
sion camp was started in 1998. At Laguna there are 
about 2,000 speakers out of a total population of 
approximately 7,000; most are over 40. No children 
are acquiring the language. Laguna is taught as a 
second language at Laguna Elementary School, the 
Pueblo sponsors a language preservation project, 
and a dictionary is in preparation by the Summer 
Institute of Linguistics. 


Adai [294] was originally spoken in west-central 
Louisiana, between the Red and Sabine Rivers, and 
during the eighteenth century it was the language of 
the Spanish mission of Adayes, west of Natchitoches. 
After the closing of the mission in 1792 the remnants 
of the tribe migrated to Texas and apparently joined 
one or more Caddoan groups. The language is known 
from a single vocabulary of 275 words collected in 
1806 or 1807. Although it was earlier thought to be 
Caddoan, Adai is now considered an isolate. 


Ahtna [66] is the language of eight communities along 
the Copper River and in the upper Susitna and Nenana 
drainages in south-central Alaska. The total Ahtna 
population is about 500 with perhaps 80 speakers. 


Ajachemem. See Luiseño 
Akimel O'odham (Pima). See O'odham 


Alabama [303] is an Eastern Muskogean language 
whose speakers were in north-eastern Mississippi in 
the sixteenth century and later moved to east central 
Alabama, where they became politically allied with 
the Creek Confederacy. Most of them moved to 
Spanish Louisiana and Florida with the British take- 
over in 1763, but some remained behind and were 
removed to Indian Territory in the 1830s. At present 
Alabama is spoken by 250 to 300 residents of the 
Alabama-Coushatta Indian Reservation near Living- 
ston, in the Big Thicket area of East Texas. Alabama 
speakers share the reservation with a smaller number 
of Koasati (Coushatta) speakers, and some indi- 
viduals have learned to speak or understand both of 
these related languages. Until recently there were also 
a few elderly speakers of Alabama among the 900 
enrolled members of the Alabama-Quassarte Tribe, 
an administrative subdivision of the Muskogee Creek 
Nation, in Okfuskee County, Oklahoma. In Texas, 
Alabama is the language of choice among those 
50 and above and is used at home and at the Senior 
Citizen Center. The youngest speakers are probably in 
their teens. 


Aleut [12] is the only language of the Aleut branch of 
the Eskimo-Aleut family. Its speakers are indigenous 
to the Aleutian Islands, the Pribilof Islands, and the 
Alaska Peninsula west of Stepovak Bay. The only 
major internal division occurs at Atka Island, separat- 
ing Eastern from Western dialects. The traditional 
ethnonym is Unangan (‘Aleuts’ was introduced by 
Russian explorers, who used the same term for the 
Pacific Yupik Eskimos). Of a current population of 
about 2,200 Aleuts, about 150-160 speak the lan- 
guage. In the early nineteenth century Russian 
Orthodox missionaries promoted native literacy and 
helped foster a remarkably bilingual society. The most 
notable of these missionaries, Ivan Veniaminov, 
developed a writing system and translated religious 
material into Aleut. 


Alsea [151] and Yaquina [152], the two closely related 
languages that constitute the Alsean family, were formerly 
spoken by adjacent tribes in a small territory 
on the central coast of Oregon. The remnants of both 
tribes were removed to the Siletz Reservation in 1875, 
where they were absorbed into the heterogeneous 
population and their languages fell into disuse. Some 
Yaquina vocabulary was documented in the 1880s, 
but the language appears to have become extinct soon 
afterward. Alsea was more thoroughly documented, 
primarily by Frachtenberg in 1910-13, who published 
a volume of narrative texts with a lexicon. Much 
archival material exists, including a grammatical 
sketch. At least one speaker survived into the 1940s, 
but the language has been extinct for at least 50 years. 


Alutiiq. See Pacific Yupik 
Antoniano. See Salinan 


Apalachee [302] was an Eastern Muskogean language 
that was spoken in the late seventeenth century in 
northern Florida around present-day Tallahassee and 
Apalachee Bay. It is attested mainly in a letter written 
in 1688 to Charles II of Spain, published in 1860, the 
original of which is lost. This difficult document has 
been analysed in detail by several scholars and a 
vocabulary of about 120 items recovered. 


Aranama [266] was spoken at the Franciscan mission 
of Espíritu Santo de Zúñiga, established in 1754 on 
the lower Guadalupe River in southeastern Texas. 
Only two words of Aranama are attested, and its rela- 
tionship is unknown. 


Arapaho [29] is an Algonquian language that was 
spoken by three independent bands on the High 
Plains south of the Missouri River at the beginning of 
the nineteenth century. One band soon consolidated 
with the others, but the Staetan band remained 
independent for a few decades and some words of its 
distinct dialect, called Besawunena, were still remem- 
bered as late as the 1930s. Modern Arapaho, presum- 
ably originally only the dialect of the dominant band, 
became the language of the descendants of all three of 
these Northern Arapaho bands as well as of the 
Nawathinehena (or Southern Arapaho), a politically 
allied tribe that originally spoke a distinct but 
closely related language. Consequently, Arapaho is the 
heritage language of both the Northern Arapaho 
(Arapahoe) of the Wind River Reservation in central 
Wyoming and of the Southern Arapaho members of 
the Cheyenne-Arapaho Tribe in west-central Okla- 
homa. Approximately 1,000 of the 5,953 Northern 
Arapaho tribal members are first-language speakers, 
most over 50; there are no first-language speakers of 
Arapaho in Oklahoma. 


Arikara [289] is a Northern Caddoan language, 
formerly spoken in earthlodge villages along the 
Missouri River in central and north-central South 
Dakota, downstream from the Mandan villages. The 
modern Arikara, one of the Three Affiliated Tribes, 
share the Fort Berthold Reservation in North Dakota 
with the Mandan and Hidatsa. They now live in the 
Eastern Segment of the reservation, primarily in the 
communities of White Shield and Parshall. There are 
fewer than ten fluent speakers, all elderly, in a popula- 
tion of approximately 2,000. 


Assiniboine (Nakon) [275] is an emergent language in 
the Dakotan dialect complex of Siouan, spoken (with 
little variation) on two reservations in Montana — 
Fort Belknap and Fort Peck — and on three reserves in 
Saskatchewan — Whitebear, Carry the Kettle, and 
Mosquito-Grizzly Bear’s Head. Other Indian lan- 
guages are present in all of these communities. On the 
Saskatchewan reserves Cree is widely spoken and 
many of the Assiniboine speakers are fluent in it. In 
Montana, Assiniboines share the Fort Belknap Reser- 
vation with the Algonquian-speaking Gros Ventre 
(Atsinas), and the Fort Peck Reservation with 
speakers of Sioux. In a total Assiniboine population 
of approximately 3,500 there are no more than 150 
first-language speakers, none under 40 and most 
elderly. Courses in Assiniboine are taught at Fort 
Belknap Community College. 


Atakapa is the general term for the languages and dialects 
of the Atakapan family, formerly spoken for 200 
miles along the Gulf of Mexico between Galveston 
Bay in Texas and Vermilion Bay in Louisiana. There 
were several dialects, clustering in two emergent 
languages, Western Atakapa and Eastern Atakapa. 
Western Atakapa was extensively documented by 
Gatschet in 1885, and a dictionary, grammatical 
sketch, and collection of texts were published. The 
last speakers of any variety of Atakapa died early in 
the twentieth century. Although a few scholars think 
Atakapa belongs to a ‘Gulf’ stock, most consider it an 
isolate. 


Atsina. See Gros Ventre 


Atsugewi [197] is one of the two languages of the 
Palaihnihan family of northeastern California. It is 
the heritage language of two of the eleven bands that 
constitute the Pit River Tribe, the Atsugewi of Hat 
Creek and the Aporige (Apwaruge) of Dixie Valley. 
Atsugewi has been well documented in recent 
decades, principally by Talmy, who published a major 
study of its semantic structure. There have been no 
known speakers since 1988, and no revitalisation 
effort is under way. 


Attikamek (Tête de Boule, Atikamekw) [20] is the 
Cree dialect spoken on the Manouane/Manuan, 
Obedjiwan/Obidjewan, and Weymontachingue/ 
Wemontachie Reserves, north of Trois-Rivières in 
south-central Quebec. The entire native population 
of about 3,000 is fluent in the language, and most 
children are monolingual before entering school. 


Awaswas. See Costanoan 


Babine (or Bulkley Valley/Lakes District Language) 
[90] is the Athabaskan language spoken on Bulkley 
River and in the Lake Babine area of central British 
Columbia, to the north and west of the Carrier dialect 
complex. Although there is a tradition of grouping 
Babine with Carrier (it has sometimes been referred 
to as ‘Northern Carrier’), there is a sharp linguistic 
and cultural boundary between the two speech 
communities. Babine has two clearly differentiated 
dialects. The western dialect (usually called Wetsu- 
wet'en) includes the Bulkley River communities 
(Hagwilget, Moricetown, Smithers, Houston, and 
Broman Lake) and the Nee-Tahi-Buhn and Skin Tayi 
bands at Burns Lake. It has about 100 fluent speakers, 
none of them children. An additional 100 or more are 
passive speakers, including a few children. Although 
Wetsuwet'en has been offered for credit by the Uni- 
versity of Northern British Columbia since 1996 and 
is taught in local schools in Moricetown and Smith- 
ers, there is little indigenous literacy. The eastern 
dialect (‘Babine proper’) includes the Lake Babine 
and Takla Lake communities as well as former resi- 
dents from Lake Babine who have settled in Burns 
Lake. It has up to 200 speakers of all degrees of fluency 
out of a total population of 250. However, there 
are few speakers under 25; although some children have 
a passive knowledge, none are active speakers. 


Bannock. See Northern Paiute 


Barbareño [230] is the Chumash language once 
spoken at Santa Barbara mission and along the 
southern coast of Santa Barbara County from Point 
Conception to Carpinteria. The last Chumash 
speaker, a fluent Barbareño, died in 1965. 


Bay Miwok, sometimes referred to as Saclan, was a 
language of the Eastern Miwok branch of the Utian 
family, and was formerly spoken in parts of Contra 
Costa County, California, west of Mt. Diablo. About 
80 words are documented in a manuscript from the 
Mission period. 


Bearlake [86] is an emergent Athabaskan language 
within the North Slavey group of Slavey dialects of 
the Dene complex. It is spoken as a first language by 
about 580 people (450 of whom use it actively at 
home) in two communities in the Northwest Terri- 
tories, Déline, formerly Fort Franklin (460 speakers 
out of a total population of 615), and Tulita, formerly 
Fort Norman (up to 120 speakers out of 450 total). At 
Déline, Bearlake is the lingua franca of a dialectally 
mixed community and many speakers are also fluent 


in Dogrib. At Tulita, an unknown number of the 
speakers of Bearlake are also fluent (or primarily flu- 
ent) in Mountain. 


Beaver [83] is an Athabaskan language, spoken in 
eastern British Columbia (in the communities of 
Doig, Blueberry, Hudson Hope, and Prophet River) 
and in northwestern Alberta (in the communities 
of Horse Lakes, Clear Hills, Boyer River, and Rock 
Lane). There are about 300 speakers. Although Beaver 
is partially intelligible to speakers of emergent lan- 
guages in the Dene dialect complex, for political and 
geographical reasons it is not usually considered a 
Dene language. 


Bella Coola [121] is a Salishan language spoken by 
members of the Bella Coola Band in a single com- 
munity (Bella Coola) on the north-central coast of 
British Columbia. The term Nuxalk (derived from the 
native name for Bella Coola Valley) was adopted by 
the Band around 1980 to designate the language 
and its speakers. Isolated geographically from other 
Salishan languages, Bella Coola has been heavily 
influenced by the Wakashan languages that adjoin 
it (Haisla, Heiltsuk-Oowekyala). There are about 20 
native speakers, the youngest in their sixties. 


Beothuk [322] was the aboriginal language of New- 
foundland. Although the English began settling the 
island in the early sixteenth century and were in con- 
tact with the original inhabitants for over 300 years, 
Beothuk is attested in only three vocabularies, total- 
ling about 325 words, all collected between 1791 and 
1828. These lists, furthermore, are error-ridden and 
difficult to assess. The last known Beothuk survivor 
died in 1829. It has long been conjectured that 
Beothuk is distantly related to the Algic family, but 
the Beothuk data are too poor to allow a definite con- 
clusion and the language is best considered an isolate. 


Biloxi [284] was a language of the Southeastern sub- 
group of Siouan, spoken in the seventeenth and early 
eighteenth century on the lower Pascagoula River and 
Biloxi Bay in southern Mississippi. The language was 
extensively documented by Dorsey in 1892-93 from 
a remnant of the tribe living in Rapides Parish, 
Louisiana. A search for speakers in 1934 was fruitless, 
although a short wordlist was collected from a 
woman who had spoken the language in her youth. 


Blackfoot [14] is an Algonquian language of the 
northern High Plains, spoken principally on the 
Blackfoot, Peigan, and Blood Reserves in southern 
Alberta, and on the Blackfeet Reservation in north- 
western Montana. There are three shallowly differen- 
tiated dialects, representing old tribal subdivisions: 
Siksika, spoken primarily on the Blackfoot Reserve; 
Kainaa, or Blood, spoken on the Blood Reserve; and 
Piegan (spelled ‘Peigan’ in Canada), spoken on the 
Peigan Reserve in Alberta and the Blackfeet Reserva- 
tion in Montana. While the number of fluent 
speakers of Blackfoot has declined in the past gener- 
ation, there are still several thousand speakers of the 
language, including hundreds who no longer reside 
on tribal land. In Canada, 5,605 first-language 
speakers of Blackfoot were counted in the 1996 cen- 
sus, out of a total combined Band membership of 
over 15,000. In some locations Blackfoot remains the 
principal means of communication for older adults. 
In the United States the 1990 census counted 1,062 
first-language speakers in a tribal enrolment of 
approximately 13,000. 


Buena Vista Yokuts [181], formerly spoken in at 
least two local varieties around Buena Vista Lake in 
Kern County, California, was a distinctive emergent 
language in the Yokuts complex. Its vocabulary is 
partially documented in several wordlists. There have 
been no speakers since the 1930s. 


Bulkley Valley/Lakes District Language. See Babine 


Caddo [293] is the sole surviving member of the 
Southern branch of the Caddoan family, remotely 
related to members of the Northern branch. The 
modern Caddo Tribe was formerly an aggregate of 
numerous autonomous bands speaking distinctive 
dialects, and organised into at least three confederacies 
that were distributed over a vast area of eastern 
Texas, southeastern Oklahoma, southwestern Arkan- 
sas, and northern Louisiana. During the nineteenth 
century the remnants of those groups settled in pres- 
ent Caddo County, Oklahoma, primarily in the vicin- 
ity of Anadarko and Binger. Currently, the language is 
spoken by fewer than 25 elderly members of the tribe, 
and only remnants of the former dialectal diversity 
survive. 


Cahita. See Yaqui 


Cahuilla [244] is a Uto-Aztecan language of the Takic 
subfamily, the heritage language of several small 
tribes in the inland area of southern California, 
including people at Morongo, Agua Caliente (Palm 
Springs), Cabazon, Augustine, Torres-Martinez, Santa 
Rosa, Cahuilla, Ramona, and Los Coyotes. Originally 
there were probably three dialects of Cahuilla: Moun- 
tain, Desert, and Malki. There are approximately 30 
fluent first-language speakers of Cahuilla in a com- 
bined reservation population of about 2,300. 


Calusa [310] was a language spoken in the sixteenth 
century in the far southwest of Florida, directly 
attested only in a handful of words cited in a book 
published in 1575. The data are too scant to allow the 
language to be classified. 


Carolina Algonquian [61], also known as Pamlico, 
was the Algonquian language spoken in the vicinity 
of Pamlico Sound, North Carolina. It is known only 
from two short word lists, one from the Roanoke 
colony in the 1580s, the other published in 1709. The 
language became extinct in the eighteenth century. 


Carrier [91] is the general term for a complex of 
Athabaskan dialects in central British Columbia, 
adjoining (but clearly distinct from) Babine on the 
northwest and Chilcotin on the south. Carrier (locally 
called Dakelh) is spoken in a number of local varieties, 
traditionally divided into ‘Upper Carrier’ (the com- 
munities to the north of Fort St. James, around Stuart 
and Trembleur Lakes) and ‘Lower Carrier’ in com- 
munities to the south. More recent research indicates 
that Lower Carrier should be split into a Fraser/ 
Nechako dialect group (Prince George, Cheslatta, 
Stoney Creek, Nautley, and Stellakoh) and a Blackwa- 
ter dialect group (Ulkatcho, Kluskus, Nazko, Red 
Bluff, and Anahim Lake). A Carrier lingua franca was 
established by Catholic missionaries (most notably 
Father A. G. Morice) in the nineteenth century, based 
on the dialect around Fort St. James, and a syllabic 
writing system introduced. 


Catawba [286] and Woccon [287] are the two lan- 
guages for which there is documentation in the 
Catawban branch of the Siouan-Catawba family. As 
a political unit the Catawba tribe was formed in 
the first half of the eighteenth century by the consoli- 
dation of many small peoples of North and South 
Carolina. Some of these probably spoke additional 
Catawban languages while others did not, and they 
were joined by refugee groups originally from else- 
where in the southeast. After conflict and disease 
decimated the tribe, some fled to the Choctaws in 
Oklahoma, and others joined the Cherokee. The 
Catawba Nation of York County, South Carolina, 
remains on a reservation near their old homeland, 
where the language continued to be spoken through 
the mid-twentieth century. The last speakers retained 
knowledge of two dialects, Esaw and Saraw, the last 
hint of the linguistic diversity that had included more 
than twenty dialects and languages in 1743. 


Cathlamet [149] was the Chinookan language for- 
merly spoken along the lower Columbia River from 
the vicinity of Astoria upstream about 50 miles. It was 
documented by Boas in the 1890s, primarily in narra- 
tive texts. The language has been extinct since shortly 
after the beginning of the twentieth century. 


Cayuga [316] is a Northern Iroquoian language, 
originally spoken by a tribe of the Iroquois Confederacy 
(Six Nations) situated west of the Onondagas and 
east of the Senecas, between Cayuga and Owasco 
Lakes. After the American Revolution many of 
the Cayugas fled to Canada, where their modern 
descendants make up part of the population of the 
Six Nations Reserve at Grand River, Ontario. There 
are about 100 first-language speakers of Cayuga in 
Ontario, the youngest around 40 years of age. Other 
Cayugas joined the Seneca, where their language was 
gradually replaced by Seneca, and yet others moved 
westward with other Iroquois, eventually settling in 
northeastern Oklahoma. A dialect of Cayuga was 
spoken in Oklahoma as late as the 1980s but is now 
apparently extinct. 


Cayuse [270] was spoken in the early nineteenth cen- 
tury in the plateau region of northeastern Oregon 
and southeastern Washington. It is an isolate; an earlier 
classification that linked Molala (now considered 
Plateau, also called Plateau Penutian) and Cayuse in 
the ‘Waiilatpuan’ family has been abandoned. By 
1850 the remnants of the Cayuse had settled among 
the Nez Perce and the Umatillas, where the Cayuse 
language fell out of use. A few people retained some fluency in the 
language as late as the early 1930s, but Cayuse was 
totally extinct by the 1960s. 


Central Alaskan Yup'ik [4] (the apostrophe denotes 
a long p) is the most vigorously surviving Native lan- 
guage in Alaska. Of a total population of about 
21,000, 10,000 are speakers, and the language is still 
being acquired by children in 17 of the 68 villages in 
which it is spoken. Five regional dialects can be dis- 
tinguished, one spoken by a majority of speakers and 
four minority dialects. 


Central Pomo [202], a language of the Pomoan 
family of Northern California, is spoken by a very few 
speakers on the California coast at Point Arena- 
Manchester and about 40 miles inland in the Hop- 
land area. 


Central Siberian Yupik [3] is spoken in Alaska in 
two villages on St. Lawrence Island, Gambell and 
Savoonga. Almost the entire adult population (1,200 
on the island, about 200 more in Nome and Anchor- 
age) speaks the language, although a decreasing 
number of children acquire it as their first language. 
The St. Lawrence Island dialect is nearly identical to 
the Yupik spoken on the Chukotka Peninsula on the 
Russian side of the Bering Strait, largely concentrated 
in the villages of Novoye Chaplino, Sireniki, Provi- 
deniya, and Uel'kal'. There are about 300 Russian 
speakers of Central Siberian Yupik in a population of 
about 900, none under 30. 


Central Sierra Miwok. See Eastern Miwok 


Central Southern Ojibwe is an emergent language of 
the Ojibwayan dialect complex, closely related to 
Saulteaux in Manitoba and to Eastern Ojibwe and 
Ottawa in Michigan and Southern Ontario, a rela- 
tionship which reflects the historical dispersion of 
Southern Ojibwe speakers across the Great Lakes and 
beyond during the fur trade from the seventeenth to 
the nineteenth century. Central Southern Ojibwe 
is spoken in a large number of communities in 
Northern Ontario, Michigan, Wisconsin, and Min- 
nesota, as well as on reservations in North Dakota 
and Montana. These include 25 separate tribal 
entities in the United States and a roughly equal 
number of communities in Canada. Different groups 
refer to themselves and their language as Ojibwe, 
Ojibway, Chippewa, Chippeway, and Anishinabe, but 
local varieties do not differ substantially. 


Chalon. See Costanoan 
Chaplinsky Yupik. See Central Siberian Yupik 


Chemakum [119], one of the two languages of the 
Chemakuan family, was spoken in a small territory 
between Hood Canal and Port Townsend, on the east 
side of the Olympic Peninsula in northwest Washing- 
ton. In the nineteenth century, when it was first 
noted, Chemakum was being replaced by Salish 
(Klallam and Lushootseed) and was not well docu- 
mented. Boas found three speakers in 1890, and the 
language was extinct by the 1940s. 


Chemehuevi. See Ute-Chemehuevi 


Cherokee [321] is an Iroquoian language spoken in 
three divergent dialects by up to 10,000 of the more 
than 122,000 members of the Cherokee Nation of 
Oklahoma, and about 1,000 of the approximately 
10,000 members of the Eastern Band of Cherokees in 
North Carolina. In addition, an undetermined — but 
relatively high — percentage of the 7,500 members of 
the United Keetoowah Band of Oklahoma and 
Arkansas are speakers of the Oklahoma variety. A 
number of the speakers reported for the Cherokee 
Nation may in fact be Keetoowahs; the political 
independence of the Keetoowahs is in dispute and the 
membership rolls of the two tribes overlap. The Kee- 
toowah population is largely rural and culturally con- 
servative, and at least a few children are reported to 
be fluent speakers. There were three major dialects of 
Cherokee spoken in the eighteenth and early nine- 
teenth century: Lower (Elati), Middle (Kituhwa), and 
Western or Overhill (Otali). The Lower dialect, which 
had /r/ where the others have /l/, ceased to be spoken
by the end of the nineteenth century. The Overhill 
dialect, from the upper Tennessee Valley, is the variety 
spoken today in northeastern Oklahoma, where most 
Cherokees were forced to relocate in the late 1830s. 
The Middle dialect is used on the Qualla Boundary 
reservation, North Carolina, by descendants of 
Cherokees who resisted removal. The third modern 
dialect, spoken in the Snowbird community near 
Robbinsville, North Carolina, combines features of 
the Western and Middle dialects. Cherokee is written 
in a traditional syllabic orthography, devised by 
George Guest (Sequoyah) in the 1820s and later pro- 
moted by missionaries and progressives in the Chero- 
kee Nation both before and after removal. The 
Sequoyah syllabary remains a badge of Cherokee tri- 
bal identity, both in Oklahoma and North Carolina, 
but is no longer in active use for general literacy. 


Cheyenne [31] is an Algonquian language of the High 
Plains, spoken on the Northern Cheyenne Reserva- 
tion in southeastern Montana, and in scattered com- 
munities in central Oklahoma. There are a number of 
differences between the Northern Cheyenne and 
Southern Cheyenne dialects, but they are not signifi- 
cant. The language is widely spoken in Montana, with 
about 1,700 first-language speakers out of a total 
Cheyenne population of 4,000, including at least 
some children. There are an additional 400 speakers 
in Oklahoma, most of them middle aged or older. 


Chickasaw [300] is a Western Muskogean language, 
closely related to Choctaw, spoken by about 600 
members of the Chickasaw Nation of south-central 
Oklahoma. The youngest speaker is in her mid-40s,
though most are in their 50s or older. The language is 
in use among those who are middle-aged and elderly, 
and is not being learned by children or by second- 
language learners. 


Chilcotin (Tsilhqot'in) [92] is an Athabaskan language
of south-central British Columbia, spoken in
several communities along the Chilco and Chilcotin 
Rivers in the vicinity of Williams Lake, including 
Alexis Creek, Anaham, Nemaiah Valley, Stone, and 
Toosey, as well as at Alexandria on the Fraser River. 
Although Chilcotin adjoins Carrier on the north and 
there are several communities in which both lan- 
guages are spoken, they are quite distinct and are not 
mutually intelligible. Until the 1980s Chilcotin had 
been considered to be in a relatively healthy state, 
with many children acquiring it as their first lan- 
guage; in 1979 the language was estimated to have 
1,725 speakers, a high percentage of the population. A 
survey conducted in 1988, however, showed that 
while a sizeable proportion of Chilcotin children over 
ten were speakers of the language, younger children 
spoke only English. Current estimates of the number 
of speakers range between 400 and 1,200, with the 
youngest in their mid-teens. 


Chimariko [191] was spoken in a small territory along 
the Trinity River and its tributaries in the mountain- 
ous interior of northwestern California. It is an isol- 
ate, but has been grouped in the controversial Hokan 
stock. At the time of contact many Chimarikos were 
bilingual in Hupa, the adjoining Athabaskan language,
and most Chimariko survivors joined the
Hupas. The language was extensively documented by 
Harrington in the 1920s, who worked with the last 
fluent speakers. None of his material has been pub- 
lished, although some earlier, less accurate material 
has. The last speaker died around 1950. 


Chinook. See Kiksht 


Chipewyan (Dene Sųłiné) [89] is an Athabaskan
language of the Dene complex spoken in a number of 
communities scattered across a large area in the forest 
and tundra of northern Alberta, Saskatchewan, Mani- 
toba, and the eastern Northwest Territories. Among 
the principal settlements are Cold Lake and Fort Chi- 
pewyan, Alberta, and Fort Resolution and Lutselk'e, 
NWT. Making an accurate estimate of the number of 
first-language Chipewyan speakers is difficult because 
many are also speakers of Cree. The Government of 
Canada estimates a total of 1,865 speakers, the major- 
ity in Alberta and Saskatchewan. A recent survey at 
Cold Lake found only 200 fluent speakers out of 
1,800–2,000 Band members, but the proportion is
much higher in some remote communities such as 
Wollaston Lake, Saskatchewan, where most children 
are reported to be more fluent in Chipewyan than in 
English. In the Northwest Territories, where Chi- 
pewyan has official language status, there are 370 
speakers (185 of whom use the language at home) in 
the communities of Lutselk'e, Ft. Smith, and Ft. 
Resolution. 


Chitimacha [297], the heritage language of the 
Chitimacha Tribe of Charenton, St. Mary Parish,
Louisiana, was originally spoken throughout the 
Bayou country at the delta of the Mississippi River. 
Although attested in several wordlists in the nine- 
teenth and early twentieth century, Chitimacha is 
primarily documented in the extensive data collected 
by Swadesh in 1932-34 from the last two fluent 
speakers, most of which remains unpublished. The 
last speaker died in 1940. 


Chiwere [281] is a moribund Siouan language of the 
Chiwere-Winnebago subgroup. Two tribal dialects 
can be distinguished: Otoe-Missouria (Jiwere) is the 
dialect of the Otoe-Missouria Tribe of the Red Rock 
region of north central Oklahoma. Iowa (Baxoje) is 
the dialect of the Iowa Tribe of Perkins, Oklahoma, 
and of the Iowa Tribe of Kansas and Nebraska at 
White Cloud, Kansas. The total Chiwere population is 
about 1,150, but in 1999 only four passive first- 
language speakers of the Otoe-Missouria dialect 
remained, none fluent. 


Chochenyo. See Costanoan 


Choctaw [299] is a Western Muskogean language, 
closely related to Chickasaw, with 9,000 to 11,000 
speakers in various locations in Mississippi, Okla- 
homa, and Louisiana. The Mississippi Band of 
Choctaws has around 5,000 fluent speakers in seven 
small communities scattered throughout the state 
(the tribal headquarters is in Philadelphia, Missis- 
sippi). The use of Mississippi Choctaw is vigorous at 
all ages, and many children are monolingual in 
Choctaw before attending school. The Choctaw 
Nation of Oklahoma counts at least 4,000 speakers 
among its more than 20,000 members, most of them
middle-aged or elderly.


Chumash is the general term for the six languages 
[227-232] of the extinct Chumashan family, formerly 
spoken along the south-central coast of California 
from Morro Bay to Malibu, as well as in the interior 
of Santa Barbara and Ventura Counties. Most of 
the languages are referred to by the Franciscan 
mission community with which they were associated. 
Obispeño [227], spoken at San Luis Obispo mission,
was quite distinct from the rest. The other languages, 
except for Island Chumash [232], may have been 
partially mutually intelligible and may have consti- 
tuted a dialect complex with emergent languages 
structured around the mission communities. All of 
the Chumash languages were poorly documented 
before the twentieth century, but after 1913 they 
became the focus of much of Harrington’s work.
He extensively documented all of the languages, and 
continued the work until his death in 1961. 


Clallam (Klallam) [129] is the Central Salish lan- 
guage of the north shore of the Olympic Peninsula in 
the State of Washington, closely related to Northern 
Straits Salish (the two languages are sometimes 
grouped together as ‘Straits Salish’). The principal 
Klallam communities are at three small reservations, 
Port Gamble, Lower Elwha, and Jamestown. There is 
also a Klallam community on the Becher Bay Reserve 
on Vancouver Island. There are very few first- 
language speakers remaining (two of them at Becher 
Bay), none fully fluent. The language is used in cere- 
monies and for tribal identity and there is consider- 
able interest in revival. 


Coahuilteco [261] was spoken in the eighteenth cen- 
tury at the Franciscan mission in San Antonio, Texas. 
It is primarily documented in an 88-page bilingual 
Spanish-Coahuilteco confessor’s manual published 
in Mexico in 1760, which apparently represents one 
dialect (Pajalate) of a language spoken widely in 
southern Texas and Coahuila. Although in the past 
Coahuilteco was often joined to several other extinct 
languages of Texas and Mexico in a postulated family 
called ‘Coahuiltecan,’ it is now considered an isolate. 
There is no attestation of the language after the end 
of the eighteenth century. 


Coast Miwok [165] was a language of the Western 
Miwok branch of the Utian family formerly spoken in 
Marin County and southern Sonoma County, Cali- 
fornia, immediately to the north of San Francisco. 
The Coast Miwoks were brought into the Franciscan 
missions at San Rafael and Sonoma and their culture 
and language largely destroyed before 1835. Rem- 
nants of the language, mostly vocabulary, were col- 
lected from survivors between 1840 and 1964. Several 
local dialects were represented, with the Bodega Bay 
dialect (Bodega Miwok) somewhat divergent from 
the others (Marin Miwok). The last known person to 
have direct knowledge of the language died in the 
1970s. A compilation has been made of the extant 
documentation of Marin Miwok, and a dictionary of 
Bodega Miwok has been published. 


Coast Tsimshian. See Maritime Tsimshian 


Cochimi was a chain of dialects that were formerly 
spoken in the central portion of the peninsula of Baja 
California, from about 150 miles south of the US 
border to about 200 miles north of Cabo San Lucas. 
Cochimi is a branch of the Cochimi-Yuman family, 
coordinate with the Yuman languages. Two dialect 
clusters or emergent languages can be identified: 
Northern Cochimi [218], spoken by nomadic bands 
in the extremely arid Central Desert and later associ- 
ated with the missions at Santa Maria Cabujaca- 
mang, Santa Gertrudis, and San Francisco de Borja 
(Borjeño); and Southern Cochimi [219], spoken by
more settled groups at the oases and in the highlands
south of the 28th parallel and best attested from the
missions of San Javier Viggé, San José Comondu, and
San Ignacio Kadakaaman (Cadegomeño). Laimén
was apparently a variety of Cochimi, though its identity
is sometimes disputed. Monqui (Monqui-Didit)
is sometimes grouped with Cochimi and sometimes
with Guaicurian, leaving its status unclear. Most of the
scanty documentation of Cochimi comes from the Jesuit
missions (1697–1767), although there are some nineteenth-century
vocabularies. As late as 1925 Harrington was able to 
find a person who remembered a few Northern 
Cochimi forms, but the language was essentially 
extinct by that time. 


Cochiti. See Rio Grande Keresan 


Cocopa [216] is a Yuman language, which together 
with Diegueño forms the Diegueño-Cocopa subgroup
of the family. It was originally spoken by the
people of the lowermost Colorado River and its delta. 
It is spoken today by between 150 and 300 of the 
approximately 700 members of the Cocopah Tribe, 
who have a reservation near Yuma, Arizona, and an 
equal or greater number of Mexican Cucapas in
communities in Baja California and Sonora. In Arizona,
most Cocopas over 50 are fluent, and a number
of younger people are semi-speakers, including at 
least some children. 


Coeur d’Alene [142] is a moribund Interior Salish lan- 
guage spoken on the Coeur d'Alene Reservation in 
northern Idaho. There are only four surviving first- 
language speakers, ranging in age from the mid-70s to 
101 (the oldest member of the tribe). 


Columbian [143] is an Interior Salish language, 
originally spoken in a number of local dialects along 
the Columbia River in north-central Washington. 
Most of the approximately 25 remaining first-lan- 
guage speakers refer to their dialect as Nxa7amxcin 
and live on or near the Colville Reservation. All are 
elderly, ranging in age from late 60s to mid-80s with 
most well over 70. A few other speakers elsewhere 
may represent other dialects. 


Colville. See Okanagan 


Comanche [237] is a Central Numic language, for- 
merly spoken in the southern Plains from Kansas and 
Colorado to the Rio Grande. Before the eighteenth 
century the ancestors of the Comanches were Shos- 
honi speakers in what is now Wyoming, but the 
Comanche dialect became quite distinct after the 
groups separated, and they are now mutually intelli- 
gible only with difficulty. In the late nineteenth cen- 
tury the Comanches were placed on reservation lands 
in southwestern Oklahoma, north of Lawton, where 
the tribe maintains a current membership of about 
8,500. No more than 100 are speakers of Comanche, 
all older than 50. 


Comecrudan was a family of three languages that 
were formerly spoken along the lower Rio Grande 
River in southern Texas and northern Mexico. Come- 
crudo is documented by two short wordlists collected 
in 1829 and 1861 and by a substantial vocabulary col- 
lected in 1886. Mamulique and Garza are attested in 
one short wordlist from 1829. Although earlier 
thought to be part of a postulated ‘Coahuiltecan’ 
family, Comecrudan is not now considered to be 
related to any other family. 


Comox [122] is a Central Salish language, spoken at 
the northern end of the Strait of Georgia in British 
Columbia, both on the mainland and on the east 
coast of Vancouver Island. Island Comox and Main- 
land Comox dialects are recognised, the latter divided 
into three varieties associated respectively with the 
Homalco, Klahoose, and Sliammon Bands. The last 
fluent speaker of Island Comox died in the mid- 
1990s. Although in the early 1980s Mainland Comox 
was reported to be spoken fluently by about one-third 
of the population, in 2000 it was estimated that there 
were 60 or fewer active first-language speakers, most 
of them 55 or older, in a total population of 1,500. 


Coos. See Hanis


Copper Island Aleut [13] is a mixed language (Aleut 
and Russian) originally spoken in the village of 
Preobrazhenskoye on Copper (Mednyy) Island, Kam- 
chatka Province, Russia. The first settlers on Copper 
Island in the early nineteenth century were Russians, 
soon followed by Aleuts brought from the island of 
Attu. By 1969 the entire community had been 
relocated to the village of Nikol'skoye on Bering 
Island. The language is critically endangered, and 
only a handful of elderly speakers remain. Copper 
Island Aleut is a true mixed language. Its lexicon, 
nominal inflection and derivational morphology are 
derived largely from Aleut, while its verbal inflection 
and syntax are basically Russian. 


Costanoan is one of the two branches of the Utian 
family, and includes eight languages that were spoken 
in a compact area along the coast of California from 
north of San Francisco to south of Monterey. All 
Costanoan languages are extinct, and some are very 
poorly documented. Three of the four languages 
formerly spoken around San Francisco Bay, Karkin 
[172], Ramaytush [173] and Tamyen [175], have
been extinct since the mid-nineteenth century and
are known only through short vocabularies. The 
fourth language, Chochenyo [174], formerly spoken 
along the eastern shore of the bay, was moderately 
well documented by Harrington early in the twen- 
tieth century, although very little of his material has 
been published. Awaswas [176], spoken around 
Santa Cruz, and Chalon, spoken in the Salinas Valley 
near Mission Soledad, became extinct early on and
are very scantily attested. Mutsun [178] was the lan- 
guage of the area around Mission San Juan Bautista. 
It survived until the 1930s and is well documented, 
both from the Mission period and in the twentieth 
century by Harrington. The Mission materials have 
been published, and there is an unpublished gram- 
mar and dictionary based on Harrington’s materials. 
Rumsen (Rumsien) [179] was the language of the 
Monterey area. It is moderately well documented, 
mostly by Harrington, and a full dictionary has been 
prepared based on all extant Rumsen materials. 


Cotoname [265] was formerly spoken along the lower 
Rio Grande River in southern Texas and northern 
Mexico, adjacent to the Comecrudan languages. It is 
attested by two short vocabularies, collected in 1829 
and 1886. Although earlier thought to be part of a 
‘Coahuiltecan’ stock, Cotoname is now considered to
be an isolate. 


Cowlitz [135], one of the four languages of the
Tsamosan division of the Salishan family, was for- 
merly spoken along the Cowlitz River in southwestern 
Washington. Cowlitz descendants live in scattered 
locations in and around their former territory. The 
language is very poorly documented, and no speakers 
are known to remain. 


Cree-Montagnais [15-30] is a chain of Algonquian 
languages extending across Canada from the Rockies 
to the coast of Labrador. A major distinction is usu- 
ally drawn between the Cree languages in the west 
(Plains Cree [15], Woods Cree [17], Western 
Swampy Cree [18], Eastern Swampy Cree, Moose 
Cree, and Attikamek [20]) and the Montagnais- 
Naskapi languages of northern Quebec and Labrador 
(East Cree [21, 22], Western Montagnais [25], 
Central Montagnais [26], Eastern Montagnais [27], 
and Eastern Naskapi [24]). There is some degree of 
mutual intelligibility across the entire chain of lan- 
guages. Most dialects are thriving, and in some com- 
munities a few older speakers are monolingual. In the 
1996 Canadian census 102,215 individuals reported 
that a Cree-Montagnais dialect was their mother 
tongue, and about half of these (49,850) said that 
they used it as their home language. The language is 
strongest in the more northerly, isolated com- 
munities, many of which are not accessible by road. 


Creek (or Muskogee) [305] is an Eastern Muskogean 
language spoken by 4,000 to 6,000 residents of the
former territory of the Muscogee (Creek) Nation and 
Seminole Nation in east-central Oklahoma, and 
by fewer than 200 members of the Seminole Tribe of
Florida, most of them living on the Brighton Reserva- 
tion. The dialect of Creek spoken by the Florida Sem- 
inoles is distinct. There are also some differences 
between the dialects of the tribal groups in Okla- 
homa. As of 2001, the youngest speaker in Oklahoma 
was 18, the youngest in Florida 44. Creek is routinely 
used among those in their 60s and above, and is also 
widely used at church services, for hymns, and for 
ceremonial speeches. 


Crow [272], a Siouan language of the Missouri River 
subgroup, is spoken on the Crow Reservation in 
southeastern Montana and in adjacent off-reservation 
communities. Principal towns on the reservation are 
Crow Agency, Lodge Grass, Pryor, Wyola and St. 
Xavier, and the total tribal enrolment is about 8,500. 
There are 3,000–4,000 first-language speakers, mostly
over the age of thirty. There are a few children, teen- 
agers and young adults who are Crow speakers, and a 
larger number who are semi-speakers and/or under- 
stand the language. 


Cupeño [245], a language of the Cupan division of
the Takic subfamily of Uto-Aztecan, was originally
spoken near Warner’s Hot Springs, in Riverside
County, California, but in 1902 the Cupeños were
resettled with speakers of Luiseño on the rancheria at
Pala. The last fluent speaker of Cupeño died in 1987
at the age of 94, although several people still remem- 
ber a few words and phrases and there is one elderly 
semi-speaker. 


Dakelh. See Carrier 
Dakota. See Sioux 


Dakotan is a Siouan dialect complex, within which 
three relatively well defined dialect areas may be dis- 
tinguished: Stoney, currently spoken in southwestern 
Alberta; Assiniboine, in southern Saskatchewan and
northern Montana; and Sioux, spoken widely in 
North and South Dakota, Minnesota, and in southern 
Manitoba and Saskatchewan. Intelligibility between 
these dialect areas is low, although speakers can 
communicate after a while with some difficulty, and 
they are perhaps best considered emergent languages. 
Sioux is further divided into three major dialects, 
Santee-Sisseton (Dakota), Yankton-Yanktonai, and 
Teton (Lakota), between which there is a fair degree 
of mutual intelligibility. 


Delaware. See Munsee, Unami 


Dene, in older Canadian usage synonymous with 
‘Athabaskan’, is now used as the name of a complex
of Athabaskan dialects and emergent languages in the 
Mackenzie River drainage of northwestern Canada, 
primarily in the Northwest Territories but also 
extending into parts of northern British Columbia, 
Alberta, Saskatchewan, and Manitoba. Three major 
Dene languages are distinguished, Slavey, Dogrib, 
and Chipewyan. Slavey can be further divided into 
Bearlake and Hare (together constituting North 
Slavey), Mountain, and Slave or South Slavey. There 
is a moderate degree of mutual intelligibility across 
the Dene complex, but this extends as well to some 
adjacent Athabaskan languages (particularly Beaver, 
Sekani, and Kaska) that are not usually called Dene. 
The group is defined as much on a social and political 
basis (the Athabaskan languages of the Northwest 
Territories) as on a linguistic basis. 


Diegueño is the general term for a complex of Yuman
dialects spoken in southern California and northern 
Baja California. Three emergent languages are recog- 
nised within this complex: Ipai [213], or Northern 
Diegueño, is spoken by a small number of elderly
people in four communities in northwestern San 
Diego County, including Mesa Grande, Santa Ysabel, 
San Pasqual, and Barona. Kumeyaay [214] is spoken 
in several locations in central and southern San Diego 
County, the most important of these being Campo. 
There are between 40 and 50 fluent speakers. (In 
recent years the entire Diegueño dialect complex has
also been referred to as Kumeyaay, creating some 
confusion.) Tipai [215] is spoken by approximately 
100 people in several communities in northern 
Baja California, as far south as Ensenada and Santa 
Catarina, and also in California by the Jamul com- 
munity near San Diego. The distinction between 
Kumeyaay and Tipai is perhaps more political and 
social than it is linguistic, and is greatly influenced by 
the US-Mexican border. 


Ditidaht. See Nitinaht 


Dogrib [88] is an Athabaskan language of the Dene 
complex spoken in the Northwest Territories between 
Great Slave Lake and Great Bear Lake. It is the first 
language of 2,470 people (of whom about 1,350 regu- 
larly use it in the home), primarily in five small 
communities: Detah (105 speakers out of a total 
population of 190), Rae Lakes (210 out of 260), Rae- 
Edzo (1,010 out of 1,655), Snare Lake (100 out of 
135), and Wha Ti (325 out of 415). There are also 
about 220 speakers in the city of Yellowknife, as well 
as an unknown number of speakers in the dialectally 
mixed community of Déline (Fort Franklin), where 
Bearlake is the lingua franca. 


East Cape Yupik (or Naukan Yupik) [2] is a severely 
endangered Western Eskimo language of Siberia,
linguistically the link between Central Alaskan Yupik
and Central Siberian Yupik. Originally spoken in the 
village of Naukan on the easternmost point of East 
Cape (Cape Dezhnev), Chukchi Autonomous Dis- 
trict, Russia, facing the Diomede Islands in Alaska, 
the community was forcibly relocated in 1958 to the 
nearby villages of Lorino and Lavrentiya. The number 
of speakers may still be close to 200, but a shift to 
Russian is proceeding rapidly. 


East Cree [21-22] is the Montagnais-Naskapi lan- 
guage spoken in northwestern Quebec, along the east 
coast of James and Hudson bays and inland. There is 
a distinction between northern and southern dialects, 
the latter with coastal and inland varieties. The 
northern dialect is spoken in Whapamagostui (Great 
Whale River), Chisasibi (Fort George), and Wemindji 
(Paint Hills); southern dialects are spoken in 
Nemaska (Nemiscau), Waskaganish (Rupert House), 
Eastmain, Waswanipi, Ouje-bougamau and Mistis- 
sini. Of a population of about 12,000, all but a small 
number are speakers. 


Eastern Abenaki [49] is an extinct Eastern Algonquian 
language that was spoken in southern and central 
Maine. The only surviving Eastern Abenaki group is 
the Penobscot community at Old Town, on Indian 
Island north of Bangor, Maine, where the last fluent 
speaker of the Penobscot dialect died in 1993. A 
dialect originally spoken further west survived into 
the twentieth century at Bécancour, Quebec. The 
language was extensively documented by French 
Jesuit missionaries in the late seventeenth and early 
eighteenth centuries and by fieldwork in the twen- 
tieth century by Frank T. Siebert, Jr. The most exten- 
sive published sources are the dictionaries compiled 
by the Jesuits Sébastien Rale and Joseph Aubery. 


Eastern Canadian Inuktitut. See Inuktitut 
Eastern Eskimo. See Inuit 


Eastern Miwok, one of the two branches of the Miwok 
languages of north-central California, consists of five 
languages: Bay Miwok [167] and Plains Miwok [168] 
are extinct. East and south of the Plains Miwok, in the 
Sierra Nevada mountains as far south as Yosemite 
Valley, was the territory of the Sierra Miwoks. Their 
descendants (who refer to themselves as ‘Mewuk’)
now live in a scattering of small communities, the 
largest at the Jackson and Tuolumne Rancherias. Of 
the three emergent languages that are usually dis- 
tinguished in the Sierra Miwok dialect complex, the 
most visible today is Northern Sierra Miwok [169], 
spoken at the Jackson Rancheria near Westpoint. It is 
estimated that there are between 6 and 12 speakers, 
only one of whom has active conversational fluency. 
Both of the other two Sierra Miwok languages, Central
Sierra Miwok [170] and Southern Sierra Miwok [171],
have a few semi-speakers or passive speakers.


Eastern Naskapi [24] is a Montagnais-Naskapi dialect 
or emergent language that is spoken by the Mushuau 
Innu community of Labrador, which was relocated 
from Utshimassits (Davis Inlet) to Natuashish (Sango 
Bay) in 2002. It is the first language of the more than 
500 band members. It is quite distinct from the 
Eastern Montagnais spoken at Sheshatshiu, Labrador, 
having more features in common with Western 
Naskapi and Northern East Cree, but like Montagnais 
it is often referred to as Innu-aimun. 


Eastern Ojibwe [37] is an Algonquian language 
spoken in southern Ontario and Quebec in two 
major dialect clusters. The more westerly variety, 
historically called Mississauga in Canada, is spoken 
east of Georgian Bay, where the largest numbers of 
speakers are today on Parry Island and at Curve Lake, 
near Peterborough. It was formerly spoken in many 
other places both in Ontario west of Georgian 
Bay and in southern Michigan (where it was called 
Chippewa), areas in which it was generally replaced 
by Ottawa through language shift in the nineteenth 
and early twentieth centuries. The Michigan varieties 
were not sharply demarcated from Central Southern 
Ojibwe. The more easterly variety of Eastern Ojibwe
is the dialect called Algonquin at Golden Lake,
Ontario, and Maniwaki, Quebec. This was historically
the Nipissing dialect from north of Lake Huron, 
which emerged as the dominant form of speech 
at Oka and other missions near Montreal. This 
Southern Algonquin dialect must be distinguished 
from Northern Algonquin, which is very different. 


Eastern Pomo [200], a language of the Pomoan family 
of Northern California, is centered on the west side of 
Clear Lake. The language appears to have become 
extinct recently. 


Eastern Swampy Cree [19] (an Algonquian language of
the Cree-Montagnais branch) is spoken in Ontario on 
the west coast of Hudson Bay and James Bay at Fort 
Albany, Attawapiskat, and Peawanuck (where the 
Weenusk band moved after being flooded out of their 
village at Winisk in 1986). Most band members are 
speakers. 


Eel River Athabaskan [103] was a complex of closely 
related local dialects of California Athabaskan that 
were formerly spoken along the Eel River and its 
major tributaries in Humboldt and Mendocino 
Counties. At least four dialect clusters can be dis- 
tinguished: Sinkyone, Nongatl, Lassik, and Wailaki. 
Semi-fluent speakers of one or more Eel River Atha- 
baskan varieties survived into the 1970s. 


Esselen [226] was the language spoken in the late 
eighteenth century in the upper Carmel Valley and 
around Big Sur, on the central California coast south 
of Monterey. It is an isolate usually considered to 
belong to the controversial Hokan stock. The small 
population was divided among three missions and 
the language did not survive long into the nineteenth 
century. Esselen is directly documented in three 
vocabularies collected between 1786 and 1832, as well 
as by a catechism and a short translation. Additional 
material was obtained between 1830 and 1930 from 
Costanoan speakers who remembered something of 
Esselen. The total extant corpus is about 300 words. 


Etchemin [51] was an Algonquian language spoken 
along the coast of Maine, extinct since the seven- 
teenth century. Nothing is known of it for certain 
besides a list of numbers, which indicate it was dis- 
tinct from adjoining Algonquian languages. 


Eyak [65], the only member of its branch of the Na- 
Dene family, was spoken in the nineteenth century 
along the south-central Alaska coast from Yakutat to 
the Copper River. Today, about 50 of the approxi- 
mately 500 members of the Eyak Village corporation 
recognise Eyak or part-Eyak ancestry (most of the 
others are Chugach). As of 2001 there was only one 
remaining speaker, born in 1920 and living in 
Anchorage. 


Flathead. See Spokane-Kalispel 


Gabrielino (Tongva) [243], a language of the Serrano- 
Gabrielino division of the Takic subfamily of 
Uto-Aztecan, was the language of the community 
at San Gabriel mission. The Gabrielino were origin- 
ally a large and powerful tribe whose territory 
included most of Los Angeles and Orange County. 
Several local dialects were distinguished including 
Gabrielino (proper), spoken in the Los Angeles 
basin; Fernandefio, spoken in San Fernando Valley; 
and the variety spoken on Santa Catalina Island. The 
Gabrielino were nearly destroyed by missionisation, 
but a few speakers survived into the twentieth century 
and the language was extensively documented, 
primarily by Harrington between 1914 and 1933.


Galice-Applegate [99] was a language of the Oregon 
Athabaskan subgroup formerly spoken along Galice 
Creek and Applegate River, tributaries of the Rogue 
River in southwestern Oregon. There were at least two 
distinct dialects, but only the Galice Creek dialect 
is well documented. After the Rogue River War of 
1855-56 the Galice-Applegate people were forcibly 
resettled in northern Oregon on the Grand Ronde 
and Siletz Reservations, where the language fell into 
disuse. 


Garza. See Comecrudan 
Gitksan. See Nass-Gitksan
Gosiute. See Shoshoni 


Greenlandic [10, 11] is the English name for the Inuit 
(Eastern Eskimo) dialects of Greenland (the Inuit 
term is Kalaallisut). Of the 79 Inuit communities in 
Greenland, all but 17 are on the west coast, including 
the largest, Nuuk (Godthåb), which has an Inuit population
of 8,500. There is a significant dialect difference
between the west coast settlements and those on the 
east coast, leading to a distinction between West 
Greenlandic and East Greenlandic. The five Thule 
communities in the far northwest of the island consti- 
tute a third dialect cluster, sometimes called Polar 
Eskimo. This dialect is closer to the speech of Baffin 
Island than to West or East Greenlandic, and is usu- 
ally considered to be a variety of Eastern Canadian 
Inuktitut that has been influenced by standard Green- 
landic. Greenland, which became an autonomous 
province associated with the Danish Commonwealth 
in 1979, has a population of about 56,000, approxi- 
mately 10,000 of whom are Danes. The remaining 
46,000 are Inuit, nearly all of them speakers of Green- 
landic. Another 7,000 speakers of Greenlandic live in 
Denmark, most of them in Copenhagen. 


Gros Ventre (Atsina) [28], a moribund Algonquian 
language of the High Plains, is closely enough related 
to Arapaho that speakers of the two languages 
could to some extent understand each other. It is the 
heritage language of approximately 1,000 Gros Ventre 
who live on the Fort Belknap Reservation in north- 
central Montana, which they share with the Siouan 
Assiniboine. The Gros Ventre allied with the Black- 
foot in the eighteenth century, by which time early 
vocabularies show that their language was already 
differentiated from Arapaho. (They are occasionally 
confused with the completely unrelated Hidatsa, 
whom the French also called Gros Ventre, apparently 
because the symbols for the two tribes in Plains Indian 
Sign Language are quite similar.) Fewer than ten 
elderly first-language speakers remain, none of them 
fully fluent; the last traditional speaker died in 1981. 


Guaicura (Waikuri) [220], the sole confirmed mem- 
ber of the postulated Waikurian family in southern 
Baja California, was spoken in the territory of Mission 
San Luis Gonzaga in the interior between Loreto and 
La Paz and south across the Magdalena Plain to the 
southwestern coast. It is documented in the eight- 
eenth century report of a Jesuit who had served in the 
mission at San Luis Gonzaga; he supplies only the 
Lord’s Prayer, the Twelve Articles of the Creed, a verb 
paradigm, and a few additional words. 


Guaicurian (Waikurian) is a presumed family of lan- 
guages that were formerly spoken in the southern- 
most part of the peninsula of Baja California. Only 
one of the languages, Guaicura (Waikuri), is attested 
in any appreciable way. Several other languages or 
dialects were also spoken in lower Baja California, 
but there is next to no attestation of these. Never- 
theless, based on non-linguistic information and 
observations of similarities and differences in colonial 
sources, it is sometimes hypothesised that in addition 
to Guaicura, the Guaicurian family may also have 
had two other languages, Pericú [221] and Monqui 
(Monqui-Didit) [222], spoken in a small region 
around Loreto. Some have also held that Huchiti 
(Uchiti), another essentially unattested language, was 
another Guaicurian language, though it is seen by 
others to be a variant of Guaicura itself. 


Guarijio (Warijio). See §2.8, p. 54. 


Gwich'in (Kutchin) [77] is an Athabaskan language 
spoken in northeastern Alaska in the villages of Arctic 
Village, Venetie, Fort Yukon, Chalkyitsik, Circle, and 
Birch Creek, as well as in Aklavik, Inuvik, Tsiigehtchic 
(formerly Arctic Red River) and Fort McPherson in 
the Northwest Territories, and in Old Crow in 
the Yukon Territory. A distinction is made between 
Western (Alaskan) and Eastern (Canadian) dialects, 
with the latter often called Loucheux. The Gwich'in 
population of Alaska is about 1,100, and of that 
number about 300 are speakers of the language; the 
Canadian population is about 1,900, with perhaps as 
many as 500 speakers. 


Haida [112], an isolate (sometimes grouped with Na- 
Dene), is the aboriginal language of the isolated 
Queen Charlotte Islands (known locally as Haida 
Gwaai), which lie about 75 miles off the coast of 
British Columbia, immediately south of Alaska. There 
are two Haida villages on the Queen Charlottes, each 
with a distinct dialect: Masset (pop. 750) and Skide- 
gate (pop. 500). Between 50 and 100 first-language 
speakers, none younger than 50, are divided between 
the two communities; only those over 70 are active 
speakers. About 600 additional Haida people live in 
Alaska (representing an eighteenth century migration 
from Masset), in the villages of Hydaburg, Kasaan, 
and Craig on the southern half of Prince of Wales 
Island, as well as in the city of Ketchikan. Only about 
15 Alaskan Haidas, all very elderly, are active speakers 
of the language. 


Haisla [113] is the northernmost Northern Wakashan 
language, spoken in northwestern British Columbia 
immediately adjacent to Coast Tsimshian. The 
principal Haisla community is Kitamaat, where out of 
a total population of 600 there are estimated to be 
between 50 and 150 fluent speakers, none under the 
age of 25. 


Halkomelem [126] is a Central Salish language, spoken 
in southwestern British Columbia in a number of 
small communities along the lower Fraser River and 
on the east coast of Vancouver Island. The combined 
population of all Halkomelem groups is 6,700, of 
whom about 120 speak the language with some degree 
of fluency, and another 100 are passive speakers. 
Three dialects are recognised: (1) Island Halko- 
melem, on the southeast coast of Vancouver Island, 
is spoken in local varieties at Malahat, Cowichan, 
Halalt, Chemainus, Penelakut, Nanaimo, and Nanoose. 
There are up to 100 active speakers. (2) Downriver 
Halkomelem, at the mouth of the Fraser River in and 
around the city of Vancouver, has six elderly first- 
language speakers. (3) Upriver Halkomelem (Sto:lo), 
in the Fraser River valley, has between five and ten 
fluent speakers, the youngest over 70. 


Han (Hän) [78] is a moribund Athabaskan language 
spoken in the village of Eagle, Alaska, and in Dawson 
City, Yukon Territory. Of the total Alaskan Han 
population of about 50 people, perhaps 12 speak the 
language. In Dawson City only a handful of fluent 
speakers remain. Han is closely related to Gwich'in, 
and some older speakers make use of Archdeacon 
McDonald’s Tukudh (Eastern Gwich'in) Bible (1886) 
and prayer book. 


Hanis [154] and Miluk [155] were the two languages 
of the Coosan family, formerly spoken in a small 
territory along the south-central coast of Oregon 
around Coos Bay. Both languages are often referred to 
as ‘Coos’, although they were quite distinct. Although 
some Hanis and Miluk people were removed to reser- 
vations between 1855 and 1875, most remained in 
their traditional territory, where more than 150 des- 
cendants still reside. The Confederated Tribes of Coos, 
Lower Umpqua, and Siuslaw was recognised in 1984, 
with tribal headquarters at Coos Bay. Extensive 
documentation of both Hanis and Miluk exists, from 
several linguists, including a number of sound record- 
ings made by Jacobs in the 1930s. A number of narra- 
tive texts have been published in both languages, and 
a grammatical sketch of Hanis. The last fluent speaker 
of Miluk died in 1939, and the last speaker of Hanis 
in 1972. 


Hano. See Tewa 


Hare [87] is an emergent Athabaskan language within 
the North Slavey group of Slavey dialects of the Dene 
complex. It is spoken as a first language by about 70 
people (30 of whom use it actively at home) in two 
communities in the Northwest Territories, Colville 
Lake (30 speakers out of a total population of 90), 
and Fort Good Hope (45 speakers out of 645 total). 


Havasupai. See Upland Yuman 


Heiltsuk-Oowekyala [114] is a Northern Wakashan 
language, spoken on the coast of British Columbia 
south of Haisla and Coast Tsimshian and north of 
Kwakiutl. It has two deeply differentiated dialects, or 
emergent languages, Heiltsuk (also known as Bella 
Bella) and Oowekyala. Heiltsuk is principally spoken 
in two communities, Bella Bella (population 1,200) 
and Kitasoo (or Klemtu, population 370), the latter 
on former Southern Tsimshian territory. There are 
between 100 and 200 fluent speakers, none under the 
age of 35. 


Hidatsa [271] is a Siouan language of the Missouri 
River subgroup, originally spoken in a group of 
villages along the Missouri River in central North 
Dakota upstream from the Mandan villages. The 
modern Hidatsas (sometimes called Gros Ventres) are 
one of the Three Affiliated Tribes and share the Fort 
Berthold Reservation of North Dakota with the Man- 
dans and the Arikaras. Their principal reservation 
settlement is Mandaree. There are about 200 fluent 
speakers, the youngest in their late 20s. 


Hitchiti. See Mikasuki 


Holikachuk [69] is a moribund Athabaskan language 
of west-central Alaska, formerly spoken by a group 
that lived at Holikachuk on the Innoko River, but 
which is now located at Grayling on the lower Yukon 
River. Holikachuk is intermediate between Ingalik 
and Koyukon, and was only identified as a separate 
language in the 1970s. The total population is about 
200, of whom six or seven speak the language. 


Hopi [247] is the traditional language of Hopi Pueblo 
and constitutes an independent branch of the Uto- 
Aztecan family. At least 5,000 of the approximately 
7,350 members of the tribe are fluent speakers, 
including many children. Four dialects are usually 
distinguished: (1) the First Mesa villages of Walpi and 
Sichomovi (the language of a third First Mesa village, 
Hano, is Tewa) and the town of Polacca; (2) the 
Second Mesa village of Shipaulovi; (3) the Second 
Mesa village of Mishongnovi (also called Toreva); and 
(4) the Third Mesa villages of Oraibi, Hotevilla, 
Bacabi, and New Oraibi, as well as the settlement of 
Moencopi 40 miles to the west. 


Hualapai. See Upland Yuman 
Huchiti. See Uchiti 


Hupa [101], the only surviving language of the California 
Athabaskan subgroup, is spoken on the Hoopa Valley 
Reservation in northwest California. There are fewer 
than 12 fluent first-language speakers, all elderly. 


Huron-Wyandot (Wendat) [313] was a Northern 
Iroquoian language that was originally spoken in the 
Georgian Bay area of Ontario, near Lake Huron. A 
variety of this language was the lingua franca of the 
Huron Confederacy in the seventeenth century and 
was extensively documented by Jesuit missionaries. 
After the Huron Confederacy was destroyed by the 
Iroquois around 1650 in a war over control of the fur 
trade, some Hurons settled at Lorette, near Quebec 
City, where their French-speaking descendants still 
live, while others moved to Ohio. From there one 
group moved to Amherstburg, south of Windsor, 
Ontario, and the rest were removed to Kansas and 
later Oklahoma. They are now organised as the 
Wyandotte Tribe of Oklahoma, with about 3,600 
members. The Oklahoma group was the last to retain 
its language, but the language was gone by around 
1980. Huron was documented by French missionaries 
working in Canada in the seventeenth and eighteenth 
centuries, and Wyandot by Marius Barbeau in 
Ontario and Oklahoma in the early twentieth 
century. 


Ineseño [229] was the Chumash language spoken at 
Santa Ynez mission and throughout the upper valley 
of the Santa Ynez River. 


Ingalik (Deg Xinag) [68] is an Athabaskan language of 
west-central Alaska, spoken at Shageluk and Anvik 
and by the Athabaskans in the multilingual com- 
munity at Holy Cross on the lower Yukon River. Of a 
total population of about 275, about 40 speak the 
language. 


Innu-aimun is a socio-political designation for the 
Montagnais-Naskapi dialects of Labrador, specifically 
the Eastern Naskapi spoken at Natuashish and the 
Montagnais spoken at Sheshatshiu. The term is also 
sometimes used to refer to all Montagnais dialects. 


Inuit (Eastern Eskimo) consists of a chain of dialects 
spoken in at least 165 settlements from Norton Sound 
in Northwestern Alaska to the East Coast of Green- 
land. The dialects at the opposite ends of this chain 
are not mutually intelligible, but the location of 
‘language’ boundaries is largely arbitrary. Mainly for 
political and cultural reasons three languages are dis- 
tinguished, Inupiaq in Alaska, Inuktitut in Canada, 
and Greenlandic in Greenland. Each of these is sub- 
divided into dialect clusters or emergent languages. 


Inuktitut is the collective name for the dialects of 
Inuit (Eastern Eskimo) spoken on the northern coast 
of Canada, from the Mackenzie Delta in the west to 
Labrador in the east. (It is also sometimes used for 
Eastern Eskimo dialects in general, synonymous with 
Inuit.) The dialects of Western Canadian Inuktitut 
[8] are usually distinguished from Eastern Canadian 
Inuktitut [9], the boundary falling between the Cen- 
tral Arctic coast and Baffin Island, but there is no 
sharp discontinuity. All of the Inuktitut speakers of 
the newly formed territory of Nunavut, which 
encompasses dialects belonging to both the Western 
and Eastern divisions, can understand one another’s 
speech. The Inuktitut population in Canada is about 
31,000, a high proportion being speakers. In the 1996 
Canadian census 26,960 people indicated Inuktitut 
was their first language, 18,495 of them in the North- 
west Territories (and Nunavut) and 7,685 in Québec. 


Inupiaq is the collective term for the dialects of 
Inuit (Eastern Eskimo, q.v.) spoken in Alaska and immedi- 
ately adjacent parts of Northern Canada. There are 
two major dialect groups, Seward Peninsula Inupiaq 
(Qawiaraq) [6] and North Alaskan Inupiaq [7]. 
Seward Peninsula Inupiaq includes the local dialects 
of the southern Seward Peninsula and Norton Sound 
area, and of the villages surrounding Bering Strait 
and on King and Diomede Islands. North Alaskan 
Inupiaq includes the Malimiut dialect around 
Kotzebue Sound and the North Slope dialect spoken 
along the Arctic Coast as far east as the Mackenzie 
Delta. The Seward Peninsula and North Alaskan dia- 
lect groups differ significantly from each other and a 
fair amount of experience is required for a speaker of 
one to understand a speaker of the other. There are 
about 13,500 Inupiat (the plural form, referring to 
the people collectively) in Alaska, of whom about 
3,000, mostly over age 40, speak the language. 


Iowa (Baxoje). See Chiwere 
Ipai. See Diegueño 


Island Chumash (Isleño) [232] was the Chumash lan- 
guage originally spoken on three islands in the Santa 
Barbara Channel — San Miguel, Santa Rosa, and Santa 
Cruz — but with the establishment of the Spanish mis- 
sions, the Island communities were relocated on the 
mainland, primarily at San Buenaventura. It was quite 
distinct from the Chumash languages of the adjacent 
mainland, Barbareño and Ventureño. 


Isleta-Sandia. See Southern Tiwa 


Jemez [251], sometimes known as Towa, is the Kiowa- 
Tanoan language of the Pueblo of Jemez (pronounced 
Hay-mis), 45 miles northwest of Albuquerque. The 
residents of the former Pueblo of Pecos, east of Santa 
Fe, moved to Jemez early in the nineteenth century 
and some Pecos descendants retain separate tradi- 
tions and, to an undocumented extent, a distinctive 
variety of Towa. Nearly the entire Jemez population 
of about 3,000 speaks the language, including most 
children. 


Jicarilla [109] is an emergent language within the 
Southern Athabaskan dialect complex, spoken on the 
Jicarilla Reservation in northeastern New Mexico. 
There are about 300 first-language speakers and an 
equal or greater number of semi-speakers out of a 
total Jicarilla tribal population of 3,100. 


Juaneño. See Luiseño 


Kalapuya is the general term for the three languages 
of the Kalapuyan family, Tualatin-Yamhill [157], 
Central Kalapuyan (or Santiam) [158], and Yoncalla 
[159], formerly spoken throughout most of the 
Willamette River valley of western Oregon. All three 
languages had well-differentiated local dialects. The 
Kalapuya people suffered a catastrophic demographic 
decline after contact with whites, and in 1856 the few 
survivors were settled on the Grand Ronde Reserva- 
tion, where most abandoned their native language for 
other Indian languages or Chinook Jargon. The best 
documented varieties of Kalapuya are the Tualatin 
dialect of Tualatin-Yamhill and a variety of Santiam, 
but Jacobs was able to collect a substantial amount of 
data for all three languages, much of it from elderly 
speakers in the 1920s and 1930s. This documentation 
remains largely unpublished except for an extensive 
collection of narrative texts (without interlinear gloss- 
ing). Most Kalapuyan varieties were extinct before 
1940, but a speaker of Santiam lived into the 1950s. 


Kalispel. See Spokane-Kalispel 


Kansa [279] was a language of the Dhegiha subgroup 
of Siouan closely related to Osage. Kansa was spoken 
before the mid-nineteenth century by the Kansa or 
Kaw tribe in northeastern Kansas, which was removed 
in 1873 to a small reservation in Oklahoma. Dissolved 
in 1902, the Kaw Nation was reconstituted in 1959 
with a headquarters at Kaw City, Oklahoma, and now 
has a membership of about 1,700. There have been no 
fluent speakers of the language since the early 1980s, 
but about a dozen people claim some knowledge of it. 


Karankawa [268] was spoken until the mid- 
nineteenth century along the south Texas coast from 
Galveston to Corpus Christi. It is attested in several 
vocabularies, collected between 1698 and 1888, one 
of the most extensive from a white woman in 
Massachusetts who had spent her childhood near 
the last Karankawa-speaking community. Although 
earlier thought to be part of a ‘Coahuiltecan’ family, 
Karankawa is now considered to be an isolate. 


Karkin. See Costanoan 


Karuk [190], an isolate, often associated with the 
controversial Hokan stock, is the heritage language of 
the Karuk Tribe of the Klamath River in northwestern 
California. There are fewer than a dozen fluent first- 
language speakers, and a larger number of semi- 
speakers and passive speakers. 


Kashaya or Southwestern Pomo [204] is a language 
of the Pomoan family, spoken by several dozen people 
along the California coast near the mouth of the Rus- 
sian River. 


Kaska [81] is an Athabaskan language spoken in the 
southeastern Yukon at Ross River, Watson Lake and 
Upper Liard, and in northern British Columbia at 
Lower Post, Fireside, Good Hope Lake, Dease Lake 
and Muncho Lake. In all these communities there is 
perhaps a total of 250 fluent speakers and another 
150 passive speakers. Kaska territory adjoins that of 
Tahltan (as well as the extinct Tagish) on the south- 
west and Sekani on the southeast, and a high degree 
of mutual intelligibility exists between these lan- 
guages and the adjoining dialects of Kaska. 


Kathlamet. See Cathlamet 


Kato (Cahto) [104] was the California Athabaskan 
language that was formerly spoken in Cahto Valley, 
near Laytonville. It was extensively documented 
around 1905-10 by P.E. Goddard, who published a 
grammatical sketch and a volume of narrative texts. 
The last fluent speaker died in the 1960s, but a few 
individuals remain who have fragmentary memories 
of the language. 


Kaw. See Kansa 


Kawaiisu [238] is the Southern Numic language of a 
small unrecognised tribe of the Tehachapi region 
between the Mohave Desert and the San Joaquin 
Valley in south-central California. Fewer than 10 
speakers were reported in 1994, but this is a signifi- 
cant proportion of the total population of this cultur- 
ally conservative group, which numbers less than 100. 


Keweevkapaya (Yavapai). See Upland Yuman 


Kickapoo [42] is an Algonquian language that is partly 
mutually intelligible with Sauk-Fox but has been 
spoken since the earliest contact in the seventeenth 
century by a separate political group. The Kickapoos 
are first known to have been in southeastern Michi- 
gan and northwestern Ohio, and after the mid- 
seventeenth century they were in Wisconsin and later 
Indiana. They are now on reservations in Kansas and 
the Mexican state of Coahuila, and in communities 
near McLoud, Oklahoma, and Eagle Pass, Texas. 
There are estimated to be around 1,100 first-language 
speakers of Kickapoo, about 700 of them in Mexico, 
400 in Oklahoma, and only a few in Kansas. 


Kiksht [150] is the only surviving language of the Chi- 
nookan family, originally spoken along the Columbia 
River from its mouth upriver to the vicinity of The 
Dalles. Kiksht (or Upper Chinook) was the language 
spoken upstream from Portland and originally 
included a string of dialects, of which only the east- 
ernmost, Wasco-Wishram, is still spoken. The Wasco 
variety is represented by five elderly speakers on the 
Warm Springs Reservation in north-central Oregon. 
The Wishram variety is spoken by two elders on the 
Yakama Reservation in south-central Washington. 


Kiliwa [217] is a Yuman language, forming its own 
distinct subgroup. Originally spoken on the Baja 
California peninsula south of Paipai, the fewer than 
10 surviving speakers of Kiliwa now share the Santa 
Catarina community with speakers of Paipai. 


Kings River. See Yokuts 


Kiowa Apache (Plains Apache) [111] is a nearly 
extinct Southern Athabaskan language, distinctly 
different from the rest of the Southern Athabaskan 
dialect complex. It is spoken in Caddo County, 
western Oklahoma, by the descendants of an Apache 
band that joined the Kiowas in the eighteenth cen- 
tury. Only three very elderly first-language speakers 
remain, together with a few semi-speakers, but there 
are no second language speakers. 


Kiowa is the only Kiowa-Tanoan language not spoken 
in a Pueblo community. It is the traditional language 
of the Kiowas, a Plains tribe now settled in south- 
western Oklahoma, mainly in Caddo, Kiowa, and 
Comanche counties. There is no reservation, but the 
Kiowa Tribe has its headquarters in Carnegie, Okla- 
homa. There are fewer than 400 speakers, most over 
the age of 50, out of a population of about 6,000. 


Kitanemuk [242] was a language of the Serrano- 
Kitanemuk division of the Takic subfamily of Uto- 
Aztecan, closely related to Serrano. It was formerly 
spoken in the Tehachapi Mountains and Antelope 
Valley in the interior of southern California, immedi- 
ately south of the San Joaquin Valley. The last 
speakers probably died in the 1940s. 


Kitsai [291] was the Northern Caddoan language of a 
tribe that formerly lived south of the Wichitas in east 
Texas, but which joined the Wichitas in Oklahoma 
in 1858. Kitsai continued to be spoken alongside 
Wichita for two or three generations, and then died 
out. The language was moderately well documented 
in 1929-30 from the last speaker, who died in 1940. 


Klallam. See Clallam 


Klamath-Modoc [186] is the Plateau (Plateau 
Penutian) language originally spoken, in shallowly 
differentiated dialects, by both the Klamaths of south- 
central Oregon and the Modocs of the Tule Lake 
area in adjacent northeastern California. After the 
Modoc War of 1872-1873 about 150 Modocs were 
relocated in Oklahoma, the remainder merging into 
the Klamath community. The Modoc dialect is 
extinct in Oklahoma. The Klamath dialect has con- 
tinued to be spoken in and around the former reser- 
vation community of Chiloquin, Oregon. In 2001 
only one very elderly first-language speaker survived. 


Koasati (Coushatta) [304] is an Eastern Muskogean 
language, closely related to Alabama. It was spoken in 
the sixteenth century in the upper Tennessee Valley, 
and now is spoken by 300 to 400 members of the 
Coushatta Tribe of Louisiana (living near Elton, in 
Allen Parish), and by up to 100 residents of the 
Alabama-Coushatta Indian Reservation near Living- 
ston, Texas, where Alabama is the dominant language; 
some speakers of Alabama have also learned to speak 
Koasati. There are speakers of all ages in the Louisiana 
community, which is culturally quite conservative. 
Until recently there were also a few elderly speakers of 
Koasati among the 900 enrolled members of the 
Alabama-Quassarte Tribe. 


Kolchan. See Upper Kuskokwim 


Konkow [163] is a Maiduan language, spoken in the 
Feather River and Oroville area of Butte and Yuba 
County, California, at the eastern edge of the Sacra- 
mento Valley. The principal modern Konkow com- 
munity is at the Mooretown Rancheria. There are a 
few elderly speakers. 


Konomihu [195] was a Shastan language, once spoken 
along the Salmon River, centering in the area around 
Forks of Salmon. It has been extinct for several gener- 
ations and is very poorly attested. 


Koyukon [70] is the Athabaskan language of 11 
villages along the Koyukuk and middle Yukon rivers 
of central Alaska, the most widespread Athabaskan 
language in the state. It is spoken in three dialects, 
Upper, Central, and Lower. About 300 speak the lan- 
guage, out of a total population of about 2,300. 
Koyukon was painstakingly documented by the Jesuit 
missionary Jules Jetté in the early twentieth century. 


Kumeyaay. See Diegueño 
Kutchin. See Gwich'in 


Kutenai (Kootenai, Ktunaxa) [269], an isolate, is the 
heritage language of three politically independent 
groups in Montana, Idaho, and British Columbia (the 
name Ktunaxa is now official in Canada). The Mon- 
tana Kootenai are part of the Confederated Salish and 
Kootenai Tribes, and are concentrated at the northern 
end of the Flathead Reservation, around Elmo. The 
Kootenai Tribe of Idaho has a reservation near Bon- 
ners Ferry, in the Idaho panhandle. The communities 
in British Columbia are represented by the Ktunaxa/ 
Kinbasket Tribal Council and include the Lower 
Kootenay Band, with a reserve near Creston; the 
Tobacco Plains Band with a reserve at Grasmere; the 
St. Mary’s Band with a reserve near Cranbrook; and 
the Columbia Lake Band with a reserve at Winder- 
mere. Most elders of the neighbouring Shuswap Band, 
the Kinbaskets, near Invermere, have a working know- 
ledge of the Kutenai language. The language is spoken 
by elders in all the communities, and there were 
monolingual speakers at Tobacco Plains and Bonners 
Ferry as recently as the 1980s. With a few exceptions, 
the youngest first-language speakers are in their 50s, 
and even some of them have more of a passive know- 
ledge of the language than active fluency. 


Kwak'wala. See Kwakiutl 


Kwakiutl (Kwak'wala) [115] is the southernmost of 
the three Northern Wakashan languages, spoken in a 
number of local varieties on the central coast of Brit- 
ish Columbia from Smith Sound to Cape Mudge, and 
on the northern third of Vancouver Island. The prin- 
cipal communities are Campbell River, Cape Mudge, 
Fort Rupert, Mamaleleqala, Nimpkish (Alert Bay), 
Nuwitti, Qualicum, Quatsino, Tanakteuk, Tlowitsis- 
Mumtagilia, Tsawataineuk, and Tsulquate. Out of a 
total population of approximately 3,500 there are 
between 200 and 400 speakers, none under the age 
of 35. 


Kwalhioqua-Clatskanie [96] was an Athabaskan lan- 
guage formerly spoken in two separate areas along the 
lower Columbia River, the Kwalhioqua to the north of 
the river, the Clatskanie to the south, separated by the 
territory of the Lower Chinook and Cathlamet. The 
Kwalhioqua area, along the Willapa River in what is 
now southwestern Washington, included two sub- 
groups, the Willapa and the Suwal. The Kwalhioqua- 
Clatskanie people were dispersed among Coast Salish 
tribes in the nineteenth century and their language 
was extinct before the 1930s. 


Laguna. See Acoma-Laguna 
Lake Miwok. See Western Miwok 
Lakota. See Sioux 


Laurentian [314] was a Northern Iroquoian language 
spoken in the sixteenth century along the St. Law- 
rence River in Quebec. It is known only from the 
wordlists collected by the explorer Jacques Cartier 
during his two visits in 1534 and 1536. Examination 
of the data suggests they represent more than one 
dialect. By the time Champlain visited the area in 
1603 the Laurentians had vanished. 


Lenape. See Unami 


Lillooet (St'at'imcets) [137] is an Interior Salish lan- 
guage of southwestern British Columbia, and is 
spoken in two major dialect clusters, Upper Lillooet, 
in and around Fountain and Lillooet on the Fraser 
River, and Lower Lillooet, in and around Mount 
Currie, near Pemberton, on the Lillooet River. The 
bands associated with these two areas are culturally 
and politically distinct. There are about 200 speakers 
divided between the two areas. 


Lipan [110] was an emergent language within the 
Southern Athabaskan dialect complex, spoken in the 
eighteenth century by several bands of Plains Apaches 
who lived in south-central Texas. During the nine- 
teenth century the Lipan amalgamated with other 
Apache groups and today their descendants share the 
Mescalero Reservation in southeastern New Mexico 
with the Mescalero and the Chiricahua. There were 
two or three elderly speakers living as late as 1981, 
but the language is now extinct. It is very poorly 
documented. 


Loucheux. See Gwich'in 


Loup [53] was an Algonquian language of the 
Southern New England group, spoken in central 
Massachusetts and parts of northern Connecticut. It 
is mainly attested by a list of words and sentences 
recorded from refugees among the Abenakis at the St. 
Francis mission in Quebec, where their descendants 
became speakers of Western Abenaki in the course of 
the eighteenth century. A second source, sometimes 
distinguished as ‘Loup B’, shows extensive dialectal 
variation and may simply be notes on the speech 
of New England Algonquian refugees in French 
missions, rather than a record of a separate language. 


Lower Chehalis [133], one of the four languages of the 
Tsamosan division of the Salishan family, was for- 
merly spoken in a number of local varieties along the 
lower Chehalis River and around Grays Harbor and 
Shoalwater Bay on the coast of southwest Washing- 
ton. Most Lower Chehalis descendants live on the 
Shoalwater Bay Reservation, although some live on the 
Quinault Reservation. Few speakers of Lower Cheha- 
lis survived into the twentieth century, and the lan- 
guage is not well documented, although a substantial 
number of sound recordings of one of the last fluent 
speakers were made around 1940. One very elderly 
and incapacitated man may have some limited flu- 
ency, and a few others may have a passive knowledge 
of words and phrases, but the language is extinct as a 
medium of communication. 


Lower Chinook [148] was the Chinookan language 
formerly spoken at the mouth of the Columbia River 
from Shoalwater Bay in the north to Tillamook Head 
in the south, and for about ten miles upstream. After 
a brief period of prosperity following the establishment of 
a fur-trading post at Astoria in 1811, the Lower Chinooks suffered a 
steep demographic decline, and by the end of the 
nineteenth century the surviving Lower Chinooks 
had been incorporated into the Lower Chehalis Salish 
to the north. The last speaker died in the 1930s. 


Lower Tanana [72] is an Athabaskan language belong- 
ing to the Tanana series. It was originally spoken in a 
number of villages on the Tanana River in the vicinity 
of Fairbanks, but now is spoken only at Nenana and 
Minto. The native population of these two villages is 
approximately 380, of whom about 30 elderly people 
speak the language. 


Luiseño [246] is a Uto-Aztecan language of the Takic 
subfamily, originally spoken by the native peoples of 
the southern California coast north of the Diegueño 
and south of the Gabrielino, in the area dominated by 
the missions of San Luis Rey and San Juan Capist- 
rano. The dialects of the two mission communities 
differed, but were mutually intelligible; the Juaneño 
(Ajachemem) dialect is now extinct. Both dialects 
were extensively documented by J. P. Harrington in 
the 1930s. Approximately 2,500 descendants of the 
San Luis Rey and San Juan Capistrano communities 
currently live at the La Jolla, Rincon, Pauma, 
Pechanga, and Pala Reservations, and in the town of 
San Juan Capistrano. Between five and ten elderly 
speakers or semi-speakers of Luiseño remain. 


Lummi. See Northern Straits Salish 


Lushootseed [130] is a complex of closely related 
Central Salish dialects spoken in the Puget Sound 
area of Washington (older sources refer to it as Puget 
Sound Salish). The principal modern communities in 
which Lushootseed is the heritage language include 
the Upper Skagit, Swinomish, Suquamish, Muckle- 
shoot, Puyallup, and Nisqually Reservations, and 
especially the Tulalip Reservation near Marysville. 
Lushootseed is now spoken as a first language by 
fewer than five elderly people out of a total popula- 
tion of over 18,000. 


Mahican [56] was an Eastern Algonquian language 
spoken in the upper Hudson River Valley from Lake 
Champlain south to Greene County, and in the upper 
Housatonic Valley in Massachusetts. It shared features 
with its southern neighbour Munsee and with the 
Southern New England languages to the east. In the 
eighteenth century it was the dominant language at 
the mission village of Stockbridge, Massachusetts. 
There is a fair amount of documentation, principally 
by Moravian and English missionaries, and there are 
translations of religious texts by literate native 
speakers. Some dialectal diversity is attested. Many 
Mahicans joined various groups of Delawares and 
their descendants switched to speaking Munsee or 
Unami. A remnant in upstate New York was removed 
in the nineteenth century to Wisconsin, where the 
language was last spoken in the 1930s, and where 
some fieldwork was done with the last speakers and 
semi-speakers. 


Maidu [162], also known as Northeastern Maidu or 
Mountain Maidu, is the Maiduan language of the 
people who traditionally occupied the northern 
Sierra Nevada and the Honey Lake Valley, east and 
south of Lassen Peak in northeastern California. 
Some Maidu descendants are members of the small 
Susanville and Greenville Rancherias, but most live 
away from tribal land in scattered locations in Plumas 
and Lassen Counties. Only a few semi-speakers of 
Maidu remain. All three of the Maiduan languages 
(Maidu, Konkow, and Nisenan) are sometimes 
referred to collectively as ‘Maidu’, but they differ con- 
siderably in their grammars and are not mutually 
intelligible. 


Makah [118] is the Southern Wakashan language 
spoken by the Makah Tribe, at Neah Bay on the 
northwest tip of the Olympic Peninsula in Washing- 
ton State. Of the other two Southern Wakashan 
languages, Nootka and Nitinaht, both spoken on 
Vancouver Island, Makah appears to be closer to the 
latter. About a dozen elderly first-language speakers 
survive. 


Maliseet-Passamaquoddy [48] is an Eastern Algon- 
quian language spoken in the St. Croix and St. John 
River valleys along the border between the state of 
Maine and the Canadian province of New Brunswick. 
There are about 500 first-language speakers. Most of 
those in Maine are members of the Passamaquoddy 
tribe, while in Canada they identify themselves as 
Maliseet, but dialect differences between the two 
groups are minimal. The principal communities in 
Maine are Pleasant Point and Indian Township, with 
fewer than 100 fluent speakers between them. In New 
Brunswick there are 355 first-language speakers at 
Tobique, Woodstock, Kingsclear, St. Mary’s, and 
Oromocto; another 40 live elsewhere in Canada. 
There are also some speakers of both dialects residing 
with the Penobscots of Indian Island, at Old Town, 
Maine, and in an urban community in Bridgeport, 
Connecticut. Nearly all speakers are middle-aged, 
with the most fluent in their 60s and older. 


Mamulique. See Comecrudan 


Mandan [273] is a moribund Siouan language, origin- 
ally spoken in a cluster of villages along the Missouri 
River in south central North Dakota, located between 
the Hidatsa villages to the north and the Arikara 
villages to the south. The modern Mandans are one of 
the Three Affiliated Tribes who share the Fort 
Berthold Reservation in North Dakota, the others 
being the Hidatsas and the Arikaras. Their primary 
settlement is Twin Buttes. There are fewer than ten 
remaining first-language speakers of Mandan, all 
elderly, and no fluent second-language speakers. 


Maricopa [212] is a Yuman language of the River 
subgroup, originally spoken by several small tribes 
along the lower Gila and Colorado Rivers. It is now 
spoken by a minority — perhaps as few as 100 — of the 
approximately 800 members of the Maricopa (or 
Pee-Posh) tribe of Arizona, most of whom live at the 
Maricopa Colony at Laveen, on the Gila River Reser- 
vation south of Phoenix, and in the community 
of Lehi on the Salt River Reservation northeast of 
Phoenix. 


Maritime Tsimshian, the more southerly of the two 
branches of the Tsimshianic language family of 
northwestern British Columbia, has two deeply- 
differentiated dialects, or possibly emergent lan- 
guages. Southern Tsimshian (or Sküüxs) [147], ori- 
ginally spoken along the coast south of the Skeena 
River and on a few islands, now has only one fully 
fluent speaker, who lives in the village of Klemtu on 
Swindle Island. Coast Tsimshian (often referred to 
simply as Tsimshian or as Sm'algyax) [146] is spoken 
near Terrace on the lower Skeena River and on the 
coast near the Skeena estuary, as well as at one loca- 
tion in southern Alaska. In the Canadian com- 
munities it is estimated that there are between 250 
and 400 fluent speakers of Coast Tsimshian out of 
a total population of 6,780, almost none under 40. 
The highest proportion of speakers is in Kitkatla and 
Hartley Bay, where perhaps one person in five is 
fluent, although most are aged 50 or older. Of the 
1,300 Coast Tsimshians currently living at Metlakatla, 
about 70 of the most elderly speak the language. 


Massachusett-Narragansett [48] was a complex of 
Southern New England Algonquian dialects spoken 
around Massachusetts Bay from southern Maine to 
Cape Cod, as well as around Narragansett Bay and on 
Martha’s Vineyard. Several varieties of Massachusett- 
Narragansett are quite well documented from the 
Colonial period, including the dialect of the mission 
town of Natick, which was the basis of John Eliot’s 
translation of the whole Bible (1663) and grammat- 
ical sketch (1666), and the Narragansett dialect 
of Rhode Island (Williams 1643). A grammar by 
Goddard is included in Goddard and Bragdon's (1988) 
edition of documents written by native speakers. 
Currently there are efforts to revive the speaking 
of the language under the name Wampanoag. 
The language fell into disuse in most areas during the 
eighteenth century, but a variety continued to be 
spoken on Martha’s Vineyard until the end of the 
nineteenth century. 


Mattole [102] was a language of the California Atha- 
baskan subgroup formerly spoken along the Mattole 
and Bear Rivers near Cape Mendocino. The Mattole 
River and Bear River dialects were distinct. The last 
speaker died in the 1950s, although a few individuals 
retain some memory of the language. 


Menominee [32] is a Central Algonquian language, 
spoken on the Menominee Reservation in northern 
Wisconsin in the towns of Keshena, Neopit, South 
Branch, and Zoar. It is spoken as a first language by 
about 35 people, none under 50 years old. 


Mescalero-Chiricahua [107, 108] is an emergent lan- 
guage within the Southern Athabaskan dialect com- 
plex, spoken with very little dialectal variation by 
people whose tribal identity is either Mescalero or 
Chiricahua. Over 1,500 members — slightly under half 
— of the Mescalero Tribe of New Mexico are first- 
language speakers of Mescalero-Chiricahua, most of 
them Mescaleros, although perhaps a dozen identify 
as Chiricahuas. The principal Chiricahua community 
is at Ft. Sill, in southwestern Oklahoma, where they 
settled early in the twentieth century. At most three 
fluent speakers remain at Ft. Sill, together with a few 
semi-speakers. 


Miami-Illinois [45, 46] was an Algonquian dialect 
complex spoken in the eighteenth century and earlier 
by groups in what is now Illinois and northern 
Indiana. At least three clusters of dialects were pre- 
served after the relocation of these tribes to Kansas in 
the nineteenth century: (1) Peoria-Kaskaskia, (2) 
Piankashaw-Wea, and (3) Miami. Speakers of at least 
some varieties survived into the twentieth century, 
but there have been no fluent first-language speakers 
since 1962. 


Michif (Mitchif) [16] is a special variety of Plains Cree 
in which Cree and French are combined. It is spoken 
in some communities of Métis, descendants of Indian 
women and French fur traders who form a distinct 
ethnic group on the northern Great Plains. In Michif 
the words, grammar, and pronunciation of nouns and 
their modifiers are almost entirely those of a variety 
of Canadian French, while the verbs are the grammat- 
ically complex, fully inflected forms of Plains Cree, 
with minor differences. Although Michif is correctly 
called a mixed language, some have pointed out that 
the contributing languages are ‘intertwined’ rather 
than literally mixed together. Several hundred thou- 
sand Métis live in Manitoba, Saskatchewan, Alberta, 
and North Dakota, and Cree, Ojibwe, French, and 
more recently English have been widely spoken in 
Métis communities, which have typically been multi- 
lingual. Michif was spoken only by a small minority 
of Métis families, many of whom have now lost it. It 
is difficult to estimate the current number of 
speakers, but it is probably around 200, nearly all over 
70 years old. The most detailed source on the lan- 
guage is Bakker 1997; a dictionary has been published 
by Laverdure & Allard (1983). 


Micmac (Mi'kmaq, Mi'gmaq, Míkmaq) [47] is an 
Eastern Algonquian language spoken in over 25 
reserves scattered across the provinces of Nova Scotia, 
New Brunswick, Prince Edward Island, and Quebec, 
as well as in the United States. There are 8,145 first- 
language speakers in a total Micmac population of 
about 20,000, the most important speech com- 
munities being at Restigouche, Quebec; Big Cove, 
New Brunswick; and Eskasoni, on Cape Breton 
Island, Nova Scotia. There is also a large expatriate 
community of Micmacs in the United States, primar- 
ily in Boston, where perhaps a quarter of all first- 
language speakers reside. A distinctive pictographic 
script (‘Micmac Hieroglyphs’) devised by seventeenth- 
century missionaries, and apparently based on 
indigenous mnemonic designs, is still used in the 
Catholic liturgy on the Eskasoni Reserve and in a few 
other places. The standard orthography now in use 
has its origins in nineteenth-century missionary work. 


Migueleño. See Salinan 


Mikasuki [301] is the only surviving dialect of the 
Eastern Muskogean language known as Hitchiti (or 
Hitchiti-Mikasuki). It is spoken in Florida by most of 
the 400 enrolled members of the Miccosukee Tribe, 
as well as by many of the 2,700 members of the 
Seminole Tribe. There are five Seminole reservation 
communities, with Mikasuki the dominant language 
at the Big Cypress, Immokalee, Hollywood, and 
Tampa reservations, while the Florida Seminole dia- 
lect of Creek is dominant at the Brighton Reservation. 
Until the 1980s, many Miccosukee and Seminole 
children spoke little English when entering school. 
This situation has shifted in recent years, and many 
children now learn English at home. 


Miluk. See Hanis 


Miwok is the general common name for the lan- 
guages of the Miwokan language family, spoken 
aboriginally in north-central California from San 
Francisco Bay to the Sierra Nevada. Seven languages 
are usually distinguished, subdivided linguistically 
into Western Miwok and Eastern Miwok. 


Modoc. See Klamath-Modoc 


Mohave (Mojave) [210], a Yuman language of the 
River subgroup, is the heritage language of the Fort 
Mojave Tribe, near Needles, California, and of the 
Mohave members of the Colorado River Indian 
Tribes, near Parker, Arizona. In the two communities 
combined there are fewer than 100 fluent first- 
language speakers, nearly all of them elderly, in a total 
Mohave population of over 2,000. 


Mohawk [320] is a Northern Iroquoian language, 
originally spoken by the easternmost tribe of the Six 
Nations (Iroquois Confederacy) in the Mohawk River 
Valley of New York, between present Schenectady and 
Utica. There are six modern Mohawk communities, 
located primarily in Canada: Kahnawake and Kaneh- 
satake in the vicinity of Montreal; Ahkwesahsne on 
the St. Lawrence River at the US-Canadian border; 
Ohsweken (Six Nations) on the Grand River in 
southern Ontario; the Tyendinaga Reserve on the Bay 
of Quinté near Kingston; and a small settlement at 
Gibson (Wahta) east of Georgian Bay. Mohawk is the 
most vigorous of the Six Nations languages, with 
about 3,850 fluent first-language speakers. Of these, 
around 600 are at Kahnawake, 100 at Oka, 3,000 at 
Ahkwesahsne, and perhaps a dozen on the Six 
Nations Reserve. A small number of children are 
native speakers. 


Mohegan-Pequot [54] was an Algonquian language of 
the Southern New England group, spoken in eastern 
Connecticut and the eastern part of Long Island. 
There is scattered documentation, but dialect diver- 
sity is poorly known. The last speaker, a Mohegan, 
died in 1908. 


Molala [189] was formerly spoken along the western 
slopes of the Cascades of central Oregon, from the 
upper Rogue River to the vicinity of Mt. Hood. It is a 
Plateau (Plateau Penutian) language, distantly related 
to Klamath and Sahaptian; an earlier classification 
that linked Molala and Cayuse in the ‘Waiilatpuan’ 
family has been abandoned. The Molala were dis- 
placed and suffered a severe decline in population 
after the white occupation of their territory in the 
mid nineteenth century. The survivors were settled on 
the Grand Ronde Reservation, where their language 
and tribal identity were lost. The last speaker died in 
1958. 


Monachi. See Mono 


Mono [233] is a Western Numic language spoken in 
central California both on the western side of the 
Sierra Nevada, between Yosemite National Park and 
Kings Canyon National Park, and on the eastern side 
in Owens Valley from Lone Pine north to Big Pine. 
Mono has two main geographical divisions. Western 
Mono (or Monachi) is spoken in several communities 
that are close by (and socially connected to) Yokuts 
communities. The most important of these are at 
North Fork and Auberry (Big Sandy Rancheria), each 
of which has at least ten fully fluent speakers, the 
youngest in his 50s. There are also a few speakers at 
Tollhouse (Cold Springs Rancheria) and Dunlap. In 
addition, 100 or more people have some passive or 
second-language knowledge of the language. Eastern 
Mono (or Owens Valley Paiute) is spoken in the 
Owens Valley communities of Bishop, Big Pine, Lone 
Pine, and Fort Independence. All Eastern Mono 
speakers are elderly and number not more than 30 in 
a total population of about 1,000. Each of the com- 
munities has sponsored language revival programmes. 


Monqui (Monqui-Didit). See Guaicurian 


Montagnais (Innu) is an Algonquian language closely 
related to Cree that is spoken by over 10,000 people in 
Quebec and Labrador. There are three major dialects 
or emergent languages. Western Montagnais [25] is 
spoken at Mashteuiatsh (Pointe-Bleue on Lac Saint- 
Jean) and Betsiamites, Quebec; it was formerly 
spoken on the Essipit reserve (Les Escoumins). Cen- 
tral Montagnais [26] is spoken on the Matimekush 
and Lac-John reserves near Schefferville and the 
Uashat and Maliotenam reserves at Sept-Iles. Eastern 
Montagnais [27] is spoken at Ekuanitshit (Mingan), 
La Romaine (Uanaman-Shipu), Natashquan, and 
Pakuashipi (St-Augustin) in Quebec and at Sheshat- 
shiu in Labrador. The speech of the Eastern Naskapi 
at Davis Inlet, Labrador, may be considered an add- 
itional Montagnais dialect. 


Moose Cree [19] is the Cree dialect spoken by about 
3,000 people at Fort Albany, Kashechewan, and Moose 
Factory, on the southwestern shore of James Bay in 
northern Ontario. While speakers of all ages in 
Kashechewan are fluent, the language is in serious 
decline in Moose Factory, where many young adults 
have little or no knowledge of it. Moose Cree inter- 
grades with Eastern Swampy Cree, especially at 
Kashechewan, where it has been influenced by the 
Swampy Cree spoken at nearby Albany Post. 


Mountain [85] is an emergent Athabaskan language 
within the Slavey dialects of the Dene complex. In the 
Northwest Territories it is the principal language of 
Fort Wrigley (100 speakers out of 170 total popula- 
tion), as well as of some of the population of Tulita 
(Fort Norman), in many cases people who also speak 
Bearlake. There are also speakers of Mountain Slavey 
in the Yukon at Fort Liard and at Ross River. 


Munsee (Canadian Delaware) [57] is an Eastern Algo- 
nquian language originally spoken from the area 
around Minisink Island in the upper Delaware River 
Valley to the middle Housatonic River in western 
Connecticut, including northern New Jersey, the 
lower Hudson River and the New York City metro- 
politan area, and western Long Island. At one time 
there were speakers of Munsee in a number of 
locations further west, the largest groups being on 
the Moraviantown, Caradoc (Munceytown), and Six 
Nations reserves in Canada. Other speakers ended up 
in eastern Kansas and among the Unami-speakers 
of Oklahoma, the Mahicans in Wisconsin, and the 
Senecas of western New York. A few speakers remain 
among the Delaware First Nation on the Moravian- 
town Reserve near Thamesville, Ontario, where most 
of the documentation of the language comes from. 
The divergent Wampano dialect on the Housatonic, 
attested by three hymn translations, had adopted 
some features from Mahican and the southern New 
England languages. 


Mutsun. See Costanoan 
Nambe. See Tewa 


Nanticoke-Conoy [59] was the Algonquian language 
of Chesapeake Bay and the Delmarva Peninsula. 
Conoy (Piscataway) is known principally from some 
Roman Catholic prayers translated by Andrew White, 
an English Jesuit; more extensive materials once said 
to be in Rome have not been found. The longest 
record of Nanticoke is a vocabulary taken down on 
the Choptank River in Dorchester County, Maryland, in 
1792. A few words were also recorded from a group of 
Nanticokes who joined the Iroquois in the eighteenth 
century and ended up on the Six Nations Reserve in 
Canada, where the language was spoken as late as the 1860s. 
Other Nanticokes lost their language after joining 
Delaware refugees in the west. 


Naskapi [23, 24] is a cover term for two varieties of 
Cree-Montagnais, Western Naskapi (in Quebec) and 
Eastern Naskapi (in Labrador). See Eastern Naskapi 
and Western Naskapi 


Nass-Gitksan, the more northerly of the two branches 
of the Tsimshianic language family of northwestern 
British Columbia, consists of two emergent lan- 
guages. (1) Nisga'a (Nisgha) [144] is spoken in four 
village communities along the Nass River and has 
between 400 and 500 speakers, with no first-language 
speakers under 30. (2) Gitksan [145] is spoken in six 
village communities along the Skeena River upriver 
from the Coast Tsimshians. It has about the same 
number of speakers as Nisga'a (400 to 500), but there 
are a small number of families in which children and 
young people are fluent speakers. 


Natchez [306] was spoken in the seventeenth and early 
eighteenth centuries by the Natchez Indians, who 
lived on the Mississippi River in the vicinity of 
present-day Natchez, Mississippi. It was also spoken 
by the Taensa, who lived across the river, probably in 
a form that was not too different from what the 
Natchez spoke. Two nicknames learned among 
the Colapissa of southern Mississippi and eastern 
Louisiana suggest that these people also spoke a 
closely similar language. The Natchez tribe was des- 
troyed by the French in a series of wars that ended in 
1731, and many of the survivors took refuge with the 
Creeks and the Chickasaws, moving with them to 
Oklahoma in the 1830s, where they also intermarried 
with the Cherokees. Four Natchez wordlists were col- 
lected in Oklahoma during the nineteenth century, 
but the most important documentation was made 
during the twentieth century, by Swanton in 1907-15 
and by Haas in 1934-36, who between them worked 
with all the surviving speakers; much of this material 
remains unpublished. The last known speaker died in 
1965. 


Nawathinehena [30] was an Algonquian language 
closely related to Arapaho that was spoken south of 
the Northern Arapaho bands on the High Plains in 
the early nineteenth century. Its phonology is quite 
distinct from that of Arapaho and shares some 
innovations with Cheyenne, which was presumably 
a southern neighbour at an earlier location. The 
Nawathinehena adopted the language of the 
Northern Arapaho, to whom they were closely related 
and allied, in the course of the nineteenth century, 
and the distinct Nawathinehena language is known 
only from a vocabulary collected by A. L. Kroeber 
among the Southern Arapaho in 1899. 


Naukan Yupik. See East Cape Yupik 


Navajo [105] is a well-established language within the 
Southern Athabaskan dialect complex. Some degree 
of mutual intelligibility exists between Navajo and the 
other emergent languages of the complex, in particu- 
lar Western Apache, but the Navajo and Apache 
communities have been politically and culturally dis- 
tinct since at least the early eighteenth century. In 
1990 an estimated 115,000 people living on the 
Navajo Nation in northern Arizona and northeastern 
New Mexico had fluency in Navajo, about 75 per cent 
of the reservation population, to which must be 
added a somewhat lower percentage of the 12,000 to 
15,000 Navajos living off-reservation. A conservative 
estimate of the total number of fluent speakers in 
1990 would be about 120,000. In 2001, although the 
population has increased, the number of speakers is 
probably smaller. Until the Second World War Navajo 
was the universal language of communication on 
the reservation, and there are still several thousand 
elderly near-monolingual speakers. As late as 1981, 85 
per cent of Navajo children acquired Navajo as their 
first language, but the percentage has declined rapidly 
in recent years and some surveys show it now to be as 
low as 25 per cent. 


New River Shasta [193], a Shastan language, once 
spoken in a remote area near the head of Salmon 
River and New River, has been extinct for several 
generations and is poorly known. 


Nez Perce [188] is a Sahaptian language. Two dialects 
are distinguished, Upriver and Downriver, correlated 
with the original settlement pattern along the Snake 
River and its tributaries in eastern Washington and 
Idaho. The Upriver dialect is spoken fluently by a 
handful of elders at Kamiah and Lapwai on the Nez 
Perce Reservation in north-central Idaho, and by 
several more on the Colville Reservation in eastern 
Washington. The Downriver dialect is mainly pre- 
served by a few speakers on the Umatilla Reservation 
in Oregon (most of them descendants of Cayuse 
speakers who adopted Nez Perce in the nineteenth 
century). In addition to these fully fluent speakers 
there are between 30 and 40 speakers and semi- 
speakers of varying degrees of fluency, most of them 
in Idaho. 


Nicola [93] was an Athabaskan language formerly 
spoken in the Nicola and Similkameen Valleys of 
south-central British Columbia, adjoining Thompson 
Salish territory. It is documented only by fragmentary 
word lists that are insufficient to allow its relationship 
to other Athabaskan languages to be clearly deter- 
mined, although it appears closest to Chilcotin. 
Nicola was presumably extinct before the beginning 
of the twentieth century. 


Nisenan [164], also known as Southern Maidu, was 
the Maiduan language traditionally spoken in a num- 
ber of local village dialects along the Yuba River in the 
Sierra Nevada foothills east of Sacramento, Califor- 
nia. Nisenan descendants live at Auburn Rancheria 
in Placer County, Shingle Springs Rancheria in El 
Dorado County, and in other scattered locations in 
the area. 


Nisga'a. See Nass-Gitksan. 


Nishnaabemwin. See Ottawa (Odawa) and Eastern 
Ojibwe. 

Nitinaht (Ditidaht) [117] is the Southern Wakashan 
language of two groups on the west coast of Vancou- 
ver Island immediately to the south of Nootka terri- 
tory, now organised as the Ditidaht and Pacheenaht 
Bands. Although the Nitinahts were traditionally part 
of the Nootkan interaction sphere (and the Ditidaht 
Band is currently represented on the Nuu-chah-nulth 
Tribal Council), the Nitinaht language is sharply dis- 
continuous from the chain of Nootka dialects and 
closer to Makah. There are a dozen or so fluent 
speakers, all elderly. 


Nooksack [127] was a Central Salish language 
formerly spoken in the vicinity of Bellingham in 
northwest Washington, between Halkomelem and 
Lushootseed. The language has been well docu- 
mented by several linguists, although much of the 
documentation remains unpublished. The last fluent 
speaker died in 1977 and the last person with any 
degree of first-language competence died in 1988. 


Nootka (Nuuchahnulth) [116] is the Southern 
Wakashan language of the west coast of Vancouver 
Island, from Cape Cook to Barkley Sound. Modern 
Nootkas are organised in 15 Bands, all represented by 
the Nuu-chah-nulth Tribal Council (a name adopted 
in 1985). There are a handful of speakers in most 
communities, totalling over 50 but fewer than 100, 
out of a population of around 5,000. The language is 
used primarily in ceremonies and in some political 
meetings. A school-based teaching programme has 
existed for over a decade at Port Alberni (Sheshaht 
Band) and about a dozen individuals are literate, 
although no fluent second-language speakers have 
been produced. The name Nuuchahnulth is used 
locally to include also the distinct Nitinaht language. 


North Alaska Inupiaq. See Inupiaq 
North Slavey. See Bearlake and Hare 


Northeastern Pomo [198], a language of the Pomoan 
family, was spoken on the eastern slope of the Inner 
Coast Range in Northern California, around Stony- 
ford, on the west side of the Sacramento Valley north 
of Clear Lake. The Northeastern Pomo were isolated 
from other Pomoan languages by territory belonging 
to the Patwin, with whom they culturally identified; 
bilingualism in Patwin was common. The last fluent 
speaker died in 1961. 


Northern Algonquin (Algonquin du Nord) [34] is a 
distinct regional dialect within the Ojibwayan dialect 
complex, spoken in southwestern Quebec at Lac 
Simon, Grand Lac Victoria, La Barriere, and a few 
other communities. There are perhaps as many as 
1,000 fluent speakers. 


Northern Paiute (Paviotso) [234] is a Western Numic 
language formerly spoken in the western Great Basin 
from roughly the John Day River in Oregon south 
through the western third of Nevada, to the vicinity 
of Mammoth, California. Today limited numbers of 
speakers are found in reservation communities and 
colonies in Oregon, Nevada, California and Idaho, as 
well as in urban locations in these states. Principal 
communities are at Warm Springs and Burns, 
Oregon; Fort McDermitt, Owyhee, Winnemucca, 
Pyramid Lake, Reno-Sparks, Lovelock, Fallon, Yering- 
ton, and Walker River, Nevada; Lee Vining and Fort 
Bidwell, California. A variety (called Bannock) is also 
spoken by a few elderly people at Fort Hall, Idaho, 
where otherwise Shoshoni is the heritage language. 
There are two major dialects, with the Truckee River 
in west-central Nevada serving as the general dividing 
line. In addition, most of the individual communities 
have developed recognisable local varieties. Fluency 
in all communities except Fort McDermitt is confined 
to speakers 60 years and above, roughly 300 speakers 
total. Fort McDermitt has a fluency rate above 50 per 
cent (roughly 400 speakers), with about 20 per cent to 
30 per cent of children acquiring it as their first lan- 
guage. Semi-speakers from all areas add another 400 
to these figures. During the past 25 years nearly all 
communities have started teaching programmes, but 
few have continued. Beginning in the 1990s, Warm 
Springs, Reno-Sparks and Pyramid Lake began more 
sustained efforts. 


Northern Pomo [201], a language of the Pomoan 
family, has very few known elderly speakers at the 
Sherwood Rancheria near Willits, California. 


Northern Sierra Miwok. See Eastern Miwok 


Northern Straits Salish [128] is the Central Salish lan- 
guage of the southern tip of Vancouver Island, the 
San Juan Islands, and the mainland immediately to 
the south of the US-Canadian border. At least three 
dialects continue to be spoken in a number of small 
communities. (1) The Saanich dialect on Vancouver 
Island has fewer than 20 first-language speakers, the 
youngest nearly 60 years old. However, there are up to 
100 second-language speakers of varying degrees of 
fluency, and Saanich is frequently used in ceremonies 
and for tribal identity. A distinctive orthography 
based on English capital letters has been quite suc- 
cessful; literacy is high among first-language speakers 
and universal among second-language speakers. The 
local school has produced books of traditional stories, 
and there is a significant amount of Saanich material 
available on the Internet. (2) The Samish dialect of 
the San Juan Islands has about 5 remaining speakers, 
but their speech is mixed with other dialects or with 
Halkomelem and they do not form a distinct speech 
community. The Samish community in Anacortes, 
Washington, is developing a second-language teach- 
ing programme. (3) The Lummi dialect of the mainland 
has no active first-language speakers, and if passive 
first-language speakers exist they are likely to be 
elderly. 


Northern Tutchone [75] is an Athabaskan language 
(closely related to Southern Tutchone) spoken in the 
Yukon communities of Mayo, Pelly Crossing, Stewart 
Crossing, and Carmacks. There are about 200 
speakers out of a total population of 1,100. Early 
documentation of Northern Tutchone was carried 
out in the 1890s by the Anglican Archdeacon Thomas 
Canham, but most of his materials remain in 
manuscript. 


Nottoway [312] was a Northern Iroquoian language, 
closely related to Tuscarora. It was spoken in colonial 
times and into the nineteenth century along the 
Nottoway River in southeastern Virginia, adjacent to 
the original territory of the Tuscaroras in North 
Carolina. It is attested only in two early 
nineteenth-century wordlists. 


Nuuchahnulth. See Nootka 
Nuxalk. See Bella Coola 
Nxa7amxcin. See Columbian 


Obispeño [227] was the Chumash language spoken at 
San Luis Obispo mission, quite distinct from the 
other varieties of Chumash. 


Ofo [283] was a language of the Southeastern sub- 
group of Siouan, spoken in the 1670s on the east bank 
of the Mississippi River below the mouth of the Ohio, 
in present-day western Tennessee. The Ofos moved 
south to the Yazoo River in the 1690s and were sub- 
sequently absorbed into nearby tribes. The only 
documentation of Ofo is a vocabulary collected in 
1908 from a woman of Ofo descent, living among the 
Tunicas in Louisiana, who had learned some of the 
language from her grandmother. 


Oji-Cree. See Severn Ojibwe 


Ojibwayan [33-39] is a Central Algonquian dialect 
complex, varieties of which are spoken in a large 
number of communities in the north-central United 
States, and in Canada from Alberta to Quebec. 
There are approximately 43,000 Ojibwe speakers, the 
majority of them in Canada (where 31,625 first- 
language speakers were counted in the 1996 census). 
At least seven regional dialects or emergent languages 
can be distinguished: Severn Ojibwe, Northern Algo- 
nquin, Saulteaux, Central Southern Ojibwe (Anishi- 
naabemowin), Eastern Ojibwe (including Southern 
Algonquin), Ottawa (Odawa), and Old Algonquin. 
The first two are classified as Northern Ojibwe, and 
the rest as Southern Ojibwe.


Okanagan [140] is an Interior Salish language, spoken 
in a number of communities in southern interior 
British Columbia and northeastern Washington. 
There are seven Okanagan Reserves in British 
Columbia: Vernon, Douglas Lake, Westbank, Pentic- 
ton, Keremeos (Lower Similkameen), Hedley (Upper 
Similkameen), and Oliver (Osoyoos). Except for the 
Westbank Reserve, which may have as few as a dozen 
fluent speakers, all of the Okanagan Reserves have at 
least 50 speakers of varying degrees of fluency, the 
Vernon Reserve perhaps over 100. 


Okwanuchu [194], a Shastan language once spoken 
along the upper Sacramento River, near Mt. Shasta, is 
very poorly attested. It has been extinct for several 
generations. 


Old Algonquin [39] was the Algonquian language 
spoken in the early French missions in the lower 
Ottawa Valley and along the St. Lawrence, and is 
attested in a number of unpublished seventeenth and 
eighteenth century documents. It went out of use in 
the nineteenth century. Although it was part of the 
Ojibwayan dialect complex, Old Algonquin was quite
distinct from the Ojibwe dialects that are today called 
Northern and Southern Algonquin. 


Omaha-Ponca [277] is a Siouan language of the 
Dhegiha subgroup and — in two shallowly differenti- 
ated dialects — is the heritage language of both the 
Omaha Tribe of Nebraska and of the two Ponca 
tribes, the Northern Ponca of Nebraska and the 
Southern Ponca of Oklahoma. Most of the remaining 
speakers of the Omaha dialect live in Macy and 
Walthill in rural southeastern Nebraska. There are 
fewer than 50 fluent first-language speakers, the 
youngest about 60, and a larger number of semi- 
speakers and second-language learners. The Ponca 
dialect is spoken by about 35 elderly people in the 
Red Rock area of south central Oklahoma. 


Oneida [319] is a Northern Iroquoian language, 
originally spoken by a tribe of the Iroquois Confeder- 
acy (Six Nations) situated east of the Onondagas and 
west of the Mohawks, south of Oneida Lake. Most 
of the modern Oneidas live in two widely separated 
reservation communities, about 3,000 on the Thames 
River near London, Ontario, and 11,000 at Green Bay,
Wisconsin. In addition, a few hundred Oneidas continue
to live in upstate New York, some on a small
tract of land near the town of Oneida, and others 
dispersed into neighbouring white and Indian com- 
munities. There are about 200 fluent speakers of 
Oneida in Ontario and perhaps a dozen in Wisconsin; 
there are no reliable estimates for the New York 
Oneidas. 


Onondaga [317] is a Northern Iroquoian language, 
spoken in upstate New York by the central tribe (the 
‘firekeepers’) of the Iroquois Confederacy (Six 
Nations). Most modern Onondagas (about 1,600) live 
on a reservation in their old homeland, south of the 
city of Syracuse, but there is a smaller community on 
the Six Nations Reserve in southern Ontario. There 
are about a dozen elderly speakers of Onondaga in 
New York, the youngest in their 70s, and about 40 
more are reported in Ontario. 


O'odham (Upper Piman) is the northernmost of the 
languages of the Tepiman subfamily of Uto-Aztecan, 
and the only one spoken in the United States. There 
are between 14,000 and 15,000 fluent speakers of all 
ages in Arizona, and many additional speakers in 
Mexico. Two major dialects are distinguished, Tohono 
O'odham (or Papago) and Akimel O'odham (or 
Pima). Most Akimel O'odham speakers live on the 
Gila River, Salt River, and Ak Chin Reservations, in 
the vicinity of Phoenix. Most Tohono O’odham 
speakers in the United States live on the Papago Res- 
ervation in southern Arizona west of Tucson; there 
are also speakers on the San Xavier and Gila Bend 
Reservations. 


Osage [278] is a Siouan language of the Dhegiha sub- 
group, spoken in the eighteenth century by people 
living along the Osage River in Missouri. Since the 
1870s the Osage Tribe has been settled in the north- 
eastern corner of Oklahoma, around Pawhuska. 
Osage land has proved to be rich in oil, and a number 
of Osage families are quite wealthy. There are ten or 
fewer fluent speakers of Osage, all over 65, in a total 
tribal membership of 11,000, although there are a 
number of semi-speakers and second-language
speakers. 


Otoe-Missouria. See Chiwere 


Ottawa (Odawa) [38] is an Ojibwayan language ori- 
ginally spoken on the north shores of Lake Huron, 
where the most vigorous community of speakers 
today is at the Wikwemikong Reserve on Manitoulin 
Island, Ontario. There perhaps half of the population 
of over 3,000 has some fluency in the language. There 
are also several hundred speakers on Walpole Island 
and other reserves in Ontario, and in Michigan, 
where the largest community is on the Isabella Reser- 
vation. A number of Ottawas were relocated in Okla- 
homa during the nineteenth century, but the lan- 
guage does not survive there. Old Ottawa, the form 
of the language in the seventeenth and eighteenth 
centuries, is extensively documented, and there is a 
variety of Ottawa spoken on Sugar Island and else- 
where on the Upper Peninsula of Michigan that 
retains the unweakened short vowels of the older lan- 
guage. Ottawa has tended to replace Eastern Ojibwe, 
and the eastern varieties of Ottawa and the western 
sub-dialect of Eastern Ojibwe, which share the same 
pattern of vowel-weakening, now constitute an emer- 
gent language, called Nishnaabemwin, which is the 
focus of descriptive work and community language 
programmes. Some individuals and communities use 
forms of this language that intergrade with Central 
Southern Ojibwe. 


Owens Valley Paiute. See Mono 


Pacific Yupik (Alutiiq) [5], spoken on the south coast 
of Alaska from the Alaska Peninsula to Prince Wil- 
liam Sound, is a distinct language within the Yupik 
branch of Eskimo, although closely related to Central 
Alaskan Yup'ik. Speakers call themselves ‘Aleuts’ in 
English, reflecting the early Russian use of Aleuty to 
designate all of the native people of the south coast of 
Alaska; Alutiiq is the Pacific Yupik version of the same 
word, and Sugpiaq has also been used in recent decades.
Two dialects of Pacific Yupik are distinguished,
Koniag in the west (on the upper part of the Alaska 
Peninsula and Kodiak Island) and Chugach in the 
east (on the Kenai Peninsula and Prince William 
Sound). 


Paipai [208] is a Yuman language spoken in several 
small communities in northern Baja California, near 
San Miguel, Santa Catarina, and San Isidoro. There 
are probably fewer than 100 speakers, most of them 
over 50. Paipai is most closely related to the Upland 
Yuman languages of western Arizona, from which it is 
separated, for historical reasons that are not clear, by 
the River Yuman languages (Mohave, Quechan, and 
Maricopa) and Cocopa. 


Palewyami Yokuts [180], formerly spoken along Poso 
Creek in Kern County, California, was the most 
divergent of the languages or emergent languages in 
the Yokuts complex. Its vocabulary is partially docu- 
mented in several wordlists, the longest and most 
reliable collected by Harrington in the 1920s. There 
have been no speakers of Palewyami since the 1930s. 


Panamint (Tümpisa Shoshone) [235] is the Central
Numic language formerly spoken in the region 
between the Sierra Nevada in California and the 
Nevada valleys east of Death Valley. Panamint has two 
main dialects, although intervening varieties show a 
gradation between them. Eastern Panamint includes 
the community around Beatty, Nevada. Western Pan- 
amint includes the communities permanently living 
in Lone Pine and Darwin, California. The Timbisha 
community in Death Valley and Lone Pine is tran- 
sitional between Eastern and Western Panamint. 
There are no monolinguals and no speakers who did 
not also learn English as small children. The ‘pure’ 
Eastern and Western dialects from Lone Pine and 
Beatty are almost extinct. The majority of no more 
than 20 speakers speak the Timbisha variety and all 
are elderly. There are very few, if any, passive speakers 
since there is a strong tendency to marry outside the 
tribe. 


Papago (Tohono O'odham). See O'odham 


Patwin [161] is one of the two languages of the Win- 
tuan family of northern California, and was originally 
spoken in two major dialects or dialect clusters along 
the Sacramento River from Colusa south to the Delta 
and in the foothills of the Coast Range to the west. 
Descendants live on small rancherias at Cortina and 
Colusa, and on the Rumsey (Cache Creek) Rancheria 
west of Woodland. In 1997 at least one speaker of the 
Hill dialect remained. 


Paviotso. See Northern Paiute 


Pawnee [290], a Northern Caddoan language closely 
related to Arikara, is spoken in two dialects, Skiri and 
South Band. The language was spoken in villages 
along the Platte River in central Nebraska until 1874, 
when the tribe was relocated to what is now Pawnee 
County in north-central Oklahoma, where they reside 
today. There are fewer than 10 speakers, all elderly 
and most speaking the South Band dialect, in a total 
tribal population of about 2,500. 


Pend d’Oreille. See Spokane-Kalispel 
Penobscot. See Abenaki 


Pentlatch [123] was a Central Salish language formerly 
spoken on the east coast of Vancouver Island to the 
south of the Island Comox. The last speaker died in 
1940. 


Pericú [221] was spoken in the mountainous region
at the southern tip of the Baja California peninsula
around the mission at San José del Cabo; another dialect
of the same language appears to have been
spoken by the inhabitants of the islands off the east
coast of the peninsula both north and south of La Paz
Bay. Pericú is sometimes hypothesised to belong
to the Guaicurian family, though it is essentially
unattested.


Picuris [253] is a Kiowa-Tanoan language spoken by 
nearly all of the 225 members of the small and isolated
Pueblo of Picuris, 50 miles north of Santa Fe,
New Mexico. Together with Taos, Picuris belongs to 
the Northern Tiwa branch of the Kiowa-Tanoan fam- 
ily, but these languages are not mutually intelligible. 


Pima (Akimel O'odham). See O'odham 


Piro [257] was a Kiowa-Tanoan language, formerly 
spoken in a number of now-abandoned pueblos in 
the Rio Grande Valley south of Isleta. After the 
Pueblo Revolt of 1680 most of the Piros, together 
with some Isletas and other Southern Tiwas, moved 
south to the vicinity of El Paso and established new 
settlements, only one of which, Ysleta del Sur, still 
remains. No speakers of Piro survived into the twen- 
tieth century and documentation is sparse, although 
small amounts continue to be discovered in Spanish 
mission records. Another language, Tompiro, attested 
in many of the same early sources, is probably a dia- 
lect of Piro. 


Plains Cree [15] is the dialect of Cree spoken in a 
large number of communities in the central prairies 
of Alberta and Saskatchewan. Northern and southern 
dialects are distinguished on the basis of phonology 
and morphology. Plains Cree has become the prestige 
dialect of Cree, largely because a considerable amount 
of nineteenth and early twentieth century religious 
literature was published in it, using the syllabic script. 


Plains Miwok [168] was a language of the Eastern 
Miwok branch of the Utian family of California, 
formerly spoken along the lower Cosumnes and 
Moquelumne Rivers to the southeast of Sacramento, 
from Ione to Stockton. Plains Miwok descendants 
now reside at the Ione Rancheria and the Wilton 
Rancheria. 


Pojoaque. See Tewa 


Pomo. See Pomoan and Central Pomo, Eastern 
Pomo, Kashaya, Northern Pomo, Northeastern 
Pomo, Southeastern Pomo, and Southern Pomo 


Pomoan. The seven Pomoan languages, Central 
Pomo, Eastern Pomo, Kashaya, Northern Pomo, 
Northeastern Pomo, Southeastern Pomo, and 
Southern Pomo, were spoken in a compact territory 
north of San Francisco, including the Russian River 
and adjoining coast in Mendocino and Sonoma 
Counties, and most of the region around Clear Lake. 
All are distinct, mutually unintelligible languages. 


Ponca. See Omaha-Ponca 


Potawatomi [40] is an Algonquian language closely 
related to the Ojibwayan dialect complex. It has about 
50 first-language speakers in several widely separated 
communities in the US and Canada. These include 
the Hannahville Indian Community (Upper Pen- 
insula of Michigan), the Pokagon and Huron 
Bands (southern Michigan), the Forest County Band 
(northern Wisconsin), the Prairie Band (eastern 
Kansas), and the Citizen Potawatomi Nation of 
Oklahoma. A few Potawatomi speakers also live 
among the Eastern Ojibwe in Ontario, particularly at 
the Walpole Island Reserve. The largest speech com- 
munities are in the Forest County and Prairie Bands, 
each with about 20 speakers, several conservatively 
fluent. 


Purisimeño [228] is the Chumash language once
spoken at mission La Purísima Concepción in the
lower valley of the Santa Ynez River, near modern 
Lompoc, California, and in the territory to the north 
as far as Santa Maria. 


Quechan (Yuma) [211] is a Yuman language of the 
River subgroup, spoken by 150 to 200 of the 3,000 
members of the Quechan Indian Nation of south- 
eastern California, adjacent to Yuma, Arizona. 


Quileute [120], one of the two languages of the 
Chemakuan family, was spoken on the west coast of 
the Olympic Peninsula in northwest Washington, in 
a territory south of the Makah and north of the 
Quinault. The Quileutes now live on two small reser- 
vations, Quileute (at La Push) and Lower Hoh River, 
which have a combined population of less than 500. 
Although as recently as 1986 ten first-language
speakers of Quileute remained, the last speaker died
in 1999.


Quinault [132], one of the four languages of the 
Tsamosan division of the Salishan family, was spoken 
on the west coast of the Olympic Peninsula in western 
Washington, in a small territory south of the Quileute. 
A substantial part of this territory is now the 
Quinault Reservation, centered on the community of 
Taholah, which the descendants of the Quinault- 
speaking groups share with several other tribes, 
including the closely related Lower Chehalis. The last 
first-language speaker of Quinault died recently. Up 
to half a dozen second-language speakers in their 50s 
have limited knowledge of vocabulary and phrases. 


Quiripi-Unquachog [55] was an Algonquian language 
of the Southern New England group, spoken in west- 
ern Connecticut and central Long Island. Documen- 
tation is sparse (although it includes an Unquachog 
vocabulary collected on Long Island by Thomas Jef- 
ferson). The language was extinct by the early nine- 
teenth century. 


Ramaytush. See Costanoan 


Rio Grande Keresan [259] is a complex of Keresan 
dialects spoken by members of five New Mexico 
Pueblos located near the Rio Grande or Jemez River 
north of Albuquerque: Zia, Santa Ana, San Felipe, 
Santo Domingo, and Cochiti. There is considerable 
local variation, but all varieties are mutually intelli- 
gible, as well as partially intelligible to speakers of 
Acoma-Laguna. Rates of language retention vary 
considerably from community to community, as do 
attitudes towards language preservation efforts. In 
general, outsiders are discouraged from learning the 
language, and this is especially strongly enforced at 
Santo Domingo and San Felipe. At Zia there are 
about 500 speakers out of a total population of 
approximately 800; there are relatively few speakers 
under 20, but at least some children acquire the lan- 
guage. At Santa Ana there are about 385 speakers out 
of a total population of approximately 650. At San 
Felipe an estimated 90 per cent of the population of 
over 2,600 speak the language, and most children 
acquire it. At Santo Domingo nearly all of the 
Pueblo’s population of approximately 2,850 are 
speakers. At Cochiti about half of the total popula- 
tion of approximately 1,200 speak the language, few if 
any of them children. 


Rumsen. See Costanoan 
Saanich. See Northern Straits Salish 


Sahaptin [187] is a Plateau (Plateau Penutian) lan- 
guage of marked dialectal diversity spoken along the 
Columbia River and adjacent Plateau in eastern 
Oregon and Washington. Together with Nez Perce it 
forms the Sahaptian branch of the Plateau family. 
Southern Sahaptin (or ‘River’) varieties were origin- 
ally spoken along the Columbia River from the Dalles 
to the Umatilla River, and included Tenino, Wayam, 
and Umatilla. Northwest Sahaptin varieties were 
spoken in the Yakima River drainage and included 
Klickitat, Taitnapam, Upper Nisqually, Yakima, and 
Kittitas. Northeast Sahaptin varieties were spoken on 
the Columbia River above Southern Sahaptin and 
along the lower Snake River, and included Walla 
Walla, Wanapam, and Palouse. The principal surviv- 
ing Sahaptin communities are on the Warm Springs 
Reservation in northern Oregon (about 50 speakers 
of Tenino); on the Umatilla Reservation in northeast- 
ern Oregon, near Pendleton (25 to 50 speakers of 
Umatilla and Walla Walla); and at Toppenish on the 
Yakama Reservation in south-central Washington 
(about 25 fluent speakers and a larger number of less- 
fluent speakers of Yakima). There is also a small 
Wanapam-speaking community near Priest Rapids 
Dam on the Columbia River. A scattering of Sahaptin 
speakers can also be found among speakers of Okan- 
agan and Nez Perce on the Colville Reservation in 
northeastern Washington. 


Salinan is an extinct language family of the cen- 
tral California coast. It consisted of two closely 
related languages or dialect clusters, a northern one, 
Antoniano [206], primarily associated with Mission
San Antonio de Padua, near Jolon in southern
Monterey County, and a southern one, Migueleño
[207], primarily associated with Mission San Miguel, 
in northern San Luis Obispo County. The last 
speakers of both languages died in the late 1950s or 
early 1960s. 


Samish. See Northern Straits Salish
San Felipe. See Rio Grande Keresan
San Ildefonso. See Tewa
San Juan. See Tewa
Sanpoil-Nespelem. See Okanagan
Santa Ana. See Rio Grande Keresan
Santa Clara. See Tewa
Santee-Sisseton. See Sioux
Santiam. See Kalapuya
Santo Domingo. See Rio Grande Keresan


Sarcee (Tsuut'ina) [95] is the only northern Atha- 
baskan language spoken by a Plains group and, pri- 
marily on phonological grounds, constitutes its own 
subgroup within the Athabaskan family. It is spoken 
fluently by fewer than 10 elderly people on or near 
the Sarcee Reserve, east of Calgary, Alberta, along 
with a small number of semi-speakers and passive 
speakers. 


Sauk-Fox (Meskwaki) [41] is a Central Algonquian 
language, spoken by about 200 members of the 
Meskwaki Tribe in Iowa. It was the heritage language 
also of the historically separate Sauk tribe, whose des- 
cendants today are the Sac and Fox Tribe of central 
Oklahoma and the Nemaha Sauks on the Kansas- 
Nebraska border. The Meskwaki variety is also called 
‘Fox’; it differs from Sauk in minor details of 
pronunciation, grammar, and vocabulary, but the 
variation within Meskwaki alone is almost as great. 
Kickapoo was originally part of the same dialect 
complex, but for historical and social reasons it is 
treated as a separate language. 


Saulteaux [35] (pronounced ‘So-toe’) is an emergent 
language of the Ojibwayan dialect complex, closely 
related to Central Southern Ojibwe. The name 
‘Saulteaux’ refers to the historical origin of the group 
at Sault Ste. Marie and around Lake Superior; they 
moved westward onto the prairies with the expansion 
of the fur trade in the eighteenth and nineteenth centuries.
Today, most varieties of Saulteaux are spoken in
southern Manitoba in the vicinity of Lake Winnipeg, 
including a large urban population in the city of 
Winnipeg, although there are speakers as far west as 
British Columbia. The Saulteaux spoken north and 
east of Lake Winnipeg, and into Northern Ontario, 
shows considerable influence from Severn Ojibwe, 
including use of the syllabic orthography. These 
varieties are sometimes called Northern Ojibwe. Most 
Saulteaux varieties have a number of borrowings 
from Cree and other features that show Cree 
influence. Up to 10,000 people may be speakers of 
Saulteaux. 


Sechelt [124] is a Central Salish language spoken by 
members of the Sechelt Band on the north coast of 
the Strait of Georgia, British Columbia. Out of a total 
population of over 700 there are up to 45 speakers of 
varying degrees of fluency (only 15 fully fluent), the 
youngest in his 50s, with an additional 10 to 15 who 
can carry on limited conversations. 


Sekani [82] is an Athabaskan language, spoken in two 
remote communities in north-central British Colum- 
bia, Ware and Fort McLeod, as well as by some resi- 
dents of the Beaver community of Prophet River and 
the Tahltan community of Iskut. A fair degree of 
mutual intelligibility exists between Sekani and
Beaver, Kaska, and Tahltan. There are about 50 
speakers. It is not spoken by children as a first lan- 
guage. There are few if any first-language speakers 
under 35. 


Seminole is sometimes used loosely to refer to
Oklahoma Seminole Creek, a variety of Creek spoken
in the Seminole Nation of Oklahoma. Florida Semi- 
nole is a closely related variety of Creek that is spoken 
by some members of the Seminole Tribe of Florida, 
although most speak Mikasuki. 


Seneca [315] is a Northern Iroquoian language, ori- 
ginally spoken by the westernmost tribe of the Six 
Nations Iroquois Confederacy in western New York 
and adjacent Pennsylvania. It is now spoken by about 
100 people in three reservation communities in New 
York: Cattaraugus, on Lake Erie; Allegany, in Sala- 
manca; and Tonawanda, near Buffalo. The youngest 
speaker is in his 50s. 


Seri. See §2.5.5, p. 51. 


Serrano [241], a language of the Serrano-Gabrielino 
division of the Takic subfamily of Uto-Aztecan, was 
originally spoken in much of the Mojave Desert 
and the San Bernardino Mountains of southern California.
Serrano descendants live mainly at the San
Manuel Reservation near San Bernardino; a
number of Serranos have also intermarried with the
Cahuillas at the Morongo and Soboba Reservations. 
Only a very few older people are speakers, none com- 
pletely fluent. 


Severn Ojibwe (Oji-Cree) [33] is a well-defined 
regional dialect within the Ojibwe dialect complex, 
spoken in northwestern Ontario in communities on 
Severn River, Winisk River, and Sandy Lake. First- 
language use of Severn Ojibwe is high, but Cree also 
has cultural prominence in these communities. 


Seward Peninsula Inupiaq. See Inupiaq 


Shasta (proper) [192], the only Shastan language to 
survive into the twentieth century, was originally 
spoken in several dialects in California across a rela- 
tively large territory that included Scotts Valley, near 
modern Etna and Ft. Jones; Shasta Valley, around 
Yreka; and the Klamath River between Karuk and 
Klamath-Modoc territory. Many modern Shastas 
have merged their political and cultural identity with 
the Karuk Tribe and consider Karuk their heritage 
language. (Shasta is often used as the cover term for 
all of the Shastan languages, a family of four lan- 
guages.) See also New River Shasta, Okwanuchu, and 
Konomihu. 


Shawnee [44] is an Algonquian language that was 
spoken in the early historical period in the Ohio 
Valley, mainly in the present State of Ohio. Today 
most of the descendants of the Shawnee live in 
Oklahoma, organised in three distinct groups. The 
Absentee Shawnee Tribe, located in and around the 
town of Shawnee, near the Citizen Potawatomi 
Nation, has about 2,000 members. At least 100 
Absentee Shawnees are fluent speakers of the lan- 
guage, constituting the largest Shawnee-speaking 
community. The language remains in use on cere- 
monial occasions, and informal language classes for 
adults are offered on an irregular schedule. The 
Eastern Shawnee Tribe, whose 1,500 members live in 
Ottawa County near the Oklahoma Seneca com- 
munity, has only a few elderly Shawnee speakers. The 
Loyal Shawnee (or ‘Cherokee Shawnee’), a group of 
about 8,000, reside in the Cherokee region of north- 
eastern Oklahoma, mainly around Whiteoak. 


Shoshone (Shoshoni) [236] is a Central Numic lan- 
guage, formerly spoken in a wide band stretching 
from Lida, Nevada northeast through Nevada, Utah, 
Idaho, and Wyoming as far north as Lemhi, Idaho 
and as far east as Wind River, Wyoming. Although 
there are a few large reservations which are exclusively 
(or at least half) Shoshone, there are dozens of 
smaller reservations and communities scattered 
throughout the region. Shoshone was a dialect con- 
tinuum without rigid isoglosses to separate the dia- 
lects, but several major clusters of varieties can be 
identified. Western Shoshone includes the com- 
munities throughout Nevada except for the Gosiute 
and Duck Valley communities. Northern Shoshone 
includes the Duck Valley and Fort Hall communities 
as well as the smaller communities of northern Utah 
and southern Idaho. Eastern Shoshone includes the
Wind River community in Wyoming. Gosiute
includes the Gosiute and Skull Valley communities in 
Utah. The largest speech community is at Fort Hall. 
Altogether, there are around 1,000 actively fluent 
speakers of Shoshone, and perhaps another 1,000 
with more restricted competence. While a few chil- 
dren still learn Shoshone as a first language in the 
Duck Valley and Gosiute communities, the majority 
of speakers are over 50. 


Shuswap (Secwepemc) [139] is an Interior Salish lan- 
guage spoken in east-central British Columbia along 
the Fraser River and its tributaries, upstream from 
Lillooet and Thompson territory and adjoining 
Chilcotin and Carrier territory on the north. The 
modern Shuswaps are organised into 17 bands, with 
the largest settlement at Kamloops on the Thompson 
River. There are between 200 and 500 fluent first- 
language speakers, most of them over 50, in a total 
population of over 6,000. 


Sioux [274] is the cover term for the varieties of the 
Dakotan dialect complex other than Assiniboine and 
Stoney. Three Sioux dialect groups can be dis- 
tinguished, from east to west: The Santee-Sisseton 
(Dakota) dialect is spoken in at least 15 widely dis- 
persed reservation communities in Minnesota, Mani- 
toba, and Saskatchewan, and in the eastern parts of 
Nebraska and the Dakotas. The Yankton-Yanktonai 
dialect is primarily spoken on the Yankton and Crow 
Creek Reservations in South Dakota, and on the 
northern part of the Standing Rock Reservation in 
North Dakota, although it also has speakers on the 
Devils Lake and Fort Peck Reservations in North 
Dakota and on a few reserves in Saskatchewan. Teton 
(Lakota) is the dialect of the Cheyenne River, Lower 
Brule, Pine Ridge, Rosebud, and Sisseton Reservations 
of South Dakota, as well as of the southern part of the 
Standing Rock Reservation in South Dakota and of 
the Wood Mountain Reserve in Saskatchewan. There 
are also substantial off-reservation communities of 
Sioux speakers, particularly in Rapid City, Min- 
neapolis, and other urban centres in the upper Mid- 
west. Together, there are nearly 25,000 first-language 
speakers of all Sioux dialects in a total population of 
103,000. Of these an estimated 4,755 reside in 
Canada. 


Sirenik [1] — also called Old Sirenik to differentiate it 
from the Sirenik dialect of Central Siberian Yupik — is 
a recently extinct language of the Yupik branch of 
Eskimo, formerly spoken in the village of Sireniki, 
Provideniya County, Chukchi Autonomous District, 
Russia. Quite divergent from the other Yupik lan- 
guages, Sirenik shows influence from Chukchi. It is 
sometimes considered a third branch of Eskimo, 
alongside Inuit and Yupik. At the beginning of the 
twentieth century there were 130 speakers, dwindling 
to approximately 30 by the early 1950s. Beginning in 
the 1950s, the indigenous population of Sireniki 
shifted first to Central Siberian Yupik, the language of 
more numerous forced immigrants, and then largely 
to Russian. The last speaker died in 1997. 


Siuslaw [153] was a language formerly spoken in two 
closely related local dialects, Siuslaw (proper) and 
Lower Umpqua, by adjacent tribes in a small territory 
on the central coast of Oregon. It is an isolate, often 
associated with the controversial Penutian stock. 
Although some Siuslaw people were removed to 
reservations between 1855 and 1875 most remained 
in their traditional territory, where more than 100 
descendants still reside. The Confederated Tribes of 
Coos, Lower Umpqua, and Siuslaw was recognised in 
1984, with tribal headquarters at Coos Bay. 


Sküüxs. See Coast Tsimshian


Slave [84], or South Slavey, is an emergent Athabaskan 
language within the Slavey dialect area of the Dene 
complex. Slave is spoken as a first language by about 
3,260 people in a number of communities in the 
Northwest Territories and in adjacent parts of north- 
ern British Columbia and Alberta. In the Northwest 
Territories, where it is one of the official languages, 
Slave has 1,260 speakers, about half of whom use it in 
the home. The principal communities include: Fort
Liard (310 speakers out of a total population of 510), 
Fort Providence (280 out of 750 total), Fort Simpson 
(290 out of 1,255 total), Hay River Dene (85 out of
250 total), Jean Marie River (30 out of 55 total), 
Nahanni Butte (50 out of 75 total), and Trout Lake 
(55 out of 65 total). It may also be spoken by some of 
the 100 Slavey speakers at Fort Wrigley, most of 
whom speak Mountain. There are also up to 60 
speakers of Slave in the town of Hay River. In 
northern British Columbia there are reported to be 
approximately 500 Slave speakers, with the principal 
community at Fort Nelson. In Alberta Slave is spoken 
by about 1,500 people at Meander River, Chateh Lake 
(Assumption) and at a few other places on the upper 
Hay River. 


Slavey is the general term for a group of Athabaskan 
dialects or emergent languages of the Dene complex, 
including Bearlake and Hare (together constituting 
North Slavey), Mountain, and Slave or South Slavey. 


Sm'algyax. See Coast Tsimshian 


Solano [267] was spoken early in the eighteenth 
century at the Franciscan mission of San Francisco 
Solano, at Eagle Pass on the Rio Grande River in 
southern Texas. It is attested only in a 21-word 
vocabulary in the mission records. Solano is con- 
sidered to be an isolate. 


South Slavey. See Slave 


Southeastern Pomo [199], a language of the Pomoan 
family, was spoken to the east of Clear Lake around 
East Lake and Lower Lake, and is the heritage lan- 
guage of the Elem Indian Colony at Sulphur Bank 
and the Lower Lake Rancheria. The last fluent speaker 
died in the 1990s. 


Southern Algonquin (Nipissing) [37] is an emergent 
language within the Ojibwayan dialect complex, 
primarily spoken at the River Desert Reserve, on the 
Gatineau River at Maniwaki, Quebec. Although 
speakers identify themselves and their language as 
Algonquin, Southern Algonquin is distinctly different 
from Northern Algonquin, as well as from the 
(extinct) Old Algonquin that was spoken in the 
Ottawa Valley in the seventeenth century. Dialectally 
Southern Algonquin is the eastern component of 
Eastern Ojibwe. 


Southern Paiute. See Ute-Chemehuevi 


Southern Pomo [203], a language of the Pomoan 
family, is spoken by very few elderly people along the 
lower course of the Russian River in Sonoma County, 
California. 


Southern Sierra Miwok. See Eastern Miwok 


Southern Tiwa [254] is a Kiowa-Tanoan language 
spoken by members of the Pueblos of Sandia and 
Isleta, on the Rio Grande near Albuquerque, New 
Mexico. At Sandia Pueblo, 15 miles north of Albu- 
querque, only middle-aged and elderly people are 
fully fluent, although some younger people use the 
language and at least a few children still acquire it. In 
all there are about 100 speakers in a total population 
of about 500. At Isleta Pueblo, 15 miles south of 
Albuquerque, there are approximately 1,500 speakers 
in a membership of about 4,000. 


Southern Tsimshian. See Maritime Tsimshian 


Southern Tutchone [76] is an Athabaskan language 
(closely related to Northern Tutchone) spoken in the 
southwestern Yukon in settlements at Aishihik, Bur- 
wash Landing, Champagne, Haines Junction, Kloo 
Lake, Klukshu, and Lake Laberge, as well as in the city 
of Whitehorse. There are about 200 speakers out of a 
total population of 1,400. 


Spokane-Kalispel [141] is an Interior Salish language, 
spoken in Washington, Idaho, and Montana in three 
major dialects. (1) The Spokane dialect is spoken on 
the Spokane Reservation in northeastern Washington. 
It has only two fluent first-language speakers, a mar- 
ried couple in their 70s. (2) The Flathead dialect (also 
known as Montana Salish, in this context referring to 
a specific language and not the entire language fam- 
ily) is spoken on the Flathead Reservation in western 
Montana by members of the Confederated Salish and 
Kootenai Tribes. There are about 60 fluent first- 
language speakers of Flathead, one in his 40s, another 
about 55, all others 65 or older. A sub-dialect is said 
to characterise Flathead speakers of Pend d’Oreille 
descent, but no separate statistics are available. (3) 
The Kalispel dialect primarily survives on the Kalispel 
Reservation in northeastern Washington, where a 
handful of fluent speakers remain. There are also a 
few speakers living on the Spokane Reservation and 
in the nearby community of Chewelah, but informa- 
tion on them is scant. 


Spokane. See Spokane-Kalispel 


Squamish (Sḵwx̱wú7mesh sníchim) [125] is the 
Central Salish language of the Squamish reserves on 
Howe Sound and Burrard Inlet, British Columbia, on 
the Strait of Georgia immediately north of the city of 
Vancouver. There are fewer than 20 first-language 
speakers, the youngest in their late 60s, out of a total 
population of 2,000. 


St'at'imcets. See Lillooet 


St. Lawrence Island Yupik. See Central Siberian 
Yupik 


Stoney [276] is an emergent language in the Dakotan 
dialect complex of Siouan, historically related to — but 
clearly distinct from — Assiniboine. It is spoken on 
five reserves in Alberta located along the eastern base 
of the Rocky Mountains west of Calgary and Edmon- 
ton: Alexis, Paul, Bighorn, Morley, and Eden Valley. 
The principal community is Morley, midway between 
Calgary and Banff. The language is vigorous, with 
between 1,000 and 1,500 speakers in a total popula- 
tion of 3,200. 


Susquehannock [318] was a Northern Iroquoian lan- 
guage, spoken in the seventeenth and eighteenth cen- 
tury along the Susquehanna River in southeastern 
Pennsylvania and northeastern Maryland. It is known 
only from a list of about 80 words appended to a 
Delaware catechism compiled by a Swedish Lutheran 
missionary in 1696. 


Swampy Cree. See Western Swampy Cree and 
Eastern Swampy Cree 


Tagish [79] is an Athabaskan language, closely related 
to Tahltan and Kaska, that was spoken until the mid- 
nineteenth century around the lakes at the head of 
the Yukon River south of Whitehorse. In the later 
nineteenth century the Tagish community shifted to 
Tlingit and by the mid-twentieth century only a 
handful of older people remembered Tagish from 
their childhood. As of 2001 there remains one semi- 
fluent speaker and another elderly speaker who is 
deaf. 


Tahltan [80] is an Athabaskan language of north- 
western British Columbia, closely related to Kaska 
(with which it is easily mutually intelligible) and to 
(nearly extinct) Tagish in the southern Yukon. It is 
principally the language of the remote community of 
Telegraph Creek, on the upper Stikine River, where 
there are five fluent speakers and perhaps 15 passive 
speakers out of a total population of 100. It is also 
spoken in the mixed Sekani-Tahltan community of 
Iskut, at Kinaskan Lake. No children are reported to 
speak or understand it. 


Takelma [156] was a language formerly spoken in the 
valley of the upper Rogue River in southwestern 
Oregon. The Takelmas were displaced from their 
homeland by the Rogue River War of 1855-6 and the 
survivors eventually settled on the Grand Ronde and 
Siletz Reservations in northwestern Oregon. There 
may have been several distinct local dialects, but only 
one is reflected in most of the documentation, the 
largest part of which comes from Sapir’s work in 1906 
with a single speaker. The language was extinct by the 
1940s. 


Tamyen. See Costanoan 


Tanacross [73] is an Athabaskan language belonging 
to the Tanana series, spoken at Healy Lake, Dot Lake, 
and Tanacross on the middle Tanana River of central 
Alaska. The total population is about 220, of whom 
about 60 speak the language. 


Tanaina (Dena'ina) [67] is the Athabaskan language 
of the Cook Inlet area of southern Alaska. Distinct 
local dialects are associated with the Kenai Peninsula, 
the Upper Inlet area above Anchorage, and coastal 
and inland areas of the west side of Cook Inlet. Of the 
total population of about 900 people, about 75 speak 
the language. 


Tanana is a series of Athabaskan varieties spoken in 
east-central Alaska and adjacent Canada, from the 
upper Kuskokwim River to the headwaters of the 
Tanana River. Four languages are usually distin- 
guished in the series: Upper Kuskokwim, Lower Tan- 
ana, Tanacross, and Upper Tanana. Speakers of 
adjacent languages can usually understand one 
another, although with some difficulty. None of the 
varieties of Tanana is flourishing; in most com- 
munities fewer than a third of the population are 
speakers, few or none of them children. 


Tano. See Tewa 


Taos [252] is a Kiowa-Tanoan language spoken by 
members of the Pueblo of Taos, 70 miles north of 
Santa Fe, New Mexico. Together with Picuris and 
Southern Tiwa, Taos belongs to the Tiwa branch of 
the Kiowa-Tanoan family; although close, these lan- 
guages are not mutually intelligible. There are 
approximately 800 speakers in a Pueblo population of 
1,600. 


Tarahumara. See §2.8, p. 54 
Tawasa. See Timucua 


Tataviam was apparently a Uto-Aztecan language of 
the Takic subfamily, once spoken in the Santa Clara 
Valley of southern California, but very poorly 
attested. 


Tesuque. See Tewa 
Teton. See Sioux 


Tewa [255, 256] is a Kiowa-Tanoan language spoken 
in distinct local varieties at seven Pueblos in northern 
New Mexico and Arizona. These include Santa Clara, 
San Juan, San Ildefonso, Nambe, Tesuque, and 
Pojoaque in New Mexico, and in Arizona the village 
of Hano on the First Mesa at Hopi. There are 
approximately 1,200 speakers of Tewa in the New 
Mexico Pueblos out of a total enrolment of approxi- 
mately 4,500. There are about 300 additional speakers 
at Hano, where at least some children are acquiring 
the dialect historically called Tano. At San Juan, the 
largest of the New Mexico Tewa Pueblos, as few as 30 
fully fluent speakers remain in a population of about 
2,000; most adults are semi-speakers, and no children 
are acquiring Tewa. 


Thompson [138] is an Interior Salish language spoken 
along the Fraser River Canyon in southwest British 
Columbia and along the adjacent Thompson and 
Nicola Rivers. A shallow dialect distinction exists 
between Upper Thompson varieties in the northern 
part of Thompson territory and Lower Thompson 
varieties in the south. The principal modern settle- 
ments are at Lytton on the Fraser River and at Lower 
Nicola and Merritt in Nicola Valley. There are about 
150 first-language speakers in a total population of 
over 5,000. 


Tillamook [136], a Salishan language usually classified 
as its own subgroup, was formerly spoken in several 
local varieties along the northwest coast of Oregon, 
from the Nehalem River to the Siletz River. The last 
speaker died in 1970. 


Timucua [308] was a large complex of dialects spoken 
in the sixteenth and seventeenth century over much 
of the northern half of the Florida peninsula, as well 
as in parts of Georgia and Alabama. An isolated and 
divergent dialect, Tawasa, spoken in east-central 
Alabama and attested only in a short vocabulary from 
1708, is probably best treated as a closely related 
language. 


Timbisha. See Panamint 
Tipai. See Diegueño 


Tlingit [64] is the traditional language of the Tlingit 
people on the southeastern coast of Alaska from 
Yakutat to Ketchikan, and constitutes a separate 
branch of the Na-Dene family alongside Eyak and the 
Athabaskan languages. The total Tlingit population in 
Alaska (organised as the Sealaska Regional Corpor- 
ation, divided into 16 village communities) is about 
10,000, of whom perhaps as many as 500, none of 
them children, are fluent speakers of the language. An 
additional 185 Inland Tlingit speakers live in Canada 
in several communities in the southern Yukon and 
northern British Columbia. The only other well- 
marked local variety is the phonologically archaic 
Tongass dialect, formerly spoken in the Ketchikan 
area but now nearly extinct. 


Tohono O'odham. See O'odham 
Tolkapaya (Yavapai). See Upland Yuman 


Tolowa [100], the only surviving language of the 
Oregon Athabaskan subgroup, is spoken by a few 
individuals at the Smith River Rancheria near 
Crescent City, California. It is nearly extinct as a first 
language (one elderly semi-speaker survives in 2001) 
but there is one fully fluent second-language speaker 
in his 40s. 


Tonkawa [288] was spoken in the eighteenth century at 
San Gabriel Mission, between Austin and Waco in 
east-central Texas, and apparently by some other 
groups in that region. After a tangled relationship 
with whites and other Indians in the first half of the 
nineteenth century, the Tonkawas were removed to 
Oklahoma, ultimately to a reservation in Kay County. 
The tribe currently has a membership of fewer than 
200, none of whom speak the language. Tonkawa is 
attested in three nineteenth-century vocabularies, but 
was mainly documented by Hoijer’s extensive field- 
work with one of the last fluent speakers in 1928-9. 
Tonkawa is considered an isolate. 


Tsetsaut [94] was an Athabaskan language formerly 
spoken on Portland Canal, on the north coast of 
British Columbia adjacent to Nass-Gitksan territory. 
It is known exclusively from a single vocabulary col- 
lected by Boas in 1894, but the data are sufficient to 
show that Tsetsaut formed its own distinct subgroup 
within the Athabaskan family. The language has been 
extinct for over a century and the Tsetsaut people 
have been absorbed into neighbouring tribes. 


Tsimshian is a name commonly used for at least two 
different language units. On the one hand it is the 
term associated in much of the older literature with 
the entire Tsimshianic language family. More nar- 
rowly, it is often used as a synonym for Coast Tsim- 
shian (Sm'algyax), an emergent language in the Coast 
Tsimshian area of Tsimshianic. 


Tsuut'ina. See Sarcee 
Tualatin-Yamhill. See Kalapuya 


Tubatulabal [240], a Uto-Aztecan language that con- 
stitutes an independent branch of the family, is the 
heritage language of the unrecognised Tubatulabal 
tribe of Kern County, California. There are approxi- 
mately 900 Tubatulabals, about half of them still 
living in the tribe’s traditional territory in the Kern 
River Valley of the southern Sierra Nevada. Only a 
handful of elderly people — fewer than five — are 
speakers of the language. 


Tukudh. See Gwich'in 
Tule-Kaweah. See Yokuts 
Tümpisa Shoshone. See Panamint 


Tunica [298], a heritage language of the Tunica-Biloxi 
Tribe of Marksville, Avoyelles Parish, Louisiana, was 
spoken in the seventeenth and eighteenth century 
along the Mississippi River near Vicksburg, Missis- 
sippi. Some materials on Tunica were collected in the 
late nineteenth and early twentieth century, but the 
principal documentation is by Haas, who collected 
extensive data between 1933 and 1939 and published 
a grammar, a grammatical sketch, a dictionary, and a 
collection of texts. The last speaker, with whom Haas 
worked, died after 1950. Tunica is an isolate. Tunica 
was also spoken by the neighbouring Tiou, and other 
dialects or closely related languages were probably 
spoken by the Koroa, Yazoo, and Grigra; no data from 
these varieties is known. 


Tuscarora [311], a Northern Iroquoian language, was 
spoken until the early eighteenth century in eastern 
North Carolina. After 1711-3 many Tuscaroras 
moved north to join the League of the Iroquois in 
New York, settling near the Seneca. After the Ameri- 
can Revolution part of the group fled to Canada, join- 
ing other Iroquois on the Six Nations Reserve in 
Ontario. Only two or three speakers of Tuscarora 
remain, all over 80. 


Tutelo [285] was a Siouan language of the Ohio Valley 
(or Southeastern) branch that was spoken by the 
Tutelos, Saponis, and Occaneechis, who lived until the 
1670s in the Roanoke-Staunton Valley of southern 
Virginia, and apparently also by the Monyton on the 
Kanawha River in West Virginia. After a series of 
conflicts and removals, the remnant of the Virginia 
groups moved north under the name Tutelo in the 
mid-eighteenth century and became affiliated with 
the Iroquois, eventually settling on the Six Nations 
Reserve in Ontario with the Cayuga. The language is 
attested in the early eighteenth century by a vocabu- 
lary and some translated placenames. There were 
speakers until the 1980s. 


Tututni [98] was a language of the Oregon Atha- 
baskan subgroup formerly spoken along the south- 
west Oregon coast from the Coquille River to a few 
miles north of the California border. There were 
several distinct local varieties, the best attested of 
which are Coquille, Euchre Creek, and Chasta Costa. 
After the Rogue River War of 1855-6 the Tututnis 
were forcibly resettled in northern Oregon on the 
Grand Ronde and Siletz Reservations, where Tututni 
continued to be spoken for several generations. The 
last fluent speaker died in 1983. 


Twana [131] was a Central Salish language formerly 
spoken along Hood Canal on the east side of the 
Olympic Peninsula in northwestern Washington. The 
Twana were concentrated on the Skokomish Reserva- 
tion in 1859, where dialect differences were lost but 
the language and culture survived for several gener- 
ations. The last fluent speaker died in 1980. 


Uchiti (Huchiti) is an essentially unattested language 
once spoken on the east coast of the peninsula of Baja 
California, around La Paz and on the west coast north 
of the mission of Todos Santos. Uchiti (Huchiti) is 
sometimes thought to be a Guaicurian language, 
though it may well have been only a variant of Guai- 
cura (Waikuri) itself. 


Unami (Oklahoma Delaware, Lenape) [58] was an 
Eastern Algonquian language originally spoken in 
several dialects in southern and central New Jersey, 
eastern Pennsylvania, and northern Delaware. With 
the westward migration of the Delawares, Southern 
Unami came to be spoken in Ohio, Indiana, Kansas, 
and elsewhere, ending up after the Civil War in 
Oklahoma, where it was the heritage language of the 
Delaware Tribe, near Bartlesville and Dewey, Okla- 
homa, and the Delaware Tribe of Western Oklahoma, 
near Anadarko. The last speaker died in 2002. The 
Northern Unami dialect was extensively documented 
by German-speaking Moravian missionaries in the 
late eighteenth century and survived into the twen- 
tieth century among the Canadian Munsees. The 
Unalachtigo dialect, originally from the Trenton area, 
appears also to be reflected in the Moravians’ 
materials. 


Upland Yuman [209] is a Yuman language, closely 
related to Paipai, spoken by three historically and cul- 
turally distinct groups in western Arizona, the Huala- 
pai, the Havasupai, and the Yavapai, the last trad- 
itionally divided into four regional subtribes. Each 
community speaks a distinct variety, with the Yavapai 
varieties forming a well-defined dialect, although all 
varieties are mutually intelligible with little difficulty. 
Hualapai (Walapai) is spoken at the Hualapai Indian 
Reservation in Peach Springs by approximately 1,000 
people, slightly more than half the total population. 
Speakers are of all ages and at least some children 
continue to acquire Walapai as their first language. 
Havasupai is spoken by more than 500 people of all 
ages, nearly the entire population of the village of 
Supai in Havasu Canyon, at the western end of the 
Grand Canyon. Yavapai is spoken in four small reser- 
vation communities, Prescott, Fort McDowell, Camp 
Verde, and Clarkdale. Local varieties, however, reflect 
pre-reservation subtribes and include Yavepe, Tolka- 
paya, Keweevkapaya, and Wipukpaya, speakers of 
which are found in all four communities. None of the 
Yavapai varieties are thriving, and most of the esti- 
mated 100 to 150 speakers (out of a total population 
of about 1,000) are middle-aged or older. 


Upper Chehalis, one of the four languages of the 
Tsamosan division of the Salishan family, was for- 
merly spoken on the upper Chehalis River in 
southwestern Washington. Most Upper Chehalis des- 
cendants live on the Chehalis Reservation, west of 
Centralia, which they share with the descendants of 
several other groups. The last active speaker died at 
the age of 90 in 2001. 


Upper Kuskokwim (or Kolchan) [71] is an Athabas- 
kan language belonging to the Tanana series, spoken 
in the villages of Nikolai, Telida, and McGrath in the 
Upper Kuskokwim River drainage of central Alaska. 
Of a total population of about 160 people, about 40 
still speak the language. 


Upper Piman. See O'odham 


Upper Umpqua [97] was a language of the Oregon 
Athabaskan subgroup formerly spoken in the upper 
drainage of the Umpqua River in southwestern 
Oregon. After the Rogue River War of 1855-6 the 
Upper Umpquas were forcibly resettled in northern 
Oregon on the Grand Ronde and Siletz Reservations, 
and the language rapidly fell out of use. The last 
speaker died in the 1940s. 


Upper Tanana [74] is an Athabaskan language belong- 
ing to the Tanana series, spoken mainly in the Alaska 
villages of Northway, Tetlin, and Tok, but also in the 
Beaver Creek area of the Yukon. Each of these com- 
munities has a different local variety. The Alaskan 
population is about 300, of whom at most 100 speak 
the language. 


Ute-Chemehuevi [239] is a dialect chain within 
Southern Numic that extends from central Colorado 
westward across Utah and southern Nevada to the 
eastern Mojave Desert in California. There are three 
major regional varieties, all mutually intelligible. Ute 
(Colorado and central Utah) is spoken by about 900 
people in and around three reservation communities: 
(1) Southern Ute (Ignacio, Colorado), where there are 
about 100 first-language speakers, the youngest about 
55, out of a total population of 1,300. (2) Ute Moun- 
tain Ute (Towaoc, Colorado), where there are about 500 
first-language speakers, the youngest about 25, out of 
a total population of 1,500. (3) Uintah & Ouray 
(Northern) Ute (Ft. Duchesne, Utah), where there are 
about 300 first-language speakers, the youngest about 
45, out of a total population of 2,000. Southern 
Paiute (southern Utah and Nevada) is spoken in ten 
widely separated communities in Utah, Arizona, and 
Nevada. The five Utah communities constitute the 
Paiute Tribe of Utah and have a total population of 
about 600. The San Juan Paiute Tribe is settled on the 
Navajo Reservation in Utah and Arizona and has a 
population of 220. The Kaibab Paiute Tribe, with a 
reservation north of the Grand Canyon, has a popula- 
tion of 212. The three southern Nevada tribes 
(Moapa, Las Vegas, and Pahrump) have a combined 
population of over 400. The language is spoken to a 
varying extent in all communities, but only in the San 
Juan tribe are children still acquiring it as their first 
language. Chemehuevi (southern California) is 
spoken on the Colorado River Indian Reservation at 
Parker, Arizona (which the Chemehuevis share with 
Mohaves, Navajos, and Hopis), and on the neighbour- 
ing Chemehuevi Reservation in California. There 
are fewer than 20 first-language speakers, with the 
youngest nearly 40. 


Ute. See Ute-Chemehuevi 


Ventureño [231], a Chumash language, was spoken at 
San Buenaventura mission and in most of Ventura 
County. There were at least six different dialects. 


Virginia Algonquian [60] is the name given to the 
Algonquian language attested in two vocabularies 
collected at Jamestown between 1607 and 1611. It 
presumably was the language of the Powhatan con- 
federacy in tidewater Virginia. There may have been 
speakers as late as 1790, but no further documenta- 
tion exists. 


Waikuri. See Guaicurian 


Wappo [225] is a Yukian language, a small family of 
the Coast Range of northern California. Wappo was 
originally spoken from Napa Valley to Clear Lake, in a 
compact, mostly mountainous, territory hemmed in 
by speakers of the Pomo languages on the west, Lake 
Miwok on the north, and Patwin on the east and 
south. During the nineteenth century most of the 
surviving Wappos joined these surrounding groups. 
Wappo appears recently to have become extinct. 


Wasco-Wishram. See Kiksht 


Washo [223], an isolate, often associated with the 
controversial Hokan stock, is spoken by members of 
the Washoe Tribe of Nevada and California, whose 
traditional homeland centres on Lake Tahoe in the 
High Sierra. There are several dozen fluent first- 
language speakers, all middle-aged or elderly, in a 
total population of over 1,500, divided among four 
small reservations in both Nevada and California. 


Western Abenaki [50] was spoken in New Hampshire 
and adjacent areas of Vermont. The largest modern 
community is in Quebec, at the Odanak Reserve on 
the St. Francois River, where a handful of elderly flu- 
ent speakers survive. In addition about 2,000 people 
of Western Abenaki descent live in Vermont around 
the northern end of Lake Champlain. Attempts are 
underway there to revive the language and teach it in 
the Vermont school system. Western Abenaki was 
extensively documented by a series of village school- 
masters from the Odanak community in the nine- 
teenth and twentieth centuries, including Joseph 
Laurent, and by Gordon Day, who published a 
dictionary. 


Western Apache [106] is an emergent language within 
the Southern Athabaskan dialect complex, spoken as 
a first language by up to 14,000 people in several res- 
ervation communities in southeastern Arizona. Of 
these, about 6,000 live on the San Carlos Reservation 
and 7,000 on the Ft. Apache Reservation (White 
Mountain Apache Tribe), making up about 65 per 
cent of the population of those two tribes. Much 
smaller numbers of speakers are found at the Tonto 
Reservation at Payson, at the Camp Verde Reservation 
(shared with the Yavapais), and at the Ft. McDowell 
Reservation near Scottsdale (shared with the Yavapais 
and the Mojaves). A few children at San Carlos and 
Ft. Apache speak Western Apache as their first lan- 
guage, but most children and young adults are passive 
speakers or semi-speakers. 


Western Canadian Inuktitut. See Inuktitut 
Western Eskimo. See Yupik 


Western Miwok, one of the two branches of the 
Miwok languages of north-central California, consists 
of two languages: Coast Miwok [165], originally 
spoken in Marin and Sonoma counties, is extinct. 
Lake Miwok [166] has two or three semi-speakers, 
none of them actively using the language, who live at 
the Middletown Rancheria, on ancestral Lake Miwok 
territory to the southeast of Clear Lake. 


Western Naskapi [23] is the first language of nearly 
800 people on the Kawawachikamach Reserve, near 
Schefferville, Quebec. The community moved from 
Fort Chimo in 1956. The dialect is most similar to 
Eastern Naskapi, but it also shares some features with 
Northern East Cree. 


Western Swampy Cree [18] (Algonquian language of 
the Cree-Montagnais branch) is spoken in north- 
eastern Manitoba at York Factory, Fox Lake, Shamat- 
tawa, Churchill, and Norway House. The dialect of 
Fort Severn has been influenced by Eastern Swampy 
and by Ojibwe. Most band members are speakers. 


Wetsuwet'en. See Babine 


Wichita [292] is a Northern Caddoan language, 
spoken by the Wichita Tribe, formerly a confederacy 
of autonomous bands that until the late nineteenth 
century lived in an area extending from central Okla- 
homa through central Texas. Today they live in Caddo 
County in central Oklahoma, primarily in and 
around the town of Anadarko, an area in which the 
Caddo, Delaware, and Kiowa also live. There are fewer 
than ten elderly speakers of the language in a total 
tribal population of approximately 2,100. 


Winnebago (Ho-Chunk, Hochank) [282] is a Siouan 
language of the Chiwere-Winnebago subgroup, 
spoken in central Wisconsin. Winnebago has over 250 
fluent first-language speakers, divided between the 
Winnebago Tribe of northeastern Nebraska and the 
Ho-Chunk Nation of central Wisconsin. The number 
may be higher; 2,000 speakers were reported by reli- 
able sources in 1980. 


Wintu-Nomlaki [160] is one of the two languages of 
the Wintuan family of northern California, and was 
originally spoken in the northern half of the Sacra- 
mento Valley, on the upper Sacramento River below 
Mt. Shasta, and in the upper drainage of the Trinity 
River and on Hayfork Creek in Trinity County. There 
were two major dialects, Nomlaki, spoken along the 
Sacramento River south of Red Bluff, and Wintu, 
spoken elsewhere in the territory. There appears to 
have been no significant difference between the 
variety of Wintu spoken in the Trinity-Hayfork area 
and the Sacramento Valley variety. At least one fluent, 
traditional speaker of the Wintu dialect remains, 
although elderly, as well as several semi-speakers. 


Wipukpaya (Yavapai). See Upland Yuman 


Wiyot [62] was an Algic language of the Humboldt Bay 
region of northwestern California, distantly related to 
the Algonquian languages as well as to adjacent 
Yurok. It is the heritage language of the Wiyot tribe, 
whose principal modern community is at Table Bluff 
Rancheria near Ferndale. The last speaker died in 
1961. 


Woccon [287] was one of the two languages in the 
Catawban branch of the Siouan-Catawba family. It 
was spoken in the early eighteenth century in eastern 
North Carolina, and is attested only in a vocabulary 
of 143 words that was printed in 1709. The relation- 
ship between Woccon and Catawba (proper) is not 
close. 


Woods Cree [17] is the variety of Cree spoken in 
communities in the forested interior of northwestern 
Manitoba and east-central Saskatchewan. 


Wyandot. See Huron-Wyandot 


Yana [205] was spoken in several distinct dialects 
in the rugged country west of Mt. Lassen in the 
northern Sacramento Valley. It is an isolate, often 
associated with the controversial Hokan stock. There 
were three principal dialects: Northern Yana, spoken 
in a small area around Montgomery Creek; Central 
Yana, on Cow Creek; and Southern Yana, spoken in 
the southern two-thirds of the territory. Ishi, the 
well-known last survivor of the Yahi group, spoke 
the Southern dialect. At least one speaker of Yana 
survived until about 1940. 


Yankton-Yanktonai. See Sioux 


Yaqui (Cahita, Yoeme) [249] is a language of the Tara- 
cahitan subfamily of Uto-Aztecan, primarily spoken 
along the lower Yaqui River in southern Sonora, 
Mexico. In the early twentieth century many Yaquis 
fled to the United States to escape political persecu- 
tion and settled in southern Arizona. About 8,000 
Yaquis are now members of the Yaqui Tribe of Ariz- 
ona, which since 1964 has had a reservation at New 
Pascua, southwest of Tucson. Although older Arizona 
Yaquis speak their language fluently or are bilingual 
in Yaqui and Spanish, younger tribal members have 
largely switched to English. 


Yaquina. See Alsea 
Yavapai. See Upland Yuman 
Yavepe (Yavapai). See Upland Yuman 
Yoeme. See Yaqui 


Yokuts is a large complex of dialects, spoken abori- 
ginally in the San Joaquin Valley of south-central 
California and the foothills of the Sierra Nevada to 
the east. There were over 40 local varieties of Yokuts, 
each associated with a small independent community, 
often only a single village or close-knit group of 
villages. Although the classification is somewhat 
arbitrary, six emergent languages are usually dis- 
tinguished, three of which (Palewyami [180], Buena 
Vista [181], and Gashowu [184]) are extinct. Still 
spoken are: Tule-Kaweah [182], a cluster of dialects 
originally spoken in the Sierra Nevada foothills along 
the Tule and Kaweah Rivers, east of Porterville. Fewer 
than ten speakers of the Wukchumne (Wikchamni) 
dialect of Tule-Kaweah remain, most of them on the 
Tule River Reservation. Kings River [183], a cluster of 
dialects originally spoken in the Sierra Nevada foot- 
hills east of Fresno. Half a dozen elderly speakers or 
semi-speakers of the Choinumne (Choynimni) dia- 
lect live in scattered locations in and around their 
traditional homeland. Valley Yokuts [185], a large 
complex of shallowly differentiated dialects spoken 
mainly in the San Joaquin Valley. There are speakers 
of at least three Valley Yokuts dialects, including up to 
25 fluent and semi-fluent speakers of Yowlumne 
(Yawelmani) on the Tule River Reservation, a few 
semi-speakers of Chukchansi at the Picayune and 
Table Mountain Rancherias in the foothills northeast 
of Fresno, and a few speakers of Tachi at the Santa 
Rosa Rancheria near Lemoore. 


Yoncalla. See Kalapuya 


Yuchi (Euchee) [307] is an isolate, possibly distantly 
related to the Siouan languages. Originally an inde- 
pendent tribe located in central Tennessee, the Yuchis 
have been politically associated with the Muscogee 
Creeks since the early nineteenth century and most of 
the 1,500 members of the group live among the 
Creeks in northeastern Oklahoma, near Sapulpa, 
Hectorsville, and Bristow. The language is still spoken 
fluently by five to seven elderly people, only one 
younger than 75. 


Yuki [224] was a complex of distinct but closely related 
dialects that were spoken in northern Mendocino and 
Lake Counties, California, from Round Valley to the 
coast. Together with Wappo, the Yuki dialects form 
the Yukian family. The major dialects were Coast 
Yuki, spoken along a short stretch of the rugged coast 
between Fort Bragg and Rockport; Huchnom, spoken 
along the South Eel River north of Willits; and Yuki 
(proper), or Round Valley Yuki, spoken in a number 
of village communities in and around Round Valley. 
The last fluent speaker died around 1990. 


Yupik (Western Eskimo). See Central Alaskan Yup'ik, 
Naukanski Yupik, Pacific Yupik, Central Siberian 
Yupik 


Yurok [63], an Algic language distantly related to the 
Algonquian languages, is the traditional language of 
the Yurok Tribe of northwestern California and of 
three nearby independent rancherias of Yurok heri- 
tage, Resighini, Big Lagoon, and Trinidad (Cher-Ae). 
Only a dozen or fewer elderly people have full first- 
language fluency in a combined tribal enrolment of 
well over 4,000, although there are perhaps three 
times as many semi-speakers and passive speakers, all 
middle-aged or older. 


Zia. See Rio Grande Keresan 


Zuni [260], an isolate, is the language of Zuni Pueblo in 
western New Mexico. Zuni remains the primary lan- 
guage of most of the more than 9,000 tribal members, 
although almost all are bilingual in English. The Zuni 
Tribe has formally adopted a practical orthography 
and has collaborated with the public school district 
in developing a literacy programme designed to help 
preserve the language in written form. 


1.6 Contact languages 


Contact among languages, in North America as else- 
where in the world, has resulted in distinctive kinds 
of language, often showing features of more than one 
linguistic ancestor. The term ‘pidgin’ is applied to 
‘trade languages’ in which vocabulary is drawn pri- 
marily from one language, but morphological struc- 
ture is drastically simplified; examples in Native 
North America include Chinook Jargon, Delaware 
Jargon and Mobilian Jargon. In one view, when a 
pidgin language develops into the first language of a 
community, it becomes a creole. It is possible that 
Chinook Jargon reached this stage in some areas 
during the nineteenth and twentieth centuries. When 
structural features from two different languages are 
combined in a single language, without grammatical 
simplification, we can speak of a ‘mixed language’; 
North American examples are Mednyj Aleut and 
Michif. Details on all these languages are given below. 


Chinook Jargon was the lingua franca of the Pacific
Northwest, from the Alaska Panhandle to northwestern
California, in the eighteenth and nineteenth centuries.
Although it was initially based on Nootka vocabulary
diffused by European and American seafarers, the major
basis for its vocabulary was the Chinook language of the
lower Columbia River area, with further words added from
Salishan languages. When speakers of French and
English arrived in the area, vocabulary from those
languages was also added. ‘Jargon’, as Chinook Jargon
is often called in the area, may have been spoken as a
second language by as many as 100,000 people at one
time, and is still occasionally used by a few.

Mednyj Aleut was a mixed language spoken on
Mednyj Island, off the coast of Kamchatka, in the
nineteenth and twentieth centuries by people of mixed
Russian-Aleut descent. The language contained
vocabulary from both languages; noun forms
mostly employed Aleut morphology, while verb
forms used Russian inflectional endings.


Michif. See Michif (above) 


Mobilian Jargon was at one time widely used in the 
southeastern US, among numerous Indian tribes, 
Choctaw, Chickasaw, Apalachee, Alabama, Koasati, 
Chitimacha, Natchez, Tunica, Ofo, Biloxi, Caddo, and 
others, as well as by Europeans. It was based primarily 
on Muskogean languages. A detailed discussion is that 
of Drechsel (1997). 


Pidgin Delaware, developed from Unami Delaware,
was used between the Delaware and European colonists
in the seventeenth and eighteenth centuries.


1.7 Post-Columbian languages
of the Americas 
J. Lachlan Mackenzie 


Canada recognises English (approx. 17.1m native
speakers) and French (approx. 6.7m, the mother
tongue of over 80 per cent of the natives of Quebec
province) as national languages, and in Nunavut
and Northwest Territories recognises various Native 
Canadian languages. Of the Native Canadian lan- 
guages still extant, the largest numbers of speakers are 
reported for (especially Western) Ojibwa, various dia- 
lects of Cree, Eastern and Western Inuktitut, and 
Mi'kmaq. Although Gordon (2005) notes that the
usage of these languages is ‘vigorous’, all native lan- 
guages are numerically in decline. In Canada, a huge 
range of immigrant languages is spoken in the family 
and local community: languages with over 150,000 
speakers each are Arabic and Chinese (each in various 
forms), Dutch, German, Italian, Panjabi, Polish, 
Portuguese, Spanish, Tagalog and Ukrainian. The 
official language of St Pierre and Miquelon is French. 

The United States of America, though home to
a multiplicity of languages, recognises only English as
an ‘unofficial official language’ (Campbell, forthcoming),
adding Hawai'ian in the state of Hawaii,
French in Louisiana and Spanish in New Mexico. By
far the largest minority language is Spanish with,
according to the US Census Bureau’s 2004 American
Community Survey, over 30m speakers; other languages
with large numbers of speakers, many exceeding
a million, are Arabic, Armenian, Chinese
(2.3m), French (including Cajun or Acadian French),
German (in various dialects), Italian, Korean, Romani,
Tagalog and other Philippine languages, and Yiddish.

Figure 1.1 Languages of the Caribbean

Among the significant varieties are Gullah, an 
English-based creole spoken by some 125,000 Afro- 
Americans, mostly on the islands off South Carolina 
and Georgia; African American Vernacular English 
(or Ebonics), possibly with creole origins, and spoken 
by upwards of 30m people in inner cities across the 
USA; Pennsylvania German (popularly known as 
Pennsylvania Dutch), a form of German gradually 
attriting into a dialect of English; Louisiana Creole 
French (or Négue), spoken mainly by older people of 
mixed French-Canadian and Afro-American origin in 
eastern Louisiana and eastern Texas; and Hawai'i 
Creole English. 

Of the Native American languages spoken within 
the United States, the following were in 1990 the lan- 
guages with the largest numbers of speakers according 
to Campbell (forthcoming); note that Gordon (2005) 
now gives significantly lower numbers for most: 


Navajo                    c. 80,000 (incl. 7,616 monolinguals, acc. to Gordon 2005)
Central Alaskan Yup'ik    c. 20,000
Sioux (Dakota)            c. 20,000
Creek (Muskogee)          c. 18,000
Tohono O'odham            c. 15,000
Choctaw                   c. 11,000


Half of the traditional Native American languages are 
extinct, and very many of the surviving languages 
are now used only by a handful of people. None of 
the native languages is expected to survive very long, 
since most are spoken only by older people and the 
number of children learning the languages is declining
rapidly. As the figures indicate, Navajo is best
placed to maintain itself, but Spolsky (2002) is now
less hopeful about that than he was in his earlier studies.
Of various attempts at revitalising native languages, 
Campbell (forthcoming) writes that ‘Success is hoped 
for, but not often anticipated.’ 

Mexico’s official language is Spanish and the
population is very largely Spanish-speaking, with 
some 8 per cent speaking an indigenous language. 
This pattern is replicated throughout much of Cen- 
tral America: Spanish is the official and numerically 
dominant language of Guatemala, El Salvador, Hon- 
duras, Nicaragua, Costa Rica and Panama; in Belize, 
however, the official language is English, although the 
population is largely bilingual in Spanish (dominant 
inland) and Belize Creole English (dominant in the 
cities and by the coast). English-based creoles are 
found on the Bay Islands in Honduras, in Nicaragua 
on the coast by Bluefields, around the banana plant- 
ations of Costa Rica, where Mekitelyu (an English- 
based creole) is spoken, and in Panama, with some 
300,000 speakers of Panamanian Creole English. 

As for the West Indies, we see a mixed picture 
strongly influenced by colonial history, with official 
languages that originate in Europe, but in actual 
speech practice it is creole languages that predomin- 
ate. Thus English is the official language of the 
Bahamas, where Bahamian Creole is the native lan- 
guage of almost all the inhabitants; similarly in Haiti 
(French; Haitian Creole), Cayman Islands (English; 
Cayman Islands Creole), Jamaica (English; Jamaican 
Creole), Turks and Caicos Islands (English; Turks 
and Caicos Creole), British Virgin Islands and US 
Virgin Islands (English; Virgin Islands Creole), 
Anguilla, Antigua and Barbuda, Montserrat, St Kitts 
and Nevis (English; Leeward Creole), Guadeloupe 
(French; Lesser Antillean Creole French), Dominica 
(English; French-based Dominica Creole), Mar- 
tinique (French; Lesser Antillean Creole French), 
St Lucia (English; Lesser Antillean Creole French), 
Barbados (English; Bajan Creole), St Vincent and the 
Grenadines (English; Vincentian Creole), Grenada 
(English; Windward Creole), Trinidad and Tobago 
(English; Tobagonian and Trinidadian Creoles). The
Netherlands Antilles and Aruba have Dutch as their
official language, with 84 per cent of the population 
speaking the Iberian-based and Dutch-influenced 
creole Papiamentu. Bermuda’s language is English. In 
Cuba, it is Spanish that is spoken, as also in the 
Dominican Republic, where Samana English is used 
on the island of Hispaniola; in Puerto Rico, the 
official languages are Spanish and English, with the 
former statistically dominant (90 per cent); in all 
three countries, the Spanish of some speakers displays 
creolisation. 

The official language of most South American 
countries is Spanish. This is true of Colombia,
Venezuela, Ecuador, Bolivia, Argentina, Chile and 
Uruguay; in the first-mentioned countries creolised 
Trade Spanish may be used in interaction between 
speakers of Spanish and indigenous languages; in 
Ecuador, Media Lengua is a mixed language com- 
posed of Spanish vocabulary and Quechua grammar. 
In Peru, official status is granted to Spanish and 
Cuzco Quechua and in Paraguay to Spanish and 
Guarani. Paraguay has substantial minorities of 
German (also Low German) and Portuguese speakers. 
The national language of Brazil is Portuguese, with 
sizeable minorities of speakers of German (1.5m), 
Italian (0.5m) and Japanese (0.4m); a French-based 
creole is used around Macapa in northern Brazil: 
Amapa Creole. The official language of Guyana is 
English, with the majority speaking Guyanese Creole. 
That of Suriname is Dutch, the dominant creole 
being Sranan, with considerable numbers of speakers 
of Saramaccan (an English-based creole spoken 
inland), Aukaan (or Ndjuka, another English-based 
creole spoken along the Marowijne river), creolised 
Hindi (Sarnami) and creolised Javanese. The official 
language of French Guiana is French, the major
native language being the creole Guyanais. The lan- 
guage of the Falkland Islands is English. 


Pie-charts
The pie-charts given here, based on the latest available
census figures, illustrate the linguistic composition of
a number of representative cities in the United States.

Figure 1.2 Languages spoken in US cities: Chicago
Figure 1.3 Languages spoken in US cities: Honolulu
Figure 1.4 Languages spoken in US cities: Houston
Figure 1.5 Languages spoken in US cities: Los Angeles
Figure 1.6 Languages spoken in US cities: Miami
Figure 1.7 Languages spoken in US cities: New York
Figure 1.8 Languages spoken in US cities: San Francisco


1.8 Literature 

A number of useful general books on North Ameri- 
can Indian languages have appeared in recent years. 
A survey of the entire field, with sketch grammars 
of several languages and a large map, is Goddard, 
ed. (1996a). A discussion of the genetic classification 
of all the Native languages of the Americas is pro- 
vided by Campbell (1997). A survey of the ethno- 
linguistic and sociolinguistic characteristics of North 
American languages is offered by Silver and Miller 
(1997). The volume of Mithun (1999) presents struc- 
tural features of the languages and a catalogue of 
families and languages, with comments on salient 
characteristics of many languages. These books con- 
tain detailed bibliographies; users of this atlas who 
want more information on a particular language can 
refer to these books. 


Note on the depiction of the 
contemporary languages of 
North America 


Maps 8 and 9 show a summary of the contemporary 
languages spoken in North America, with an 
emphasis on the locations of the remaining speakers 
of Native American languages. The editors have 
attempted to show where speakers of Native Ameri- 
can languages are concentrated, based on the 
locations of Native reservations and reports of other
concentrations of speakers. However, some reservations
have more than one native tribe officially present,
and hence speakers of more than one Native
American language (or none). Native speakers often 
now live in urban areas, away from reservations, and 
in concentrations too low to depict on small-scale 
maps. In some cases, the map shows the contempor- 
ary location of a Native language spoken by only a 
handful of people as it is deemed to be important to 
locate such speakers. In practice, speakers of many 
immigrant languages far outnumber speakers of 
some Native American languages, although they are 
not indicated on summary maps of this sort. 

As explained in Section 1.7, the dominant lan- 
guages of European origin spoken in the region are 
English, Spanish and French, which have either official
status or de facto status as national languages. The 
situation is linguistically complex, especially in major 
conurbations, and maps of this scale can show only 
an indicative situation. Maps 8 and 9 show the three 
European languages in their principal locations, their 
borders determined partly by the political boundaries 
of countries or provinces and partly by information 
compiled from published sources. However, in prac- 
tice boundaries between them are not rigid. For 
example, although French is the dominant language 
in Quebec, English is also important. Where signifi- 
cant numbers of speakers of minority languages exist 
in concentrations, the maps use a system of letters (E, 
S, F) to show indicative locations of these groups.


Latin America: general remarks


Inasmuch as the work on Latin America for this atlas 
has primarily been done by one person, there are 
numerous similarities in the manner of organization 
between Meso-America (MA) and South America 
(SA). Ideally, the mode of presentation would be 
entirely uniform. There are, however, numerous ways 
in which our knowledge of MA and SA languages is 
not comparable: SA is vastly larger than MA in size 
and linguistic diversity; MA can be shown on a single 
map, while SA takes several plates; MA is rather 
well studied with many viable languages, while SA 
is rather poorly studied, with relatively few viable 
languages — still, SA has more viable languages than 
Anglo-America and Franco-America; the internal 
classification of MA language families can be speci- 
fied in great detail, while for SA such a possibility is 
rare; the orthographic devices for spelling language 
names differ somewhat between MA and SA. 

In this second edition, unlike the first, both the 
indigenous languages of the region and the European 
colonial languages are plotted. Obviously the colonial 
languages have become the first languages of the vast 
majority of those born in the western hemisphere. 
Because of the vast changes in distribution and 
numbers of indigenous languages in the western 
hemisphere over the past five hundred years, for 
every area depicted a contact period map (for ‘time 
of contact’) has been prepared, as well as a twenty- 
first-century (‘present-day’) map. On the twenty-first- 
century maps, national languages have been added. 

The maps are colour-coded so that each family 
or isolate has a separate colour. 


Terminology for linguistic units 


It will be helpful to define a few unfamiliar terms for 
the general reader. Linguistic diversity — how different 
two languages are — can be measured by the years 
since they began to diverge from a common speech. 
Research comparing 100 or 200 common word-forms 
in various well-documented languages, over a millen- 
nium or longer, seems to show that related languages 
diverge from each other at a predictable speed. 

Terms especially to be noted are the comple- 
mentary terms language complex/virtual language 
and language area/emergent language. Language 
complex refers to a geographically continuous zone 
that contains linguistic diversity greater than that 
found within a single language (between 1,000 and 
1,500 years), but where internal linguistic boundaries 
similar to those that separate clearly discrete lan- 
guages are lacking. Familiar instances are Ibero- 
Romance: Portuguese and Castilian; Gallo-Romance: 
Catalan, Provençal and French. The divisions of a
language complex are virtual languages. Language 
area refers to a situation where there is a good 
amount of internal diversity (600-900 years), with 
clear-cut linguistic divisions or boundaries, but a 
good deal of mutual intelligibility within the whole 
area. The components of a language area are emer- 
gent languages. 

Language areas are not normally distinguished 
from languages in Europe because the linguistic 
diversity in places such as Britain, France and Germany
is in vernacular and official usage attributed
simply to dialect variation. In Latin America, on the 
other hand, speakers of different dialects of what is 
obviously the same language (to a linguist) often con- 
sider themselves to be ethnically distinct; thus the lin- 
guist in all such instances has to look at the linguistic 
facts with considerably more care for terminology than 
is normally done in the literate parts of the world. 

The terms typically used to name groupings of 
communities speaking genetically related forms of 
speech are ranked in decreasing order of inclusive- 
ness: stock, family, branch, group, subgroup, language 
complex, language area, language, virtual language, 
emergent language, dialect area, dialect, variety. The 
terms used here vary somewhat from stock to stock, 
being based on the hierarchies adopted by the author- 
ities cited. Only in rare cases will dialects be specified 
in the classification or plotted on the maps. 


Levels of relationship, kinds of languages 
and genetic classification 


The categories group, language and dialect are quite 
problematic in the classification of languages. Often 
the distinctions are hard to make, not because the 
categories are imprecise, but because not enough dis- 
tinct categories are recognized and named. 

It is common to find a set of two or more lan- 
guages existing side by side that are closely related 
and where the linguistic boundaries between one lan- 
guage and the next are distinct and clear-cut. It is also 
often the case that an area of comparable linguistic 
diversity might lack clear-cut ‘language’ boundaries. 
The presence of this latter situation does not call the 
existence of the former category into question; the 
different kinds of language/dialect situations that we 
find are the results of different patterns of linguistic 
change and diversification. 


Language and dialect 


Two sets of communities speak somewhat differently, 
but their natural speech is mutually intelligible with- 
out prior familiarity. All these communities speak 
the same language in two different dialects. Some- 
times two dialects are quite well defined on a variety 
of structural traits, yet there is overall intelligibility 
between them. Research into the languages of the 
Americas and elsewhere indicates that a physical 
barrier between two sets of communities speaking the 
same dialect will eventually lead to loss of mutual 
intelligibility, but this seems to take at least 500 years. 


Language and genetic group 


Two or more different sets of communities speak 
similarly to each other, but though they can be shown 
to be genetically related and descend from a common 
ancestor, their speech is not mutually intelligible, 
though there may be a moderate degree of ability to 
recognize cognate words, and there is a clear bound- 
ary — sometimes an impassable one — between one set 
of communities and the other. The existence of an 
impassable boundary is not the only situation where 
there can exist a clear-cut linguistic boundary 
between closely related languages; migration and
the extinction of intervening dialects are the main
causes of this kind of situation. These communities 
speak separate languages belonging to the same 
genetic group. 

This is the paradigm case, and examples (with 
centuries of diversification in parentheses) abound.
Examples with no geographical separation include 
Kiché (Spanish Quiché) vs. Kakchikel (Spanish Cakchi- 
quel) (10 centuries), Otomi vs. Masawa (Spanish 
Mazahua) (10 centuries) and Tzeltal vs. Tzotzil (14 
centuries). Examples with geographical separation 
include Nawa (Spanish Nahua) vs. Pipil (11 cen- 
turies), Mange (Spanish Mangue) vs. Chiapaneko 
(Spanish Chiapaneco) (13 centuries) and Kabil 
(Spanish Chicomucelteco) vs. Wasteko (Spanish 
Huasteco) (9 centuries). 


Language area and emergent language 


Two or more sets of communities each have a well- 
defined set of local linguistic traits; there are clear 
boundaries between one set of communities and 
another, yet there is a high degree of mutual intelligi- 
bility (to the untutored) across these well-defined 
boundaries. There is a percentage of cognates on 
Swadesh’s standard 100-word list ranging between 76 
and 84, which correlates with an internal time depth 
of between 600 and 900 years. These sets of com- 
munities constitute a language area; its constituent 
members are emergent languages (em. lang.); they 
themselves may have dialects or local varieties. (Note: 
‘language area’ is not to be confused with ‘linguistic 
area’ or Sprachbund.) Examples are, with number 
of emergent languages and centuries of diversification 
in parentheses: Kanjobal-Akateko-Jakalteko (three, 
7 centuries); Northern Sapoteko (three, 9 centuries); 
Kakchikel-Tzutujil (two, 8 centuries?); Trike (two,
9 centuries); Nawa (four, 8 centuries); Tlapaneko-Sutiaba
(three, 8 centuries); Matlatzinkan-Okwilteko
(two, 8-9 centuries). 


Dialect chain 


There are several sets of communities. Each set shows 
a high degree of internal cohesion. Across these sets 
there is intelligibility with all adjacent sets, though 
less overall linguistic similarity throughout the whole 
chain than between adjacent dialects. Overall, from 
one end of the area to the other, there is a good deal 
or even a great deal of linguistic diversity, and 
between communities at the extremes of the area 
intelligibility is very low or lacking altogether; 
yet, even though there may be well-defined dialect 
clusters/regions, there are not clear-cut boundaries of 
intelligibility and structural distinctiveness that 
would enable us to recognize a ‘language’ boundary. 
This is a language whose topological structure is that 
of a dialect chain with serial intelligibility. There is a 
percentage of cognates on the Swadesh word-list, 
ranging from 64 per cent upwards, that correlates 
with a maximum time depth of 1,500 years. 
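
The correlation between cognate percentages and time depth quoted in this and the preceding subsection can be illustrated with a short calculation. The sketch below assumes the conventional glottochronological formula, t = ln c / (2 ln r), with a retention rate r of 0.86 per millennium for the 100-word list. Neither the formula nor the constant is stated in this atlas, and the function name is purely illustrative. Under those assumptions, 84 and 76 per cent of shared cognates give roughly 600 and 900 years, and 64 per cent gives roughly 1,500 years, in line with the figures cited above.

```python
import math

# Assumption: conventional retention rate per millennium for the
# 100-word list; this constant is not given in the atlas itself.
RETENTION_PER_MILLENNIUM = 0.86


def time_depth_years(cognate_percent, retention=RETENTION_PER_MILLENNIUM):
    """Estimate years of separation from the percentage of shared cognates.

    Uses t = ln(c) / (2 * ln(r)), where c is the shared-cognate fraction
    and r the assumed retention rate per millennium for each language.
    """
    if not 0 < cognate_percent <= 100:
        raise ValueError("cognate_percent must be in the range (0, 100]")
    share = cognate_percent / 100.0
    millennia = math.log(share) / (2 * math.log(retention))
    return millennia * 1000


if __name__ == "__main__":
    # Percentages mentioned for language areas (84, 76) and dialect chains (64).
    for percent in (84, 76, 64):
        print(percent, "per cent:", round(time_depth_years(percent)), "years")
```

The result is only as good as the assumed retention rate; a different constant shifts every estimate proportionally, so the output is best read as an order of magnitude.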


Language complex 


In general, the situation is as described for a dialect 
chain, but there are recognizable sets of dialects that
can be postulated as constituting the functional
equivalents of distinct languages within a genetic group.
The distinct constituents of a language complex are 
virtual languages. Examples are, with the number of 
virtual languages in parentheses: Sapoteko (five); 
Misteko (three, 15 centuries). 


Ranking of categories 


Genetic groups can be made up of languages, dialect 
chains and language complexes. Languages can be 
made up of dialects, which can occur in groups. Dia- 
lects can have varieties. Dialect chains are made up of 
dialects, which can occur in groups. Language com- 
plexes can be made up of virtual languages and lan- 
guage areas. Virtual languages are made up of dialects 
and dialect groups. Language areas are made up of 
emergent languages. Emergent languages can have 
dialects and dialect groups. 
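
The containment rules in the paragraph above can be written out as a small lookup table. This is only a sketch: the dictionary, the label 'dialect group' for dialects that occur in groups, and the helper function are illustrative names introduced here, not notation used in the atlas.

```python
# Which unit may directly contain which, transcribed from the paragraph above.
# Assumption: "dialects, which can occur in groups" is modelled as a separate
# 'dialect group' unit that itself contains dialects.
MAY_CONTAIN = {
    "genetic group": {"language", "dialect chain", "language complex"},
    "language": {"dialect", "dialect group"},
    "dialect chain": {"dialect", "dialect group"},
    "language complex": {"virtual language", "language area"},
    "virtual language": {"dialect", "dialect group"},
    "language area": {"emergent language"},
    "emergent language": {"dialect", "dialect group"},
    "dialect group": {"dialect"},
    "dialect": {"variety"},
}


def units_below(unit):
    """Return every kind of unit that can occur at any depth below `unit`."""
    seen = set()
    stack = [unit]
    while stack:
        current = stack.pop()
        for child in MAY_CONTAIN.get(current, ()):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


if __name__ == "__main__":
    # Emergent languages occur inside a language complex (via a language area),
    # but virtual languages never occur inside a plain language.
    print("emergent language" in units_below("language complex"))
    print("virtual language" in units_below("language"))
```

Laid out this way, the table makes it easy to check that a proposed classification does not nest a unit where the scheme does not allow it.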


Family, stock and phylum 


The unmarked genetic group is called a family. A 
set of families is called a stock. A set of stocks has 
often been called a phylum. Intermediate ranks (other 
than language complex, language area and dialect 
chain) within families can be called subfamilies, 
(sub)groups, branches and divisions. The typical 
minimum time depth for a family is 1,000 (up to 
5,000) years, for a stock, 4,000 (up to 8,000) years. 
Examples of well-known families, as compared with 
Germanic (25 centuries), Bantu (15 c.) and Semitic 
(30 c.), are: Mayan (42 c.), Algonquian, Siouan, 
Salishan, Pomoan, Yuman, Wakashan, Mije-Sokean 
(Spanish Mixe-Zoquean) (35 c.), Sapotekan (Spanish
Zapotecan) (24 c.), Oto-Pamean (36 c.). Examples
of well-known stocks, as compared with Indo- 
European (55-60 c.) and Tibeto-Burman and 
Austronesian (70 c.), are Oto-Mangean (60 c.) and 
Uto-Aztecan (70 c.). 


Clusters and long-range comparison 


Phyla (also known as superstocks) are notorious 
rather than well-known, because the existence of not 
a single one of them has been established through 
reconstruction, and for this reason the term is 
avoided in this section of the atlas. This is primarily 
due to the natural limitations on the applicability 
of the comparative method. The comparative 
method depends on valid etymologies. Etymologies 
are validated by the successful postulation of 
proto-languages (which are made up of proto- 
phonologies, proto-morphemes and proto-grammar 
fragments), and plausible phonological and gram- 
matical changes yielding a significant component of 
each of the languages that are claimed to be its 
descendants. 

For C. F. Voegelin, phyla were by definition of
such great internal depth that proto-languages model- 
ling their ancestors could not be reconstructed 
because not enough cognate material remained such 
that hypotheses could be tested and proposed ety- 
mologies validated. It remains to be seen whether the 
‘phyla’ proposed by Stephen Wurm in New Guinea 
conform to Voegelin’s conception or whether they are 
subject to reconstruction after all — and, if so, what 
time depth can be reached. Sino-Tibetan may be a 
large stock rather than a phylum. On the other hand, 
Nilo-Saharan in the sense of Greenberg, Bender,
Ehret, Fleming and others is probably a phylum in
Voegelin’s sense. Obviously, hypotheses of long-range 
genetic connection will not be demonstrated unless 
someone works at them. 

Some hypotheses of distant genetic relationship, 
if thoroughly studied and demonstrated through 
reconstruction, pass from what might have been 
thought to be a phylum to a stock. This may well 
happen in the case of Hokan and Penutian. That is, 
these entities are hypothetical and subject to verifica- 
tion through reconstruction, which may or may not 
be successful, but it is not necessarily claimed that 
their time depth, if they are valid genetic groups, is in 
fact more than 7,000 years. Thus, if we need a distinct 
name for a hypothesized non-obvious genetic group 
of indeterminate time depth, Hokan and Penutian 
should not be termed phyla or superstocks until their 
linguistic history has been successfully reconstructed. 
The prefix ‘macro-’ has been floating about for some 
time, with no very well-defined applications. For a 
while superstocks/phyla were named ‘macro-x’. For 
the most part, macro- has been used to label the pro- 
posed union of two previously recognized genetic 
entities, at least one of which was a stock. Most of the 
proposed macro-groupings have not yet been dem- 
onstrated. I propose using the element macro- to des- 
ignate a set of language families and/or isolates that 
seem likely to be genetically related, but that have not 
been demonstrated to be related. 

The groups designated by a macro-label I would 
call clusters. As the genetic relationships within a 
proposed cluster are validated or debunked, the valid 
genetic groupings should receive names that do not 
include macro-. Thus macro- would be used in an 
unambiguous way.


2.1 The Meso-American culture
and linguistic area 


Meso-America is a well-defined cultural region of the 
western hemisphere, including the southern half (and 
a bit more) of Mexico, all of Belize, Guatemala and El 
Salvador, and parts of Honduras, Nicaragua and 
Costa Rica. 

Meso-America is defined by Kirchhoff (1943). In
general I follow Kirchhoff, with the following constraints:
archaeologically, Meso-America is defined
by architecture, not pottery. This limits its north- 
ward extension. Any area occupied by a language at 
home in Meso-America is classified as part of Meso- 
America on our map: that means that some parts of 
Honduras, Nicaragua and Costa Rica (colonized by 
Nawa, Sutiaba and Mange) are assigned to Meso- 
America. 

Meso-America owes its distinctive traits to the 
early domestication in or around the Tehuacan valley 
(Central Mexico) of maize, beans and other crops, as 
much as 7,000 years ago. By about 2000 BC, cultural
phenomena that are basic to the distinctive Meso- 
American culture began to appear in central and 
southern Mexico. The southernmost and western- 
most parts of Meso-America, as it could be defined as 
of 1500, were not Meso-Americanized until the late 
first millennium of our era. Since 1000 BC, virtually
all Meso-American indigenes have been peasant agri- 
culturalists. The cultural traits spread in the process 
of Meso-Americanization included linguistic traits 
(discussed in Campbell, Kaufman and Smith-Stark 
1986). 

The genetic diversity of Meso-American lan- 
guages far antedates the origins of Meso-American 
culture and civilization. Some of the languages 
spoken in the region have relatives in North America, 
and certain still unproven hypotheses link some of 
the languages to South America as well. 

Meso-America lies entirely within the tropics. 
Except for Belize, where it is English, the official lan- 
guage in all of Meso-America is Spanish. 


2.1.1 Linguistic diversity in Meso-America 


With respect to linguistic diversity, the Eurasian land 
mass is atypical of the earth’s surface, since over the 
past 5,000 years various empires have arisen that 
spread a single language at the expense of many 
others. In the New World, this happened only once: 
the Inca empire of Peru, which had come into existence
no earlier than AD 1200. Thus, the linguistic
diversity in the New World in 1500 was comparable 
to that of Africa and Oceania of the same period; that 
is, extremely diverse and at the same time normal. 


2.1.2 Documentation 


Most Meso-American languages were documented in 
some fashion by Spanish Roman Catholic mission- 
aries in the colonial period (1519-1814), some of 
whom, especially in the sixteenth and early seven- 
teenth centuries, were quite learned (for their time) 
and talented; all, however, failed to provide fully
specified phonological information. Quite a few languages
became extinct in the colonial period without
any documentation; still, most survived, though some 
only precariously. Scientific linguistic research in 
Meso-America dates from the early 1930s. Since then, 
most surviving languages have been the object of 
study by at least two technically trained linguists. 
Much documentation has been published; much 
more remains unpublished. Especially productive (as 
a corporate group) in the documentation of Meso- 
American languages has been the Summer Institute 
of Linguistics (SIL). Somewhat behind SIL have been 
non-missionary academic linguists trained in the 
United States. 


2.1.3 Ghost languages 


Pupuluca de Conguaco refers to manuscript data from
the town of Oluta, Veracruz, Mexico (Oluta Mije), 
data mistakenly believed by Otto Stoll to have come 
from a town called Conguaco in south-east Guate- 
mala (see Kaufman 1974a: 43). 

Aguacateco II is the name given to some seventy
words collected in Guatemala in the late nineteenth 
century by Otto Stoll (1958) from his non-indigenous 
maid, who hailed from Aguacatan, and said she had 
learned them from some Indians there. They are not 
in Awakateko, the only language ever documented 
from Aguacatan. Some have thought that some of 
these words resemble words found in Mije-Sokean 
languages. This material is probably faked; other 
individuals in Meso-America have invented linguistic 
material when an outsider who mistakenly believed 
them to be speakers of an indigenous language has 
tried to collect data from them. This has been 
observed by seekers after Kabil (Mayan) and Kare 
(Lenkan). 

Subinhá, the name given to some material collected
in Socoltenango, Chiapas, Mexico, in 1788-9,
is not a single language but a mixture of material from 
Tzeltal and Tojolabal (see Kaufman 1974a: 44-5). 


2.1.4 The sociolinguistic situation 


Except in Belize, the official language throughout 
Meso-America is Spanish. During the whole repub- 
lican period (since independence from Spain) there 
has been heavy external pressure from Spanish 
speakers on speakers of indigenous languages, aimed 
at inducing them to abandon their native languages 
and cultures. Current economic situations contribute 
to the tendency to linguistic attrition in indigenous 
communities, many of which are bilingual. Those 
groups who are not likely to give up their languages 
are those that have always been very large or have 
always lived in economically undeveloped parts of the 
region, since economic development means that any 
profitable local resources will be appropriated by 
outsiders. 

Such linguistic entities as Nawa (Nahuatl, about 
1,316,000 speakers), Sapoteko (Zapoteco, about 
445,000), Misteko (Mixteco, about 292,000), Otomi 
(about 262,000), Chinanteko (Chinanteco) and 
Masateko (Mazateco) are not single languages but 
groups of a few languages each with between 1,000 
and 1,500 years of time depth. It is not feasible, given
current usage, to convert these names to family
names ending in -an, because such terms have already 
been applied to other usages. Nawa appears to be the 
most numerous linguistic entity in Meso-America, 
but if the comparable units within Mayan are com- 
pared, Nawa (with about 1,100 years of time depth) 
is comparable to Kichéan proper (Kiché-Sakapulteko- 
Tzutujil-Kaqchikel, or Quiché-Zacapulteco-Tzutujil- 
Cakchiquel, about 1,000 years’ time depth), which has 
1,309,500 speakers; the difference in numbers is neg- 
ligible when the inaccuracy of population figures is 
taken into account. Kiché, with 800,000 speakers, and 
Yukateko (Yucateco), with 665,000 speakers, both 
outnumber Northern Nawa, with 604,000 speakers. 


2.1.5 Changes in language distribution 
since 1500 


Most suitable land in Meso-America before 1500 was 
under cultivation, and even marginal land was in 
many cases irrigated and terraced to make it cultiv- 
able. In the tropical lowlands (tierra caliente), three 
crops of corn could be harvested per year with irriga- 
tion, but the area was unhealthy and thinly populated 
even before European diseases were introduced, and 
afterwards the lowlands were effectively unpopulated 
for more than a century. Between the contact period 
and the late twentieth century, the geographical 
spread of individual languages may have changed in 
several different ways: 


1 The territory shrank due to depopulation or 
shift to a European language. 

2 The territory shifted due to a movement, usually 
forced, of the population. 

3 The territory of one language expanded at the 
expense of the territory of another language. 


The first is the typical case. Between 1520 and 1620, 
the indigenous population is widely believed to have 
fallen from about 22 millions to about 1 million, 
about 4.5 per cent of what it was at the beginning of 
the period — this owing to the effects of disease, war- 
fare and forced labour (Gerhard 1972: 22-3). 
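The proportion quoted here can be checked with a line of arithmetic. The sketch below is illustrative only: the function name surviving_share is an assumption introduced for the example, and the inputs are the rounded estimates given in the text, not independent data.

```python
def surviving_share(before: float, after: float) -> float:
    """Return the later population as a percentage of the earlier one."""
    return 100.0 * after / before


# Estimates quoted in the text: about 22 million in 1520, about 1 million in 1620.
share_1620 = surviving_share(22_000_000, 1_000_000)
print(f"{share_1620:.1f} per cent of the 1520 population remained in 1620")
```

Because both inputs are rounded estimates, the result is only as reliable as the estimates themselves.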

Items 1 and 2 are combined in the case of 
Wasteko (Huasteco). The Wastekos, a million strong 
in 1500, after being conquered by the Spanish, 
revolted three times against them during the six- 
teenth century (in 1520, 1523 and 1525-6) and were 
almost wiped out by being slaughtered or sold into 
slavery. All their towns were destroyed and the sur- 
vivors were settled in newly founded communities 
(which were nevertheless within their ancestral terri- 
tory). By 1680 there were about 10,000 Wastekos in 
about thirty settlements (Gerhard 1972: 215-16, 357). 
In 1980 there were about 100,000 Wastekos. 

Item 3 is fairly rare. There are two notable 
instances. Otomi speakers often moved onto mar- 
ginal land abandoned after the conquest by other 
indigenes (whose numbers shrank so drastically 
during the sixteenth century) and ignored by the 
Spanish. Early ethnohistorical sources tell us that
‘Otomi’ was spoken in the state of Jalisco in the sixteenth
century. Since no linguistic data from this area
have been forthcoming, it is not clear if this was 
Otomi as we know it. Nawa speakers were used as 
auxiliary troops by the Spanish and were settled in 
many (to them) new areas at the time the Spanish
conquered them. Nawas also expanded (in the seven- 
teenth century) into some formerly Wasteko territory 
freed up by the drastic reduction in Wasteko popula- 
tion after the disastrous aftermath of their revolt. 
Other cases of these types of changes may be observed 
by comparing the contact-period map with the 
twenty-first-century map. 


2.1.6 Pre-1500 population shifts 


In many cases, closely related languages, or even 
dialects of the same language, are found in widely 
separated parts of Meso-America. 

The Nawa presence in Meso-America is relatively 
recent, dating from about AD 500 and later. Before
500, Nawa occupied an area — outside Meso-America 
as here defined — somewhere between Kora-Wichol- 
Kaskan-Tekwexe (Cora-Huichol-Cascan-Tecueje) on 
the west and Wasteko on the east. Around AD 500
Nawa arrived in the Valley of Mexico, and soon after 
this an outlier, Pochuteko, established itself on the 
Pacific coast of Oaxaca. Before 800, Northern Nawa, 
coming in from central Mexico at a time when no 
traceable local dialects had yet developed, was planted 
in the area shown on the contact-period map. This is 
the origin of the form of Nawa used by the Toltecs. By 
800, Eastern Nawa and Pipil began to be colonized 
from the Northern Nawa area. By 1000, the Western 
Nawa area started being colonized from the Central 
Nawa area. By 1400 the distribution of Nawa shown 
on the map had been accomplished. 

In every region that Nawa expanded into, it came 
into contact with other languages that had already 
been there for some time, and in every case Nawa 
underwent at least some influence from the locally 
established languages. While still outside Meso- 
America, pre-Nawa borrowed a number of lexical 
items from Wasteko, and underwent some phono- 
logical influence from the (not closely) related Koran 
languages. It also borrowed some lexical items of 
Mije-Sokean origin, possibly a result of the presence 
of Olmec outposts in central and western Mexico in 
the period 1000-400 BC.

On arriving in the Valley of Mexico, Nawa 
encountered, among other languages, Totonako, from 
which it borrowed the ‘diminutive’ suffix -tzi:n and 
the phoneme /tl/. Whether rapidly or gradually, the 
Totonakos were eventually expelled from the central 
Mexican highlands. All forms of Nawa show Wasteko
and Totonako traits. Other languages that were
overlaid by Nawa included Matlatzinkan, Chocho, 
Tlapaneko, Kwitlateko, Soteapaneko, Popoluka, 
Xinkan, Lenkan and Koran. None of these other lan- 
guages has influenced Nawa generally — they were 
contacted by Nawa one at a time in the process of 
colonization stemming from either the Northern or 
Central Nawa regions. 

In the last 1,500 years, several southward popula- 
tion displacements have taken place, as a result of (a) 
the Nawa incursions into central Mexico, with atten- 
dant disruptions, about 500; (b) Toltec origins and 
expansionism, about 800-900; (c) Central Nawa expan- 
sionism out of central Mexico, about 1000-1400; (d) 
Totonako expansion into the central Gulf coast, about 
1100-1200; (e) the Toltec decline, about 1150-1200. 
Other ramifications along the time line were these 
(with approximate dates AD and likely cause a-e):


500: Chinanteko moved from between 
Matlatzinkan and Tlapaneko to Northern 
Oaxaca (a). 

500-700: Chiapaneko-Chorotega moved from 
between Otomi and Chocho to western 
Chiapas (a). 

700: Chorotega moved to Honduras and 
Nicaragua (a). 

800-900: Northern Nawa colonized the central 
and southern Gulf coast as Eastern 
Nawa (b). 

900: Pipil left the southern coast and 
established colonies in Chiapas, 
Guatemala and El Salvador (b). 


1100: Kabil moved from south of Wasteko to the 
Grijalva Valley of Chiapas (d). 

1200: Sutiaba separated from Tlapaneko and 
moved to Nicaragua (c, e). 


2.1.7 The most important languages 


In terms of numbers of speakers from the sixteenth 
century to the present day, the languages shown in 
Table 2.1, which had at least 100,000 speakers in 1980, 
are and have been the most important. 

The single most numerous emergent languages 
within Sapoteko, Misteko, Otomi and Masateko — all 
Oto-Mangean language areas — may well have under 
100,000 speakers, but this possibility has not been 
investigated, and it would be very difficult (if not 
impossible) to establish, given available sources. 

The colonial period in Meso-America can be 
broken down into three sub-periods: 


early colonial 1520-1620 
mid-colonial 1620-1720 
late colonial 1720-1820 


All the languages in Table 2.1 were documented in 
either printed books or manuscript before 1620 (the 
early colonial period), with six exceptions. First 
documented in the mid-colonial period were Masawa 
(1637) and Mam (1644); first documented in the late 
colonial period were Qeqchi (1725) and Totonako 
(1752); and Northern Nawa and Masateko may not 
have been documented in the colonial period at all. 

Languages important in the early and mid- 
colonial periods that do not appear in Table 2.1 are: 
Tarasko (1558/9); Cholti (1625), Chiapaneko (1633), 
Matlatzinkan (1638) and Soke (1672). 


2.1.8 Literacy in Meso-American 
languages 


At the time of the conquest, only Yukateko was being 
written — by members of a priestly/noble/scribe class. 
The writing system is called Lowland Mayan writing, 
and is attested since the third century AD. Its first
known use was for writing common Cholan (or 
Greater Tzeltalan) and common Yukatekan. The only 
other known fully elaborated writing system, the 
Epi-Olmec script, was in use by the second century 
and persisted down to the ninth century at least; it 
has been partly deciphered, and shown to be used for 
writing a Sokean language (Justeson and Kaufman 
1993). Mayan and Epi-Olmec writing were analogous 
to those systems used for writing ancient Egyptian 
and Luvian: there were syllabic signs representing 
phonetics, logograms representing roots and lexemes, 
and semantic signs (‘determinatives’) that helped disambiguate
phonetic representations that were not
always fully written out. 

The use of Mayan writing among the Yukatekos 
was suppressed during the sixteenth century by Spanish
Roman Catholic missionaries headed by Diego de Landa.
Directly after the conquest, the Latin alphabet
was adapted for the writing of several Meso-
American languages; generally these writing systems 
were used by missionaries and not to any serious 
extent by native speakers, but in a few cases native 
speakers applied the missionary-devised Latin alpha- 
bets to the preparation of legal documents, ethnohis- 
torical works or literary compositions. Notable in this 
regard are Nawa, Yukateko, Kiché and Kaqchikel. A 
very great amount of written material was produced 
in Nawa during the colonial period. No other Amer- 
indian language was so widely used in written form 
during the colonial period; none even comes near 
Nawa. 

In the past century many Meso-American lan- 
guages have been the object of the attention of 
Protestant missionaries (Wycliffe Bible Translators/
Summer Institute of Linguistics), who have translated 
parts of the Bible into Meso-American languages, 
devised orthographies and prepared primers for those 
wanting at least to read materials in their native lan- 
guages. These efforts do not seem to have created a 
significant group of writers in any native language. 
Other efforts have recently (since 1970) been mounted 
to train indigenous Meso-Americans in linguistics 
and to help develop groups of native writers: these 
efforts have mainly been directed to speakers of 
Mayan languages of Guatemala (Proyecto Lingüístico
Francisco Marroquín, Academia de las Lenguas
Mayas), and Chiapas (Tzotzil and Tzeltal Writers’ 
Workshop/Sna Jtz'ib'ajel), but other languages have 
been addressed as well. 

Except for those languages currently spoken by at 
least 100,000 speakers, the prognosis for the viability 
of most Meso-American languages is not good; there 
are strong pressures, mostly economic, for learning — 
and then shifting to — Spanish. Even those languages 
with more than 100,000 speakers will be threatened 
unless the languages expand the domains of their use, 
for example in writing, music, drama, radio, televi- 
sion, videotape, political organizing and especially in 
education. 


2.2 The maps 


For Meso-America there are two full-page maps: one 
for the contact period (by T. Kaufman) and one for 
the twenty-first century (by Ernesto Diaz-Couder). 

The contact-period map, by Kaufman, is based 
on the following sources: Peter Gerhard 1972, 1979, 
1982 (for Mexico), Anne Chapman 1978 (for Hon- 
duras), Walther Lehmann 1920 (for the whole area), 
Robert E. Longacre 1967 (whole area). It was drafted 
by Stephanie Koerner before being turned over to the 
publisher for redrafting. 

The term ‘contact period’ refers to whenever an
identifiable particular language is first noted as being
used in a particular place. In Latin America, most
languages are known from the sixteenth and seventeenth
centuries; in the interior of South America,
some languages became known only in the twentieth
century.


Table 2.1 Major indigenous languages in Meso-America, 1980


Language or language area   Family                      Approximate number of speakers
Kiché language area         Mayan                       1,310,000
Kiché                       Mayan                       800,000
Kaqchikel                   Mayan                       450,000
Yukateko                    Mayan                       665,000
Nawa language area          Yuta-Nawan                  1,316,000
Northern Nawa               Yuta-Nawan                  604,000
Central Nawa                Yuta-Nawan                  541,000
Sapoteko language area      Sapotekan (Oto-Mangean)     448,000
Mam                         Mayan                       300,000
Misteko language area       Mistekan (Oto-Mangean)      291,000
Qeqchi                      Mayan                       277,000
Otomí language area         Oto-Pamean (Oto-Mangean)    261,000
Totonako                    Totonakan                   215,000
Masawa                      Oto-Pamean (Oto-Mangean)    212,000
Tzeltal                     Mayan                       212,000
Tzotzil                     Mayan                       136,000
Masateko language area      Popolokan (Oto-Mangean)     133,000
Wasteko                     Mayan                       104,000


2.3 Spelling of Meso-American 
language names 


All language names are spelled according to a uni- 
form system, based on Spanish letter values, but not 
limited to them. This system is essentially the same as 
that used and recommended by the Academia de las 
Lenguas Mayas of Guatemala (see South America 
3.10). In it, every sound is spelled in one way only; 
such letters and digraphs as c, z, h, hu, gu, qu are not 
used, and g has only one pronunciation, namely [g]. 
Note that stress falls on any accented vowel; other- 
wise, stress falls on the vowel preceding the rightmost 
consonant unless that consonant is n or s. Only the 
five vowel qualities of Spanish are noted. Ch is [tʃ],
tz is [ts], x is [ʃ], j is [h] or [x]. All other symbols are
read the same in both Spanish and English. The 
names of languages spoken in the United States and 
Canada use h rather than j and sh rather than x. 
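The stress rule stated above can be sketched in a few lines. This is one reading of the rule, not a definitive implementation: it assumes that a word-final n or s is set aside before the rightmost consonant is sought, so that Mopan is stressed like Nawa. The function names stressed_vowel_index and mark_stress are introduced here for illustration only. Names whose written accent has been lost in printing will be handled by the default rule and may therefore come out wrong.

```python
import unicodedata

VOWELS = set("aeiou")
ACCENTED = set("áéíóú")


def stressed_vowel_index(name):
    """Return the position of the stressed vowel in a language name, or None."""
    word = unicodedata.normalize("NFC", name).lower()
    # An accented vowel always carries the stress.
    for i, ch in enumerate(word):
        if ch in ACCENTED:
            return i
    # Otherwise set aside a word-final n or s (assumed reading of the rule).
    end = len(word)
    if word.endswith(("n", "s")):
        end -= 1
    # Find the rightmost consonant, skipping vowels and apostrophes.
    i = end - 1
    while i >= 0 and (word[i] in VOWELS or not word[i].isalpha()):
        i -= 1
    # The stressed vowel is the nearest vowel to the left of that consonant.
    j = i - 1
    while j >= 0 and word[j] not in VOWELS:
        j -= 1
    if j >= 0:
        return j
    # Fallback when no vowel precedes a consonant: stress the first vowel.
    for k, ch in enumerate(word):
        if ch in VOWELS:
            return k
    return None


def mark_stress(name):
    """Return the name with its stressed vowel written as a capital."""
    text = unicodedata.normalize("NFC", name)
    i = stressed_vowel_index(text)
    if i is None:
        return name
    return text[:i] + text[i].upper() + text[i + 1:]


for example in ("Nawa", "Mopan", "Tzeltal", "Kiché"):
    print(example, "->", mark_stress(example))
```

Each letter is examined separately, which is sufficient here because the digraphs ch and tz contain no vowels.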


2.4 The classification scheme 


Although dead languages will be included, only docu- 
mented languages will be listed and classified. 

The classification given here is by Kaufman (after 
consultation with Doris Bartholomew on the classification
of Otomi, and with Søren Wichmann on the
classification of Mije). 


2.4.1 Scholarship 


Historical-comparative work is the most advanced 
for the Mayan, Yuta-Nawan, Mije-Sokean and Oto-Mangean
groups; that for Totonakan and Hokan lags
behind. Most of the remaining genetic units are isol- 
ates (Wave, Kwitlateko, Tarasko) or so poorly docu- 
mented that comparative work is not very rewarding 
(Xinkan, Lenkan). Xinkan and Lenkan are also 
extinct, or nearly so. 

There are eleven genetic groups/entities recog- 
nized for Meso-America: eight groups, families or 
stocks, and three isolates. Nevertheless, every one of 
these genetic entities has been the subject of one or 
more proposals that it is genetically related to some 
other genetic unit, either within or outside Meso- 
America. The following links have been proposed: 


1 Radin, Freeland, Sapir, McQuown, Swadesh, 
Brown and Witkowski, and Greenberg (1987) 
have all contributed to the hypotheses that 
Mayan, Mije-Sokean, Totonakan and Wave are 
all mutually related, either in a group to be 
called ‘Macro-Mayan’ or ‘Mexican Penutian’, or 
individually to the Penutian languages of 
Oregon and California. 

2 A proposal by Whorf and Trager linking Yuta- 
Nawan with Tanoan has been around since 
1937. Greenberg (1987) proposes a special gen- 
etic connection between Oto-Mangean, Yuta- 
Nawan and Tanoan languages; Kaufman 
(1990b) disagrees and proposes a special genetic 
connection between Oto-Mangean and Hokan. 

3 Greenberg (1987) includes Kwitlateko, Tarasko, 
Xinkan and Lenkan in his ‘Chibchan-Paezan’ 
grouping; Swadesh (1959: 19) proposed linking 
Kwitlateko with Yuta-Nawan; Holt proposed 
linking Yuta-Nawan, Chibchan and Panoan. 


None of these proposals has met with general accept- 
ance; none has been thoroughly tested. Those cited 
under 2 and 3 are not all mutually compatible. 

In this classification the name assigned to each 
language is the local vernacular Spanish name unless 
that name is insulting or ambiguous; in such cases the
autonym (self-name, without the replication of non-
European phonological traits) is preferred. Belize has 
no languages not also found in Spanish-dominant 
areas. 

For every language name the normal Spanish 
orthography will be specified, as well as any synonyms 
widely found in the literature. The English spellings 
most commonly found in the literature can usually be 
obtained by removing any final unaccented -o, if 
present, of the Spanish orthography for the name. 
Otherwise the English spelling will be the same as the 
Spanish, unless otherwise noted. 
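The rule of thumb for English spellings can be illustrated as follows. This is a minimal sketch, assuming the Spanish name is supplied with its accents intact; the function name english_spelling is hypothetical, and, as the text says, the rule holds only ‘usually’, so the result should be checked against the listings.

```python
def english_spelling(spanish_name: str) -> str:
    """Drop a final unaccented -o from the Spanish spelling, if present."""
    # An accented final vowel such as "ó" does not match "o" and is left alone.
    if spanish_name.endswith("o"):
        return spanish_name[:-1]
    return spanish_name


for spanish in ("Zapoteco", "Huasteco", "Mixteco", "Tzotzil"):
    print(spanish, "->", english_spelling(spanish))
```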

For the whole of Meso-America, population 
figures are cited from Ethnologue, 15th edition 
(Gordon, ed., 2005). Sometimes there are great dis- 
crepancies between the figures cited here and official 
census figures. 

The computational technique of glottochron- 
ology allows the assignment of approximate dates to 
the time of separation of pairs of languages of known 
common origin. The glottochronological time depths 
given (in centuries) in the listings below are by 
Kaufman for Mayan and Mije-Sokean, by Swadesh 
and Hale for Yuta-Nawan, by Manrique, Bartholo- 
mew and Cazés for Oto-Pamean (harmonized by 
Kaufman), and by Swadesh for the rest. 
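The atlas does not state the formula behind these figures. The sketch below uses the standard glottochronological formulation usually associated with Swadesh and Lees, t = ln c / (2 ln r), where c is the proportion of shared cognates on the test list and r the assumed retention rate per millennium. The value r = 0.86, commonly cited for the 100-word list, is an assumption here and not a figure from this atlas; the function name separation_centuries is likewise introduced only for illustration.

```python
import math


def separation_centuries(shared_cognates: float, retention: float = 0.86) -> float:
    """Estimate time since separation, in centuries, from the cognate share."""
    if not 0 < shared_cognates <= 1:
        raise ValueError("shared_cognates must be a proportion in (0, 1]")
    # Both languages change independently, hence the factor of 2.
    millennia = math.log(shared_cognates) / (2 * math.log(retention))
    return 10 * millennia


# Hypothetical example: two languages sharing 74 per cent of the list.
print(round(separation_centuries(0.74), 1), "centuries")
```

A lower cognate share yields a greater time depth, which is why the deepest stocks in the listings carry the largest figures in centuries.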

Numbers given beside language names corres- 
pond to the number on the map keys. 


2.5 The languages listed and 
classified 


2.5.1 The Mayan family 
See Table 2.5 on page 52. 


2.5.2 The Mije-Sokean family 
Table 2.2 Mije-Sokean family (Mexico) — 35 centuries 


2.5.4 Wave (isolate) 
Table 2.4 Wave isolate (Mexico) 


Language        No. on map  Population  Location
Wave (Huave)    31          16,000      Oaxaca


2.5.5 Hokan 


Only two groups within the Hokan stock are located 
in Meso-America. There may be two main branches, 
which we could call Northern Hokan and Southern 
Hokan. The Southern Hokan languages will be classi- 
fied here, but only Chontal and Tol are mapped. 

The putative Northern branch consists of the 
Pomo family (see pp. 8, 20) in California, which 
includes the extinct Chimariko language (see pp. 8, 
12), the extinct Yana group (see pp. 8, 25), the Karik 
language (see Karuk, pp. 8, 16), the extinct Shasta 
Konomihu group (see pp. 8, 21), the Washu lan- 
guage of Nevada (see Washo, pp. 8, 24), and the 
extinct Esselen (see Esselen, pp. 8, 14) and Salina (see 
pp. 8, 21) languages. Southern Hokan embraces the 
Yuma-Kochimi family, the Seri language of Sonora, 
the extinct Pajalate and Yué (Kotoname) languages, 
and the extinct Yeméan (Komerkrudo) group, once 
spoken mainly in Texas but also possibly in Coahuila 
state. 

Greenberg and Swadesh (1953: 221), using their 
200-word lexicostatistic list, estimated the time depth of
Hokan as 55 ± 7 centuries. Swadesh (1959), based on
an application of the 100-word glottochronological 
list, gives parts of Hokan at least 55 centuries, so that 
all of Hokan must have had at least 2,000 years more 
of time depth. Jacobsen (in Campbell and Mithun 
1979: 545) comments: ‘Hokan seems to be rather 
comparable to Indo-European in its time depth and 
diversity of branches, and may well be even older.’ 
Around 8,000 years may be a fairly good guess. 


Grouping / Language (Spanish name)            No. on map  Population  Location
Sokean branch (14 centuries)
  Soke (Zoque) language area                  22          36,500
    Oaxaca Soke em. lang.                     22a                     Oaxaca
    Chiapas Soke em. lang.                    22b                     Chiapas, Tabasco
  Soteapaneko (Sierra Popoluca)               23          30,000      Veracruz
  Teksistepekenyo (Popoluca de Texistepec)    24          427         Veracruz
Mijean branch (age unknown)
  Sayulteko (Popoluca de Sayula)              25          4,000       Veracruz
  Oluteko (Popoluca de Oluta)                 26          102         Veracruz
  (Oaxaca) Mije (Mixe) lang. area             27          85,700      Oaxaca
    Northern Highland Mije em. lang.          27a
    Southern Highland Mije em. lang.          27b
    Midland Mije em. lang.                    27c
    Lowland Mije em. lang.                    27d
  Tapachulteko (Tapachulteco) (extinct)       28          0           Chiapas


Note: 


Teksistepekenyo is now spoken only in the town of Texistepec, Veracruz. In the sixteenth century a large area was occupied by this and related forms 
of speech, which are not however documented. This whole area is dubbed ‘Isthmus Sokean’ for the purposes of this map. For present-day speakers, 
the term ‘Popoluca’ in Spanish is considered offensive, and is not used on the map. 


2.5.3 The Totonakan family


Table 2.3 Totonakan family (Mexico) — 26 centuries 


Grouping / Language (Spanish name)      No. on map  Population  Location
Totonako (Totonaco)                     29          230,000     Puebla, Veracruz
  Highland Totonako dialect                                     Puebla, Veracruz
  Lowland/Papantla Totonako dialect                             Veracruz
  Juarez Totonako dialect                                       Puebla
  Misantla Totonako dialect                                     Puebla, Veracruz
Tepewa (Tepehua)                        30          10,000      Veracruz, Hidalgo, Puebla
  Tlachichilco Tepewa dialect                                   Veracruz
  Huehuetla Mecapalapa Tepewa dialect                           Hidalgo
  Pisaflores Tepewa dialect                                     Puebla
Table 2.5 Mayan family (Mexico, Guatemala, Belize) — 42 centuries


Grouping / Language (Spanish name)                    Autonym        No. on map  Population  Location
Wastekan branch (9 centuries)
  Wasteko (Huasteco)                                  Te:nek         1           120,000     San Luis Potosí, Veracruz
  Kabil (Chicomucelteco) (extinct)                                   2           0           Chiapas
Yukatekan/Maya branch (10 centuries)
  Yukateko (Yucateco)                                 Ma:zya         3a          700,000     Yucatan, Campeche, Quintana Roo, Belize
  Lakantún (Lacandón)                                 Ma:zya         3b          700         Chiapas, Guatemala
  Itzaj (Itza)                                        Maya           3c          12          Guatemala
  Mopan                                               Maya           4           9,600       Guatemala, Belize
Western Mayan branch (30 centuries)
  Greater Tzeltalan group (19 centuries)
    Cholan subgroup (14 centuries)
      Cholti (extinct)                                               5a          0           Guatemala
      Chortí (Apay)                                   Ch'orti'       5b          31,500      Guatemala
      Chol                                            Ch'ol          6           125,000     Chiapas
      Yokotan (Chontal)                               Yokot'an       7           55,000      Tabasco
    Tzeltalan proper subgroup (14 centuries)
      Tzotzil                                         B'atz'i k'op   8           265,000     Chiapas
      Tzeltal                                         B'atz'il k'op  9           300,000     Chiapas
  Greater Qanjobalan group (21 centuries)
    Chujean subgroup (16 centuries)
      Tojolabal                                                      10          36,000      Chiapas
      Chuj                                                           11          41,000      Guatemala
    Qanjobalan proper subgroup (15 centuries)
      Qanjobal language area (7 centuries)                           12
        Qanjobal emergent language                                   12a         77,700      Guatemala
        Akateko em. lang. (Acateco)                                  12b         48,500      Guatemala
        Popti em. lang. (Jacalteco)                                  12c         20,000?     Guatemala
      Kotoke (Cotoque) language area (6 centuries)                   13
        Mochó em. lang. (Motozintleco)                               13a         168         Chiapas
        Muchú em. lang. (Tuzanteco)                                  13b         100?        Chiapas
Eastern Mayan branch (34 centuries)
  Greater Mamean group (26 centuries)
    Mamean proper subgroup (15 centuries)
      Teko (Tectiteco)                                               14          2,200       Chiapas, Guatemala
      Mam                                             Qyo:l          15          490,000     Guatemala
    Ixilan subgroup (?age unknown)
      Awakateko (Aguacateco)                                         16          18,000      Guatemala
      Ixil                                                           17          69,000      Guatemala
  Greater Kichéan group (26 centuries)
    Uspanteko (Uspanteco)                                            18          3,000       Guatemala
    Kichéan proper subgroup (10 centuries)
      Kiché (Quiché)                                                 19a         647,000     Guatemala
      Sakapulteko (Sacapulteco)                                      19b         36,000?     Guatemala
      Sipakapenyo (Sipacapense)                                      19c         6,000       Guatemala
      Kaqchikel-Tzutujil language area (8 centuries)
        Tzutujil em. lang.                                           19d         83,800      Guatemala
        Kaqchikel (Cakchiquel) em. lang.                             19e         439,400     Guatemala
    Poqom subgroup (10 centuries)                                    20
      Poqomam                                                        20a         41,000      Guatemala
      Poqomchi (Pocomchi)                                            20b         85,000      Guatemala
    Qeqchí (Kekchi)                                                  21          400,000?    Guatemala
Table 2.6 Hokan stock (Mexico, Honduras)


Grouping / Language (Spanish name)                No. on map  Population  Location
Chontal group (Tequistlateco)                     32          4,500       Oaxaca
  Lowland Chontal dialect area (Huamelulteco)
  Highland Chontal dialect area (Tequistlateco)
Tol family (Jicaque) (Honduras)
  Western Tol                                     33a         extinct
  Eastern Tol                                     33b         350         Honduras


Note:


The Tol people and their languages have usually been called ‘Jicaque’, but this is a term of abuse: in Kiché it means ‘wild Indian, savage, cannibal’,
which obviously comes from a vernacular and derogatory Spanish usage.


2.5.6 Oto-Mangean


Oto-Mangean is made up of eight families and
individual languages whose internal make-up has been
known since roughly 1900. Connections among them
had been observed since late in the previous century,
but the existence of Oto-Mangean was outlined
basically during the 1920s. For one of the families,
Mistekan, there has been some question about
whether to include or exclude Amusgo and Trike.
Between the constituent members are two levels of
structure. There seems to be a major division between
‘east’ and ‘west’, and then each of these divisions has
subdivisions above the family level. As for labels for
levels of relatedness, Oto-Mangean is called a ‘stock’, of
the order of Indo-European. Intermediate groupings
are still being worked out. See Table 2.7 on facing page.


2.5.7 Yuta-Nawan


This stock, traditionally called ‘Uto-Aztecan’ (48
centuries), belongs to the USA and Mexico; it has two
major branches, Northern and Southern. Some of the
Southern languages are spoken in Meso-America, and
these are shown on the map. The remaining Southern
languages will be classified in this section as well, but
Northern languages will not be listed. The Southern
languages fall into two groups: Sonoran and Nawan.
See Table 2.8 on page 54.

Just what kind of Nawa was spoken by the Nicaraos,
a Nawa-speaking group of Nicaragua, cannot be
determined, owing to lack of data. Garbled versions
of it are embodied in a vernacular Nicaraguan folk
drama in Spanish.


2.5.8 Other known languages 
See Table 2.9 on page 54. 


2.5.9 Linguistically undocumented ethnic 
groups 


The ethnic groups of Meso-America discussed below 
lost their languages (and eventually their ethnic iden- 
tities) before their languages could be recorded, and 
consequently they cannot be classified linguistically. 
In north-western (or far western) Meso-America,
in the Mexican states of Nayarit, Durango, Jalisco and
Colima, only a few of the languages have survived.
The rest have perished without documentation.
Those that perished include some otherwise known,
such as Nawa and Otomi.


Table 2.7 Oto-Mangean stock (Mexico, Nicaragua, Costa Rica)


Grouping / Language                                         No. on map  Population  Location
Western Oto-Mangean division
  Oto-Pamean-Chinanteko branch (40 centuries)
    Oto-Pamean family (36 centuries)
      Pamean/Northern Oto-Pamean subfamily (25 centuries)   34
        Chichimeko (Chichimeco)                             34a         1,200       Guanajuato, Mex.
        Pame complex (14 centuries)                                     5,600       San Luis Potosí, Mex.
          Northern Pame virtual lang.                       34b         1,000
          Central Pame virtual lang.                        34c         3,000
          Southern Pame virtual lang.                       34d         extinct
      Southern Oto-Pamean subfamily (24 centuries)
        Matlatzinkan-Okwilteko lang. area (8-9 centuries)   35                      Mexico, Mex.
          Matlatzinkan em. lang. (Matlatzinca)              35a         100
          Okwilteko em. lang. (Ocuilteco)                   35b         50
        Otomian group (10 centuries)
          Otomi lang. area (8 centuries)                    36          216,000
            Northeast Otomi em. lang.                       36a                     Veracruz
            Northwest Otomi em. lang.                       36b                     Hidalgo
            Western Otomi em. lang.                         36c                     Queretaro/Michoacan
            Tilapa Otomi em. lang.                          36d                     Mexico
            Ixtenco Otomi em. lang.                         36e                     Tlaxcala
            Jalisco Otomí (extinct, undocumented)           36f                     Jalisco
          Masawa (Mazahua)                                  37          365,000     Mexico/Michoacan
    Chinanteko family (Chinanteco) (15 centuries)           38          86,000      Oaxaca
      Ojitlan Chinanteko                                    38a
      Usila Chinanteko                                      38b
      Quiotepec Chinanteko                                  38c
      Palantla Chinanteko                                   38d
      Lalana Chinanteko                                     38e
      Chiltepec Chinanteko                                  38f
  Tlapaneco-Mangean branch (40 centuries)
    Tlapaneco-Sutiaba lang. area (8 centuries)              39          66,000
      Malinaltepec Tlapaneko em. lang.                      39a                     Guerrero
      Azoyú Tlapaneko em. lang.                             39b                     Guerrero
      Sutiaba em. lang. (extinct)                           40                      Nicaragua
    Mangean family (13 centuries)
      Chiapaneko (Chiapaneco)                               41          extinct     Chiapas
      Mange                                                 42          extinct     Honduras, Nicaragua
Eastern Oto-Mangean division (47 centuries)
  Popolokan-Sapotekan branch (35 centuries)
    Popolokan (Popoloca) family (24 centuries)
      Masateko (Mazateco) complex (10 centuries)            43          174,500
        Huautla-Mazatlan Masateko lang.                     43a         72,000      Oaxaca
        Ayautla Soyaltepec Masateko lang.                   43b         27,500      Oaxaca, Puebla
        Jalapa Masateko lang.                               43c         15,500      Oaxaca, Veracruz
        Chiquihuitlan Masateko lang.                        43d         2,500       Oaxaca
      Chochoan subfamily (12 centuries)                     43c
        Iskateko (Ixcateco)                                 44          119         Oaxaca
        Chocho-Popoloka lang. area (8 centuries)            45
          Chocho em. lang.                                  45a         770         Oaxaca
          Northern Popoloka em. lang.                       45b         5,000       Puebla
          Western Popoloka em. lang.                        45c         6,000       Puebla
          Eastern Popoloka em. lang.                        45d         5,000       Puebla
    Sapotekan family (24 centuries)
      Sapoteko (Zapoteco) complex (14 centuries)            46          498,400     of all varieties
        Northern Sapoteko lang. area                        46a                     Oaxaca
        Eastern Sapoteko lang. area                         46b                     Oaxaca
        Central Sapoteko lang. area                         46c                     Oaxaca
        Southern Sapoteko lang. area                        46d                     Oaxaca
      Chatino lang. area                                    47          36,000      of all varieties
        Yaitepec Chatino em. lang.                          47a                     Oaxaca
        Tataltepec Chatino em. lang.                        47b                     Oaxaca
        Zenzontepec Chatino em. lang.                       47c                     Oaxaca
  Amusgo-Mistekan branch
    Amusgo (Amuzgo)                                         48          26,200      Oaxaca, Guerrero
    Mistekan family (37 centuries)
      Misteko-Kwikateko (Mixteco-Cuicateco) subfamily
        Misteko (Mixteco) complex (15 centuries)            49          376,000     of all varieties
          Northern Misteko lang. area                       49a                     Oaxaca, Puebla
          Central Misteko lang. area                        49b                     Oaxaca
          Southern Misteko lang. area                       49c                     Oaxaca, Guerrero
        Kwikateko (Cuicateco)                               50          18,500      Oaxaca
      Trike (Trique) lang. area (10 centuries)              51          23,000      of all varieties
        Chicahuaxtla Trike em. lang.                                    6,000       Oaxaca
        Copala Trike em. lang.                                          15,000      Oaxaca

The group known as ‘Sayultecos’ is likely to have 
spoken Nawa, and is thus assigned on the contact- 
period map. The groups known as ‘Xilotlantzincas’, 
‘Tamazultecos’ and ‘Cuyotecos’ are believed by Gerhard
to have possibly spoken Nawa, but they are left 
unclassified by Longacre. They are here assigned to 
Nawa, but with cross-hatching. 

The groups in this area known as Pinome 
(Pinotl), Coca (Koka), Tecuexe (Tekwexe), Cazcan 
(Kaskan) and Tecoxquin (Tekoxkin) are all widely 
believed to have spoken Yuta-Nawan languages (but 
not Nawa), and I find this plausible. I suggest that 
they may all have been part of the Koran group of 
Sonoran languages. It should be pointed out, how- 
ever, that these languages were not documented 
before extinction. ‘Totorame’ is asserted in ethnohistorical
sources to have been closely related to Kora. 
All the names just mentioned are plotted and 
assigned to Koran, but with cross-hatching for those 
names for which no linguistic evidence has been 
found. Pinotl, pl. pinomeh, is a Nawa word meaning 
‘speaker of a foreign language’. Chontalli means ‘for- 
eigner’ and popoloca means ‘barbarian, babbler, 
mumbler’. These names were applied by the Nawas to 
a variety of unrelated languages and ethnic groups. 

The surviving languages of this area are Kora, 
Wichol (often called ‘Tecual’ in colonial sources) and 
a dialect of Western Nawa known locally as Mejicanero
(self-name nawal), which is virtually the same as 
the dialect spoken at Aquila on the south-west coast 
of Michoacan and known there as Pómaro. The 
extinct forms of Nawa that were spoken in this region 
may all have been forms of Western Nawa. 

Since far western Meso-America was Meso- 
americanized relatively late, it is possible that some of 
the now extinct languages of this area were linguistic 
isolates or represented otherwise unknown branches 
of recognized linguistic families. 

On the south-east coast of Michoacan and the 
south-west Guerrero coast three ethnic names are 
attested for which no linguistic data are known: 
Chumbia, Tolimeco and Panteco. 

Certain now-extinct ethnolinguistic groups may 
have spoken Yuta-Nawan languages: Zacateco, Guachichil
and Lagunero. Guamare lay just west of Pame. 
All these groups, and the Pame as well, were called 
chichimeka (‘savages, wild Indians’) by the Nawas of 
central Mexico. Pame, although its original distribu- 
tion lay just outside Meso-America, is included on the 
Meso-American map because it would otherwise be 
the only Oto-Mangean language unmapped, and in 
any case the present-day Pame are Mesoamericanized. 
The ancestor of the Nawan languages began to 
diversify about 1,500 years ago, at the time when the 
Nawas entered Meso-America from the barbarian 
north. Their previous homeland was somewhere 
between Wasteko on the east and Koran on the west, 
possibly on the territory of the Guachichil speakers, 
but I have no linguistic data from Guachichil that 
would tell us whether Guachichil was closely related 
to Nawa. 

The Tepekanos, Tepewanes, Tawe-Mokoritos and 
Akaxee-Xiximes, all of north-western Mexico, had no 
Meso-American architecture as of 1520, but had 
Meso-American style pottery. I might consider 
extending the Meso-American frontier to embrace 
these groups. Due to space constraints, however, they 
are not plotted on the maps. The Tepekanos and 
Tepewanes speak dialects of the Tepiman language 
area, a part of the Sonoran branch of Yuta-Nawan. 
The Tawe-Mokorito and Akaxee-Xixime peoples have 
died out, and their languages were not documented, 
but it is generally believed that they spoke Sonoran 
languages, not, however, as members of the Tepiman 
language area. 
Table 2.8 Yuta-Nawan stock 


Grouping Language No. on map Population Location 
Northern Yuta-Nawan branch (not given here) 
(Uto-Aztecan: see Section 1, pp. 8-9) 
Southern Yuta-Nawan branch 
Sonoran family 
Kahita (Cahita) 56,400 Sonora 
Yaki (Yaqui) dialect 16,400 
Mayo dialect 40,000 
Teweko (Tehueco) dialect (extinct) 
Tubar (extinct) 
Opata-Eudeve lang. area (7 centuries) 
Opata (extinct) 
Eudeve (extinct) 
Tepiman lang. area (7 centuries) 
Pima-Papago, Névome em. lang. 15,000 Sonora 
Mountain Pima (Pima Bajo) em. lang. 800 Sonora 
Northern Tepewan (Tepehuan) em. lang. 8,000 Chihuahua 
Southern Tepewan (Tepehuan) em. lang. 
South-east Tepewan dialect 8,000 Durango 
South-west Tepewan dialect 6,000 Durango 
Tepekano (Tepecano) dialect (extinct) 
Taraumara-Warijiyo lang. area (7 centuries) 
Taraumara (Tarahumara) em. lang. 65,400 of all varieties 
Central (Samachique) dialect Chihuahua 
Northern (Ariseachi) dialect Chihuahua 
South-western (Tubare) dialect Chihuahua 
Western (Rocoroibo) dialect Chihuahua 
Warijiyo (Guarijio) 5,000 total 
Highland dialect Chihuahua 
Lowland dialect Chihuahua 
(possible dialect or separate language: Maculai) 
Koran group (15 centuries) 
Kora (Cora) 52 15,000 Nayarit (+USA) 
Wichol (Huichol) 53 20,000 Nayarit, Jalisco 
[undocumented: Tekoxkin (54), Kaskan (55), Tekwexe (56), Koka (57)] 
Nawan family 
Pochuteko (extinct) 58 
Nawa (Nahuatl) lang. area (11 centuries) 59 1,532,600 of all varieties 
Central Nawa dialect area 59a Guerrero, Puebla, Mexico, 
Tlaxcala, Morelos 
Western Nawa dialect area 59b Michoacan, Durango 
Northern Nawa dialect area 59c Hidalgo, San Luis Potosi, 
Veracruz, Puebla 
Eastern Nawa dialect area 59d Puebla, Veracruz 
Pipil em. lang. 59e 20 El Salvador 
[undocumented: Pinotl] (59f) 
Table 2.9 Other native languages of Meso-America 
Grouping Language No. on map Population Location 
Kwitlateko isolate 60 2 Mexico, Guerrero 
Tarasko isolate (Tarascan/Purepecha) 61 120,000 Mexico, Michoacan 
Shinkan complex (Xinka) 62 extinct Guatemala 
Yupiltepegue virtual language 62a 
Jutiapa virtual language 62b 
Jumaytepegue virtual language 62c 
Chiguimuliya virtual language 62d 
Guazacapan virtual language 62e 
Lenkan family/group 63 
Serkin (Cerquin) 63a a few Honduras 
Kare (Care) 63b extinct Honduras 
Lenka (Lenca) 63c extinct Honduras 
Kolo (Colo) 63d extinct Honduras 
Poto (Chilanga) 63e extinct El Salvador 
3.1 The known languages, language families, genetic units and isolates of South America 


South America is defined as continental South 
America, lower Central America and the Antilles; 
that is, Latin America without Meso-America. The 
boundary between Meso-America and South America 
assigns Xinka (Spanish Xinca), Lenkan and Tol 
(Spanish Jicaque) to Meso-America, and Arawakan 
(Garifuna), Misumalpan (Miskitu, Sumu-Ulwa, 
Kakaopera, Matagalpa) and Chibchan (Paya, Rama) 
languages to South America. Pipil, Chorotega-Mange 
and Sutiaba-Maribio are Meso-American outliers in 
lower Central America. 

Lower Central America has traits of culture and 
natural environment that are intermediate between 
those of Meso-America and Colombia, and the fact 
that the modern states (earlier Spanish colonies) 
of Central America, except for Panama, have (since 
1525) been culturally fairly closely connected to 
Mexico may mislead the casual observer into 
supposing that lower Central America is naturally 
linked to Meso-America. In terms of linguistics and 
prehistory, however, the Chibchan languages belong 
with continental South America; and, since the 
Misumalpan languages are most probably related to 
Chibchan, they too belong in South America. 
Garifuna is an Arawakan language that around AD 
1800 colonized the Atlantic coasts of Honduras, 
Guatemala and Belize, thereby claiming its 
territory for South America. The line between 
Meso-American and South American languages thus 
separates upper Central America (in Meso-America) 
from lower Central America (in South America). 


This second edition of the atlas, unlike the first, 
also plots the languages of colonial inheritance in 
South America, by far the most widespread of which 
is Spanish, followed by Portuguese (in Brazil), as well 
as Dutch, French and English. 


3.2 Research on South American indigenous languages 


The systematic study of South American indigenous 
languages began only in the 1940s. There are four 
main sets of professional linguists: 


• foreign-born missionaries, mostly Protestants, 
mostly members of the Summer Institute of 
Linguistics, mostly North Americans, with a 
small number of Europeans; 

• foreign-born academics, mostly North Americans,
but with a fair number of Europeans; 

• native-born neocolonials, many trained locally, 
many abroad, quite a few of the latter in Europe; 

• native speakers of the languages themselves, a 
minute, but growing, number. 


Since the mid-1950s, the amount of published 
material on South American languages has been 
gradually growing, but even so the number of 
researchers is far smaller than the number of lin- 
guistic communities whose speech should be docu- 
mented. Given current employment opportunities, it 
is not likely that the number of specialists in South 
American indigenous languages will increase fast 
enough to document most of the surviving languages 
before they go out of use, as most of them unavoid- 
ably will. More work languishes in personal files than 
is published, but this is a standard problem. 

It is fair to say that South America and New 
Guinea are the poorest-documented parts of the 
world. However, in the early 1960s fairly systematic 
efforts were launched in Papua New Guinea, and that 
area — much smaller than South America, to be sure — 
is in general much better documented than any part 
of indigenous South America of comparable size. 


3.3 The classification 


The following classification was originally prepared in 
1985 as an internal document of the South American 
Indian Languages Documentation Project (SAILDP); 
an earlier attempt at this was made in 1966 (see 
Kaufman 1990: 28-30). It is based on the overall 
agreements in the classifications of Loukotka (1968), 
Greenberg (1987), Suarez (1974) and Swadesh (1959) 
as to what genetic groups and isolates are found in 
South America. It is also based on an examination of 
the evidence found in Loukotka as well as many other 
(but by no means all) studies of language classifica- 
tion in South America. The present classification was 
revised up to 1989 through study of data and consult- 
ation with specialists for many of the South American 
families and stocks. The data of Greenberg (1987) still 
await careful examination, though they do not so 
much bear on the validity of the groups recognized 
here, or on their internal classification, as on possible 
genetic connections across them. 

The purpose of this section of the atlas is to list 
the genetic groups and isolates of South America, to 
name the languages and dialects found in each 
genetic group, and to lay out the internal branching 
of each genetic group. For each language and dia- 
lect, information as to location, population and 
synonymies will be provided. 

All the documented languages of South America 
are grouped into 118 stocks, families, language com- 
plexes, language areas (see the section on Meso- 
America for the meaning of these two terms) and 
isolates, collectively called genetic units. Of these 118 
genetic units, 70 are isolates and 48 are groups of at 
least two languages. Each of these latter groups con- 
tains only languages of undoubted genetic relation- 
ship (common origin). There are certainly genetic 
relationships across and among many of these 118 
isolates and groups. In a few cases only, some initial 


evidence that roughly conforms to the demands 
of the comparative method has been offered in 
support of cross-unit genetic connections. Genetic 
connections across non-controversial genetic units 
are in general not accepted herein unless there is 
widespread agreement as to their validity and some 
reason to believe them beyond mere assertion by 
authorities. 

In this listing, reference will be made to the 
number of languages, amount of internal diversifica- 
tion, and glottochronological time depth as calculated 
by Swadesh. 

The internal classification of genetic groups con- 
taining more than two languages is a matter of 
considerable uncertainty in most cases, because 
reconstruction and the tracing of common innov- 
ations has not yet been carried out. Even so, a con- 
siderable literature exists whose topic is the internal 
classification of South American language stocks and 
families. It is reasonable to examine these in detail 
whenever they are based on first-hand examination of 
the (or some) linguistic data. Such a classification is 
called an original classification. 

Since Loukotka looked at South American data 
over several decades, and was very reluctant to lump 
groups together, his classification and internal sub- 
groupings should be taken seriously. 

Greenberg’s methods are discussed in Kaufman 
(1990). What is relevant to the current listing is that 
for each of the genetic groupings named herein, 
Greenberg typically simply lists the languages in 
alphabetical order. Thus he offers no opinion as to the 
subgrouping of the genetic units established here. 
In the few cases where he does offer partial internal 
classifications, this will be noted. 

For a discussion of the ‘time depth’ assigned to 
South American languages, those of the Andean 
region in particular, see Adelaar (2004). 

Some classifiers of South American languages are 
compilers, in that they form a synthesis of the results 
of earlier classifications. While they may in a few 
cases propose innovations based on the examination 
of linguistic data, generally they are not reporting on 
the results of personal research. This latter type of 
classification is called a derivative classification. 

Classifications are also of two types in terms of 
their structure: type B classifications specify the gen- 
etic groups by naming the languages (or ethnolinguis- 
tic groups) included in each group; type A classifica- 
tions in addition specify the internal classification of 
each genetic group. Type A classifications are more 
highly valued. 

Original classifiers (type B and type A) of South 
American languages are: 


Brinton 1891 

Chamberlain 1907 (57 families) 
Chamberlain 1913 (83 families) 

Rivet 1924 (77 families, 1,240 languages and 
dialects) 

Loukotka 1935 (Spanish, 94 families) 
Loukotka 1939 (Portuguese, Brazil) 
Loukotka 1944 (German, 114 families + 27 
unclassified languages) 
Nimuendaju 1981 (1944) (Lowland South 
America) 

Rivet and Loukotka 1950 (108 families) 
Greenberg 1956, 1987 

Swadesh 1959 

Loukotka 1968 (English) 

Rodrigues 1986 (Brazil) 

It should be kept in mind that a large number of 
previously unknown languages have been discovered 
in the past century, and several new linguistic families 
as well. The increase between Rivet 1924 (77) and 
Loukotka 1968 (117 families) is partly a result of 
more languages coming to light and partly because 
Rivet was a bold ‘lumper’ while Loukotka was a 
cautious one. 

Derivative classifiers (type B and type A) of 
South American languages are: 


McQuown 1955 

Tovar 1961, Tovar and Tovar 1984 
Voegelin and Voegelin 1964, 1965 

Suarez 1974 (some original discoveries) 
Migliazza 1980 (some original discoveries) 
Ethnologue (Grimes 2000, mostly following 
Ruhlen 1987) 

The classification offered here falls between the two 
categories, ‘original’ and ‘derivative’. Comparison of 
the classifications of Chibchan, for example, by Rivet 
(1924), Rivet and Loukotka (1950) and Loukotka 
(1968), with the author’s own classification (1988) 
based on all available reconstructive efforts, yielded 
similar results and allows some confidence in this 
compromise classification. 


3.3.1 Nomenclature 


All family and stock names — except those that refer to 
the group only and none of its constituent members 
(such as Je or Chon) and excepting those families that 
contain just two languages (such as Tikuna-Juri or 
Jaruro-Takame) — contain the suffix -an, in conform- 
ity with the usage of linguists writing in English. In 
South America, usage is in flux, and virtually all my 
specific choices have already been favoured by one 
previous writer or more. In Spanish, the suffix -ano, 
-ana is used to form language family names, in the 
same way as -an in English. 

Names for unwritten languages tend to be 
subject to change in the usage of outsiders, and every 
scholar feels more or less free to choose among avail- 
able synonyms. Below I outline the principles used in 
deciding which available synonym to accept. If the 
language is still spoken, I prefer to use names that 
remain current in vernacular usage in the national 
language. That is, unless unacceptable, an exonym 
(name used by speakers of a colonial language) is 
preferred, if only to avoid pejoratives used by neigh- 
bouring speech communities. Whether a language is 
alive or dead, its name should not be any of the 
following: 


• an ordinary lexical item in the local colonial 
vernacular (‘national language’): for instance, Sp. 
coto ‘howler monkey’, orejón ‘big-eared’, colorado 
‘red’, Portuguese botocudo ‘lip plugged’, coroado 
‘crowned’, jabuti ‘turtle’; 

• pejorative in Spanish, Portuguese, English, 
French, Dutch, Nawa, Tupi or Kechua: for 
example, Tupi maku ‘slave, underling’, Mashko, 
Alakaluf, Kechua auka ‘savage’, Nawa popoluca 
‘babbler’; 

• a misnomer: for example, a word that refers to 
land or people rather than to language: Mapuche 
‘earth place’ (instead of Mapudungu ‘earth language’),
Waorani ‘Sabela people’ (instead of Wao 
‘Sabela’); 

• a blunder: for example, Digüt (name of a chief); 

• associated with any other linguistic data; 

• not standardized in the local colonial vernacular; 

• ambiguous through the accident of having been 
applied to more than one language: for example, 
Apiakaé, Juma, Kanamari, Katukina, Kulina, 
Chontal. Such names as these can be used only 
once each. 


Some languages, of course, do not have an autonym — 
having simply a word that means ‘our language’ or 
‘real language’ — and so they must be known by an 
exonym. Although the use of Spanish and Portuguese 
lexical items as language names has generally been 
avoided, they have been provisionally accepted in a 
few cases where all other alternatives have been 
unacceptable. 

In Meso-America, a vast number of place names 
and ethnolinguistic names are of Nawa origin, due 
both to the pre-existing spread of Nawa settlements 
and outposts, and the policy of the Spanish of settling 
Nawas in new places. The Nawa way of referring to a 
non-Nawa ethnic group was the one most likely to be 
known to Spanish speakers, and thus usually became 
the origin of the vernacular Spanish form of today. In 
the Amazon basin, and in all of lowland tropical 
South America except the Pacific slope, Tupi (now 
known as Jeral where it survives; it is sometimes said 
to be creolized, but this seems to be an exaggeration) 
was the language of colonization alongside or instead 
of Portuguese. Consequently, the names in Spanish 
and Portuguese of the Arawakan and Tukanoan (and 
many other) languages of the Amazon are often 
words of Tupi origin. In the Andes, and the Pacific 
slope of Ecuador and Peru, vernacular Spanish names 
for Indian groups are often of Kechua origin, for par- 
allel reasons. 

The prefix ‘macro-’ has been floating about for 
some time, with no very well defined applications. 
For a while superstocks or phyla were named ‘macro- 
X’. For the most part ‘macro’ has been used to label 
the proposed union of two previously recognized 
genetic entities, at least one of which was a stock. 
Most of the proposed macro-groupings have not yet 
been demonstrated. I propose using the element 
‘macro-’ to designate a set of language families and/or 
isolates that seem likely to be genetically related, but 
that have not been demonstrated to be related. The 
groups designated by a macro-label I would call clus- 
ters. As the genetic relationships within a proposed 
cluster are validated or disproved, the valid genetic 
groupings should receive names that do not include 
‘macro-’. Thus ‘macro-’ would be used in an 
unambiguous way. 


3.3.2 Spelling of names 


I use a standardized spelling system for the names of 
all languages, dialects and genetic groups of South 
America. The spelling system is independent of that 
of Spanish, Portuguese, English or French, and simi- 
lar to each in different ways. Normal Spanish spell- 
ings of each name are given in brackets at the first 
instance of use. 

A single orthographic system has been used for 
the spelling of all names and a single spelling has 
been adopted for each name. This system must be 
able to represent the sounds of both Spanish and Por- 
tuguese and does not fully follow the principles of 
either. Each symbol and digraph has only one value. 
The symbols p, b, m, n, l, r, i, u, a, y are pronounced as 
in Spanish and Portuguese. In other cases, w corresponds
to Spanish gu, hu; ly to Spanish ll, Portuguese 
lh; ny to Spanish ñ, Portuguese nh; k to Spanish/ 
Portuguese c, qu; and sh to Portuguese x. 
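
The correspondences just listed can be illustrated with a small sketch in Python. It is not the procedure used to compile the atlas: the function name `standardize`, the ordering of the rules and the restriction to single-word Spanish names are all assumptions made for illustration. The rule turning c into s before e or i is not stated in the text; it is inferred from pairs such as Serkin (Cerquin) in the tables.

```python
import re

# Ordered rewrite rules for Spanish spellings (illustrative assumption only).
RULES = [
    (r"qu", "k"),                  # qu -> k
    (r"c(?=[ei])", "s"),           # inferred rule: c before e/i -> s
    (r"c(?!h)", "k"),              # remaining c -> k, leaving the digraph ch alone
    (r"gu(?=[ao])", "w"),          # gu before a/o -> w
    (r"(?<!c)hu(?=[aeio])", "w"),  # hu before a vowel -> w (but not the h of ch)
    (r"ll", "ly"),                 # ll -> ly
    (r"ñ", "ny"),                  # ñ -> ny
]


def standardize(name: str) -> str:
    """Respell a single-word Spanish name using the rules above."""
    out = name.lower()
    for pattern, replacement in RULES:
        out = re.sub(pattern, replacement, out)
    # Restore the initial capital if the input had one.
    return out.capitalize() if name[:1].isupper() else out


if __name__ == "__main__":
    for spanish in ["Quimbaya", "Guambiano", "Muellama", "Huichol", "Cerquin"]:
        print(spanish, "->", standardize(spanish))
```

Names whose atlas spelling reflects further decisions (for example Cuicateco, given as Kwikateko) are not reproduced by these few rules, so the sketch should be read only as a restatement of the listed correspondences.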


3.3.3 Larger groupings: clusters 


Hypotheses exist linking many of the 118 genetic 
units set up in Kaufman (1990). Some of these seem 
plausible, either because of agreement between sev- 
eral prominent scholars or because the (always 
informal) evidence for the proposed connections is 
fairly persuasive. Those that I consider to be promis- 
ing, or worth looking into due to the prestige of their 
proponents, are presented here. Groupings that are 
recognized by at least two scholars and to which 
Swadesh assigns time depths of less than 50 centuries 
of diversification are called stocks. Groupings of 
greater time depth, or ones to which Swadesh does 


not assign time depths at all, are called clusters. The 
clusters set up here must for the present be viewed 
as attempts to solve a puzzle that involves juggling 
opinions rather than resolving questions, but I think 
that this effort can serve to identify the hypotheses 
that most need testing. Names for clusters are set up 
by prefixing ‘macro-’ to the name of one or two 
member families. The clusters set up here do not all 
just simply fall out of a straightforward comparison 
of existing classification schemes. When any two of 
the four classifiers (Loukotka, Swadesh, Greenberg, 
Suarez) agree on associating two genetic groups, I 
assign them to the same cluster. 
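
The two rules of thumb in this section (the stock/cluster distinction and the two-classifier agreement rule) can be restated as a short Python sketch. Everything named here is hypothetical: the functions `grouping_label` and `build_clusters`, the placeholder units A to D and the sample proposals are invented for illustration, and the transitive merging of agreed pairs is an assumption. As noted above, the actual clusters do not all fall out of such a mechanical comparison.

```python
from itertools import combinations


def grouping_label(n_scholars, swadesh_centuries):
    """'stock' if recognized by at least two scholars and Swadesh gives
    a time depth under 50 centuries; otherwise 'cluster'."""
    if n_scholars >= 2 and swadesh_centuries is not None and swadesh_centuries < 50:
        return "stock"
    return "cluster"


def build_clusters(units, proposals):
    """Merge genetic units that at least two classifiers associate.

    proposals maps a classifier name to a list of sets of units that the
    classifier links together. Agreed pairs are merged transitively
    (an assumption of this sketch).
    """
    parent = {u: u for u in units}

    def find(u):
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    # Count, for every pair of units, how many classifiers link them.
    votes = {}
    for linked_sets in proposals.values():
        pairs = set()
        for linked in linked_sets:
            pairs.update(frozenset(p) for p in combinations(sorted(linked), 2))
        for pair in pairs:
            votes[pair] = votes.get(pair, 0) + 1

    # Merge every pair backed by at least two classifiers.
    for pair, count in votes.items():
        if count >= 2:
            a, b = tuple(pair)
            parent[find(a)] = find(b)

    clusters = {}
    for u in units:
        clusters.setdefault(find(u), set()).add(u)
    return sorted(sorted(c) for c in clusters.values())


if __name__ == "__main__":
    # Hypothetical proposals over placeholder units A-D.
    sample = {
        "Loukotka": [{"A", "B"}],
        "Swadesh": [{"A", "B", "C"}],
        "Greenberg": [{"C", "D"}],
        "Suarez": [{"B", "C"}],
    }
    print(build_clusters(["A", "B", "C", "D"], sample))
    print(grouping_label(2, 49), grouping_label(2, None))
```

In the sample, A-B and B-C are each supported by two classifiers, so A, B and C end up in one cluster, while D, linked to C by a single classifier, stays apart.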


3.3.4 Geolinguistic regions 


In order to devise a manageable way to list the genetic 
units of South America, the area has been broken up 
into ‘geolinguistic’ regions, and cross-unit proposals 
have been taken into account in setting them up, as 
well as in the ordering of genetic units within the 
geolinguistic regions, and in the external ordering of 
the geolinguistic regions with respect to one another. 
I have tried, as far as possible, to make sense of and 
harmonize the classifications referred to above. 


3.3.5 Invasion zones 


There are five regions in South America that are 
occupied almost entirely by languages whose closest 
relatives are concentrated elsewhere. These are: 


• the mouth of the Amazon, where both Maipurean
(Palikur, Aruán) and Tupian (Wayampi) 
languages are found far from home; 

• the upper Xingu, where along with unaffiliated 
Trumai are found Kariban (Bakairi group), 
Maipurean (Paresi), Macro-Je (Rikbaktsa) and 
Tupian (Aweti, Kamayura) languages. The latter 
two are not all that far from home, but they are 
disjunct with respect to their congeners; 

• the upper Amazon, where Tupian languages 
(Kokama-Kokamilya, Omawa-Kampeva) and 
one Maipurean (Morike) find themselves far 
from home; 

• the Parana valley, where Tupian (Tupi-Guarani, 
Aché) and Maipurean (Terena) languages find 
themselves far from home; 

• the north-west Chaco, occupied by Chiriguano, a 
Tupi-Guarani language. 


3.3.6 Unidentified languages 


In lowland South America, unlike North and 
Meso-America, and especially in Brazil, Colombia, 
Venezuela, Ecuador, Bolivia and Peru, there are still 
some indigenous groups who are not in regular contact 
with representatives of the national culture, and in 
some cases avoid all contact with outsiders. Of course 
the languages of uncontacted groups are unknown to 
us. Over the past century as many as fifty such groups 
have been contacted, and their languages identified 
and sometimes studied. In most cases these groups 
speak languages not known before; in some cases they 
speak languages known from before 1900 but lost 
from view, or languages also spoken by groups in 
regular contact with the outside world. In any case, 
from time to time previously unknown languages are 
found in South America. 

In the current compilation it is not considered 
appropriate to list any ethnic group whose language is 
largely undocumented. In all of Latin America, there 
are ethnic groups who have disappeared through 
death and/or acculturation before their language was 
recorded or identified (or all records of such docu- 
mentation have disappeared). 

In sum, this list of languages is provisional. Suf- 
ficient data to distinguish one postulated language 
from another are often lacking. Data to show that 
what is called one language is actually more than one 
are also often lacking. 

The number of ethnic and linguistic groups that 
became extinct during the colonial period is very 
large — in the hundreds. 

The numeration system used here assigns a 
sequential number to each language, throughout the 
set of maps. Colonial languages, such as Spanish and 
Portuguese, are spoken over virtually the whole 
region and are marked with letters. Languages and 
language areas are assigned numbers. Language areas 
are made up of entities that are, in my judgement, 
almost but not quite distinct languages. I call them 
emergent languages (em. lang.). Emergent languages 
have numbers followed by a, b, c, d and so on. The 
total number of items on the list is therefore greater 
than 427, somewhat of the order of 475. 

A hyphen is used between names where languages 
have a shared identity; the name given first is 
the one most commonly used to cover both dialects. 


3.4 The languages listed and classified 


3.4.1 North-western region 


Language Alternative name/autonym Number Population Location 


Yurimangi 1 extinct Valle del Cauca, Colombia 


‘Yurimangui’ is the name of a river in Colombia, department of Valle del Cauca. Loukotka (1943) placed Yurimangi in ‘Chibchan’ as a ‘mixed language’. Rivet (1943) tried to 
show that Yurimangi belongs to the Hokan stock. Greenberg (1987) follows him in this. Swadesh (1959) sets up a macro-Zamuco of 88 centuries’ differentiation including 
Awishiri=Tekiraka, Samúkoan, Charrúan and Ofayé. Quite obviously the linguistic kin of Yurimangi are far to seek. Yurimangi is dead and poorly documented. 


Timotean family (15 centuries) 
Timote-Kuika 2 extinct? Cordillera de Mérida, Venezuela 


Although apparently dead, Timote may survive as Mutú, as reported in Ethnologue. There are several hundred speakers in the town of Mutús, a little above Pueblo Llano, 
Barinas State, Venezuela. Mutú is so far unstudied, and its relationship is thus unclear. 


Mukuchi Mokochi/Mirripu, Maripu 3 extinct Cordillera de Mérida, Venezuela 


Timote had two dialects, Timote proper and Kuika. Mukuchi had two dialects, Mukuchi proper and Mirripú. Most other classifiers identify the two languages of this family 
as Timote (with Mukuchi and Mirripú as dialects) and Kuika, but unless the data in Loukotka (1968) are mislabelled, Timote and Kuika are dialects of the same language, 
and Mukuchi and Mirripú are dialects of a second language. This suggests that Kuika is geographically displaced and that Timotean basically belongs in Mérida. ‘Mucutú’ is 
the name of an undocumented variety that may belong to either language. Timotean has also been called ‘Mukú’. There are no generally held hypotheses regarding outside 
connections of Timotean. 


Hiraharan family (19 centuries) 


Hirahara Jirajara 4 extinct Falcón, Venezuela 
Ayoman Ayaman 5 extinct Lara, Venezuela 
Gayon Coyón 6 extinct Lara, Venezuela 


Greenberg places Hiraharan in ‘Paezan’. Swadesh (1959) links it with Guató, Taruma, Mirandela, Kariri and Kariban. Hiraharan shows lexical similarities with both Arawakan and 
‘Chibchan’. There are no generally accepted hypotheses about outside connections of Hiraharan. 


Chokó family (7 centuries) 
Noanama Waunana 7 6,000 SE Panama, Colombia 


Emberá group or language area 


South Emberá em. lang. 8a 3,500 Panama; Antioquia, Colombia 
North Emberá em. lang. 8b 20,000 Panama; Antioquia, Colombia 
Sinúfana Cenufara 9 extinct Bolivar, Colombia 

Kimbaya Quimbaya 10 extinct Valle del Cauca, Colombia 


The time depth of seven centuries calculated by Swadesh (1959) may be too low, if Sinúfana and Kimbaya are to be included. Otherwise the internal diversification of Chokó 
does not seem to be very great. Tovar and Suarez think Chokó is related to Kariban. Since Loukotka (1968) cites no data from Sinúfana and Kimbaya, and since those who 
think Chokó is Kariban do not usually include Kimbaya, it may be that Kimbaya does not belong to this family. Loukotka (1944) sees similarities with Paes in South 
Emberá, and similarities with Arawakan in Noanamá. Rivet and Loukotka (1950) include Chokó in Kariban. Comparative study of Chokó is still in initial stages. 


Macro-Paesan cluster 


Kunsa-Kapishaná stock (49 centuries) 
Kunsa Cunza 11 extinct Atacama, Chile; Potosi, Bolivia 
Kapishana Kapixana/Kanoé 12 extinct Rondonia, Brazil 


Swadesh (1959) groups these together with a fairly low time depth and the lexical evidence looks promising, but Greenberg does not agree. Both Swadesh and Greenberg 
link Kunsa to their versions of macro-Paesan. 

Betoi Hirara, Jirara 13 extinct Arauca, Colombia 
Loukotka (1968), Suarez (1974) and even Nimuendajú (1981) all include Betoi in Chibchan (which for them includes Paesan), and Greenberg locates it more specifically 
in Paezan. 


Paes-Barbakóan stock (54 centuries) 


Andaki Andaqui 14 extinct Huila, Caqueta, Colombia 
Paes group 

Paes Paez 15 50,000? Huila, Colombia 

Pansaleo Panzaleo 16 extinct Pichincha, Cotopaxi, Ecuador 
Kokonuko group 

Kokonuko Coconuco 17 extinct? Cauca, Colombia 

Totoró 18 extinct? Cauca, Colombia 

Wambiano Guambiano, Mogés 19 9,000 Cauca, Colombia 


Without the inclusion of Andaki, this grouping is often called ‘Interandine’. Andaki seems in fact to be more closely similar to the Paes group than to the Kokonuko group. 
The data cited in Loukotka (1968) do not support including Pansaleo in Paesan. 


Barbakóan family 

Northern Barbakóan group 
Koaiker (Awa-)Coaiquer 20 21,000 Cauca, Colombia; Carchi, Ecuador 
Muelyama Muellama 21 extinct Nariño, Colombia 
Pasto 22 extinct Carchi, Ecuador; Nariño, Colombia 


Southern Barbakóan group 


Kayapa Cayapa/Chachi 23 5,000 Esmeralda, Ecuador 

Tsafiki Colorado/Tsachela 24 2,100 Pichincha, Los Rios, Ecuador 
Kara Karani, Caranqui/Imbaya 25 extinct Imbabura, Ecuador 

Itonama Machoto 26 >110 Beni, Bolivia 

Warao Guarao 27 18,000 Guyana; Suriname; Amacuro, Ven. 


The Macro-Paesan cluster contains certain of the genetic units grouped by Greenberg in his more inclusive ‘Paezan’. Swadesh (1959) links Kapishana to Kunsa, and they in 
fact do share lexical similarities. He links Kunsa, Kapishana, Paes-Barbakóan and Itonama as parts of his more inclusive ‘Macro-Paez’. Suarez (1974) links Betoi, Paes- 
Barbakóan, Itonama and Warao within his more inclusive ‘Macro-Chibchan’. Loukotka specifically links Betoi and Paes-Barbakóan within his more inclusive ‘Chibchan’. 
The Macro-Paesan cluster is thus supported from many quarters, though the work needed for developing the arguments in favour of this hypothesis remains to be done. See 
Kaufman (1990), which offers a somewhat different formulation. 


Chibcha-Misumalpan stock 


There is general agreement that Chibcha and Misumalpa are genetically related. Holt (1986) has assembled some lexical evidence for this, and Craig and Hale have 
assembled some grammatical data for it. The similarities are not numerous. 


Chibchan (sub)stock 


The Chibchan languages are a set of twenty-one known languages, of which five are dead. They are spoken in lower Central America (in Honduras, Nicaragua, Costa Rica 
and Panama) and Colombia. They seem more diverse than Mayan, and less diverse than Oto-Mangean; probably about as diverse as Yuta-Nawan. I view Chibchan as a stock 
rather than a family. Chibchan is made up of fourteen fairly obvious groups, and the ungrouped languages. If some less-than-obvious relationships are accepted, the number 
goes down to nine. The Chibchan stock was recognized in 1888 by Max Uhle. The Chibchan stock must be distinguished from the membership of the hypothetical ‘Macro- 
Chibchan’ superstock: ‘Macro-Chibchan’ contains Chibchan as a core, with several additional families and isolates stuck onto it almost without restraint. The current 
incarnation of the Macro-Chibchan hypothesis is Greenberg’s (1987: 382), where Kwitlateko, Lenkan, Misumalpan, Tarasko, Shinka, Yanomáman, Warpe, Kunsa, Betoi,
Chimúan, Itonama, Hiraharan, Muran, Andaki, Barbakóan, Chokó, Paesan, Timukwa and Warao are all accreted to the Chibchan stock. Only a handful of the Chibchan
languages have been documented by descriptive linguists who have made their work public. The best-documented languages to date are Chibcha, Kogi, Kuna, Rama, Bribri 
and Ika. 

I give here my best guess as to how the Chibchan languages should be classified. The classification offered here is a working hypothesis, which is subject to change, but 
not in all aspects. The position of Chimila within Eastern Chibchan may be specifiable in greater detail; it may group with Arwako, for instance. Teribe may fit within 
Central Chibchan after all, thus eliminating the need for a distinction between Core Chibchan and Central Chibchan. Gradually, parts of this classification will become 
verified or modified as the reconstruction of proto-Chibchan proceeds, and the diversification of the Chibchan languages is worked out. 


Paya Pech 28 994 NE Honduras 


Core Chibchan division (61 centuries) 


Central Chibchan family (51 centuries) 
Watuso-Wétar Guatuso, Guetar 29 extinct N & W Costa Rica


The extinct Wétar was closely related to Watuso, and may have been a dialect of it. Suerre (on the Tortuguero River) cannot be definitely identified linguistically, but if 
anything it is Kabékar or Bribri. 
Rama 30 24 SE Nicaragua 


Eastern Chibchan branch (42 centuries) 


Arwako group (27 centuries) 
Tairona 31 extinct Magdalena, Colombia 


Tairona, no longer an ethnic language, is said to be in use as the shamanic/priestly language of the Kogi. 


Kogi Coqui/Kaugian 32 4,000 Sierra Nevada de Santa Marta, Col. 
Wamaka Guamaca/Malayo 33 3,225 Sierra Nevada de Santa Marta, Col. 
Atanke Atanque/Kampanake, Busintana, Buntiwa, Kankuama 34 extinct Sierra Nevada de Santa Marta, Col.
Ika Bintukwa, Arwako 35 14,301 Sierra Nevada de Santa Marta, Col.

Arwako has always been used by comparative linguists to refer to the whole group to which Ika belongs. 

Chimila 36 2,000 Sierra Nevada de Santa Marta, Col. 
Bari Motilón, Dobokubi 37 850 Zulia, Venezuela
Tunebo 38 1,800 Sierra Nevada de Cocuy, Boyaca, Arauca, Colombia; Apure, Venezuela
The numerous dialects of Tunebo have not been clearly classified.
Cundinamarca group
Chibcha Muiska, Muisca 39 extinct Cundinamarca, Colombia 


‘Muisca’ is a rendering of the native term muyska ‘person’. ‘Chibcha’ is a native term for the ethnic group and language. It is the first name cited by Lugo, and is established
in general usage. The preference by some scholars for ‘Muisca’ seems designed to leave the term ‘Chibcha’ as the denomination of the stock, but not of any language in it.
Chibcha was the language of a large and important ethnic group at time of contact, and was moderately well documented during the seventeenth century. 


Duit 40 extinct Cundinamarca, Colombia 

Kuna Cuna 41 1,300 E Panama, central W Colombia 
Waimi branch (37 centuries) 

Mobe Guaymi, Move/Ngobere 42 128,000 Central N Panama 

Bokota Buglere, Bogota 43 2,500 Central N Panama 

Doraske Changena 44 extinct W Panama 


Talamanka branch (43 centuries) 
Boruka Borunca, Borunka 45 5 women SE Costa Rica 


Kabékar-Bribri language area (10 centuries) 


Kabékar Cabécar 46a 3,000 NE Costa Rica 
Bribri 46b 6,000 NE Costa Rica 
Teribe Terraba/Naso 47 1,500 W Panama, E Costa Rica 


Comparative work has been done on Chibchan: Constenla’s (1981) dissertation reconstructs the phonology of proto-Chibchan, using six languages as the basis for the
reconstruction (Watuso, Kabékar, Bribri, Teribe, Boruka and Chibcha) and accounting for the historical phonology of thirteen others. Holt’s (1986) dissertation
reconstructs the phonology of proto-Chibchan, using six languages as the basis for the reconstruction: Paya, Rama, Bribri, Kuna, Kagaba, Chibcha.
Misumalpan family 
Miskitu Mosquito 48 183,400 E Nicaragua, E Honduras 
Sumu-Ulwa lang. area 49 7,400 Nicaragua, Honduras 
Matagalpa language area (10 centuries) 
Matagalpa em. lang. 50a extinct Nicaragua, El Salvador 
Kakaopera em. lang. Cacaopera 50b extinct El Salvador 
Kamsa Cams, Sibundoy, Coche 51 4,022 Putumayo, Colombia 


3.4.2 Western Amazonia 


Language Alternative name/autonym Number Population Location 
Tiniwan family 

Tiniwa Tinigua 52 extinct Meta, Colombia 

Pamiwa Pamigua 53 extinct Meta, Colombia 


There are no commonly held hypotheses relating Tiniwan to any other genetic grouping. Loukotka (1935) groups Tiniwan with Sálivan. Nimuendajú groups Tiniwan with
Wahivoan. Greenberg put it in ‘macro-Arawak’ with Katembri, Otomako, Wahivoan, Arawakan, Wamo, Chapakúran and Uru-Chipaya.


Macro-Otomakoan cluster 

Macro-Otomakoan represents the intersection of Greenberg’s more inclusive ‘Equatorial’ and Swadesh’s more inclusive ‘macro-Makú’. No systematic effort has yet been
made to validate this particular grouping. The geographical discontinuity is considerable, but the same can be said for many other larger groupings, some of which are not 
controversial. See Kaufman (1990) for a slightly different formulation. 


Tuyoneri language area 


Wachipaéri em. lang. Huachipaeri/Harakmbut 54a 130 Cuzco, Madre de Dios, Peru 

Amarakaéri em. lang. 54b 500 Madre de Dios, Peru 
Otomakoan family 

Otomako 55 extinct Apure, Venezuela 

Taparita 56 extinct Apure, Venezuela 

Trumai 57 78 Mato Grosso, Brazil 


Wamo-Chapakuran stock 


Wamo Guamo 58 extinct Portuguesa, Venezuela 
Chapakuran family 
Itene/Central Chapakúran group
Wanyam Wanham 59 a few? Rondonia, Brazil 
Abitana-Kumana 60 a few? Rondonia, Brazil 
Kabishi Kabixi, Cabichi 61 extinct? Mato Grosso, Brazil 
Itene More 62 a few? Beni, Bolivia 


Wari/Southern Chapakuran group 


Kitemo-Nape 63 extinct Santa Cruz, Bolivia 
Chapakura Wachi, Huachi 64 extinct Santa Cruz, Bolivia 
Urupá-Jarú 65 a few? Rondonia, Brazil
Orowari Pakaasnovos 66 1,833 Rondonia, Brazil 
Tora 67 40 Amazonas, Brazil 


Macro-Arawakan cluster 

All the genetic units assigned to macro-Arawakan belong to Greenberg’s more inclusive ‘Equatorial’. Both Swadesh and Loukotka link Wahivoan to Arawakan/Maipurean, 
and Suarez links Arawan to Arawakan/Maipurean. Greenberg links all three in his ‘macro-Arawakan’. Matteson (1972) simply included Arawan within Arawakan in her 
rudimentary effort to reconstruct the latter. See Kaufman (1990) for a slightly different (and more inclusive) formulation. 


Wahivoan family (23 centuries) 


Wahivo Guajibo/Hiwi 68 20,000 Arauca, Colombia; Apure, Venezuela 
Kuiva Cuiba 69 2,650 Vichada, Arauca, Col.; Apure, Ven. 
Wayavero Guayabero/Hiw 70 1,200 Meta, Colombia 

Churuya 71 extinct Vichada, Venezuela 


Maipurean (sub)stock (45 centuries) 

The Maipurean or Arawakan stock contains more distinct languages than any other family in the western hemisphere, and is internally more diverse than any other South 
American genetic group. The name Arawakan is the one normally applied to what here is called Maipurean. Maipurean used to be thought of as a major subgroup of 
Arawakan, but all the living Arawakan languages, at least, seem to need to be subgrouped with languages already found within Maipurean as commonly defined. The sorting 
of the labels Maipurean and Arawakan will have to await a more sophisticated classification of the languages in question than is possible at the present state of comparative 
studies. 


Upper Amazon branch 
The subgrouping of the Upper Amazon branch is very difficult, since most of the languages are dead and scantily documented. The living or well-documented languages can 
be grouped fairly well, but the remainder of the tentative classification given here is subject to some improvement. 


Western Nawiki sub-branch: Wainumá group


Wainuma Wai 72 extinct Amazonas, Brazil 

Mariaté 73 extinct Amazonas, Brazil 

Anauya 74 extinct Amazonas, Venezuela 
Piapoko group 

Achawa Achagua 75 400 Meta, Colombia 

Piapoko Piapoco/Tsase 76 3,100 Vaupés, Col.; Amazonas, Ven. 

Amarizana 77 extinct Meta, Colombia

Kaviyari Cabiyari 78 50 Amazonas, Colombia 
Warekena group 

Warekena Guarequena 79 705 Amazonas, Venezuela & Brazil 

Mandawaka Mandahuaca 80 3,000 Amazonas, Venezuela & Brazil 
Rio Negro group 

Jumana 81 extinct Amazonas, Brazil 

Pasé 82 extinct Amazonas, Brazil 

Kaiwishana 83 extinct Amazonas, Brazil 


Jukuna language area: 


Jukuna em. lang. 84a 800 Amazonas, Colombia 
Garú em. lang. 84b extinct Caqueta, Colombia
Eastern Nawiki sub-branch 
Tariana Tariano 85 100 Amazonas, Brazil; Vaupés, Col. 
Karu lang. area 86 
Ipeka-Tapuia dialect group/em. lang. 86a 135 Amazonas, Brazil, Col., Ven. 
Baniwa dialect group/em. lang. 86b 5,460 Amazonas, Brazil & Venezuela 
Kadaupuritana dialect group/em. lang. 86c 2 Amazonas, Brazil 
Resigaro 87 14 NE Peru 
Central Upper Amazon sub-branch: Baré group 
Marawa 88 extinct Amazonas, Brazil 
Baré Balé 89 a few? Amazonas, Brazil & Venezuela 


Baré is a term apparently used for several Maipurean Arawakan languages of the area: Baré, Mandawaka, Warekena, Baniwa and Piapoko, according to Ethnologue.


Ginao 90 extinct Bolivar, Venezuela 


Yavitero group 


Yavitero 91 extinct Venezuela 

Baniva 92 extinct Amazonas, Venezuela 

Maipure 93 extinct Vichada, Col.; Amazonas, Ven. 
Manao group 

Manao 94 extinct Amazonas, Brazil 

Kariaí 95 extinct Roraima, Brazil


Uncertain grouping: 


Waraikú 96 extinct Amazonas, Brazil
Yabaána 97 extinct Roraima, Brazil
Wirina 98 extinct Roraima, Brazil 
Shriana Xiriana 99 ? Roraima, Brazil 


Maritime branch 


Aruán 100 extinct Marajo, Brazil
Wapishana I. ar. Ar'uma 101 7,500 SW Guyana; Roraima, Brazil 
Ta-Maipurean sub-branch 
Taino 102 extinct Cuba, Santo Domingo, Puerto Rico, Jamaica


The Taino language was spoken on all the large islands of the Caribbean. It was the source of numerous lexical borrowings into the Spanish of the Caribbean, and later of 
Meso-America and northern South America. 


Wahiro group 
Wahiro Guajiro/Wayuu 103 305,000 Meta, Col.; Zulia, Ven. 
Parauhano Paraujano/Anyu 104 20 Zulia, Venezuela 
Arawak Aruak/Lokono 105 2,400 Guyana, Suriname, French Guiana, Venezuela
Inyeri language area 
Kalhiphona em. lang. 106a extinct Dominica, St Vincent 
Garifuna em. lang. 106b 197,000 N Honduras, SE Guatemala, S Belize, Nicaragua


Kalhiphona and Garifuna get their names from the Kariban word /karipona/, which is their autonym. It was imposed on the Inyeris by their conquerors, Kariban men who 
eventually learned the Inyeri language while keeping a sizeable vocabulary of Kariban origin for use among men. 


Eastern branch 


Palikur language area 
Palikur em. lang. 107a 1,400 Amapa, Brazil; Fr. Guiana 
Marawán-Karipurá em. lang. 107b extinct Amapa, Brazil


Western branch 
Amoesha Yanesha 108 9,000 Cuzco, Peru 
Chamikuro Chamicuro 109 5 Loreto, Peru 


Central branch 


Paresi group 
Paresi Parecis/Haliti 110 1,200 Mato Grosso, Brazil 
Sarave Saraveca 111 extinct Santa Cruz, Bolivia; Brazil 
Waura group 
Waura-Meinaku 112 240 Xingu Park, Brazil 
Yawalapiti 113 140 Xingu Park, Brazil 
Kustenau 114 extinct Mato Grosso, Brazil 


Southern Outlier branch 
Terena 115 15,000 Mato Grosso do Sul, Brazil 
Moho group 
Moho language area Mojo 
Ignaciano em. lang. 116a 4,000 SC Beni, Bolivia
Trinitario em. lang. 116b 5,000 SC Beni, Bolivia 
Baure 117 a few? Beni, Bolivia 
Pauna-Paikone language area 118 extinct Santa Cruz, Bolivia 
Piro group 
Piro Manitereni, Maxineri 119 400 Acre, Brazil; Loreto, Peru 
Inapari Inamari 120 4 Madre de Dios, Peru 
Kanamaré 121 extinct Acre, Brazil 
Apurina Kangite 122 2,000 Amazonas, Acre, Brazil 


Kampa branch 


Kampa language area 


Ashéninga em. lang. 123a 5,000 Cuzco, Peru 

Ashaninca em. lang. 123b 25,000 Cuzco, Peru 

Matsigenga em. lang. 123c 6,000 Cuzco, Peru 
Unclassified non-Maipurean Arawakan languages: 

Shebaye 124 extinct Trinidad? 

Lapachu 125 extinct La Paz, Bolivia 

Morike Mayoruna 126 extinct Brazil/Peru border 


This classification acknowledges the help of David Payne. Comparative work on Maipurean languages has been done. Taylor (1958) compares Garifuna, Lokono and Wahiro 
and extracts some sound correspondences. About seventy-five cognate sets are discussed. 

Shafer (1959) assembles about forty cognate sets from a large number of Maipurean languages, mostly from the Amazon and Rio Negro area, though the whole range of 
languages is touched on. Thirteen consonantal correspondences are recognized. Shafer did not respect the orthographies of his mainly nineteenth-century pre-linguistic 
sources, so some information was lost. 

Taylor (1960) gives more on the same topic as Taylor (1958), with more extensive data, about 125 sets. This is the most systematic comparison of phonological 
correspondences in Arawakan yet made. 

Matteson (1964), though examining only thirty-one cognate sets here, presents data from and sound correspondences between Amoesha, Kampa, Piro, Terena, 
Apurina, Baure and Garifuna. Unfortunately, Arawan and Harakmbut data are also cited, though these languages are not Arawakan or Maipurean. 

Matteson (1972) reconstructs the phonology of proto-Arawakan, using seventeen languages, most of the ones that are still alive. Matteson includes data from Arawan 
and Harakmbut, which do not belong in this family, and provides intermediate reconstructions for ‘proto-Shani’ (=Terena and Baure, 106 sets), ‘proto-Harakbut’ (191 sets, 
but not Arawakan), ‘proto-Piro-Apurina’ (185 sets), ‘proto-Ashaninka’ (=Kampa, 260 sets), ‘proto-Madi’ (=Arawan, 87 sets, but not Arawakan), and ‘proto-Newiki’ (=some 
of the Rio Negro languages, 144 sets). The study has 353 Arawakan ‘cognate’ sets with reconstructions, but many of these are sets only by virtue of including Arawan or 
Harakmbut forms; these sets also cite data from Paresi, Wahiro and Garifuna. Matteson packs the reconstructions with segments and morphs that are attested in some of the 
languages only. Many of the forms cited contain unacknowledged and unsegmented inflectional affixes. 

Wise (1990) assembles data for about twelve valence-changing suffixes, using data from about twenty languages. 

David Payne (1987a) assembles data for about twelve affixes — person markers and markers of possession — using data from fifteen to twenty languages. Payne (1987b) 
assembles twenty-three cognate sets from twenty-nine languages and makes tentative reconstructions, and Payne (1990) assembles 230 cognate sets from virtually all the 
Maipurean languages still spoken, and makes a tentative reconstruction of proto-Maipurean phonology, tracing developments in all descendants. Payne (1990) offers a 
classification based on shared lexical material assumed to be retentions. This is clearly the single greatest step forward in comparative Arawakan studies. Although most of 
the sound correspondences that point to the different proto-Maipurean phonemes have already been identified in the various works of Taylor, Taylor did not reconstruct, 
nor use so many languages, nor have such extensive and accurate data. Payne points out that his proposed classification should be used as a way of identifying intermediate 
proto-languages that need reconstructing. 


Arawan family 
Arawa 127 extinct Amazonas, Brazil 


The group was wiped out by measles in 1877. 


Kulina Culina/Madiha 128 1,300 Amazonas, Brazil; SE Peru 

Deni 129 600 Amazonas, Brazil 
Jamamadi language area 

Jamamadi em. lang. 130a 195 Amazonas, Brazil 


Its dialect Tukurina may be a separate language. 


Kanamanti em. lang. 130b 2 Amazonas, Brazil 
Jarawara em. lang. 130c 150 Amazonas, Brazil 
Banawa em. lang. 130d 70 Amazonas, Brazil 
Paumari 131a 700 Amazonas, Brazil
Zuruaha Suruaha 131b 130 Amazonas, Brazil 
Kandoshi Candoshi 132 3,000 Peru 


Matteson (1972) is the only comparative study of this family using moderately accurate data. She uses data from Kulina, Jamamadi, Jarawara and Paumari. According to her, 
the percentage of cognates between Jarawara and Jamamadi is 80 per cent; this would imply that they are probably not separate languages. The other cognate counts she 
gives are: Kulina–Jamamadi/Jarawara, 49 per cent; Paumari–Kulina and Paumari–Jamamadi/Jarawara, 31 per cent. She cites eighty-seven cognate sets for the three (or four)
languages. Matteson calls this family Madi, since two of the languages (Madiha and Jamamadi) have this element in their names. 

Arawakan proper (or Maipurean) and Arawan are widely believed to form a family. Matteson included Arawan in her Arawakan reconstruction, but without much 
success. There is no doubt that Arawan is typologically like Arawakan, but the relationship, if real, is not very close.


Macro-Puinavean cluster 

The Macro-Puinavean cluster represents the intersection of Greenberg’s more inclusive ‘macro-Tucanoan’ and Swadesh’s more inclusive ‘macro-Makú’. Awaké-Kaliana,
which is not assigned by Swadesh to ‘macro-Makú’, seems worth comparing to Maku (145), and transitively to the rest of macro-Puinavean. Macro-Puinavean is not
supported by careful comparative study. 


Puinávean stock (57 centuries)


Kuri-Dou language area 133 extinct Amazonas, Brazil 

Hupda 134 1,350 Amazonas, Brazil; Vaupés, Col. 
Kaburi language area 

Nadéb em. lang. 135a 300 Amazonas, Brazil 

Kaman em. lang. /Daw 135b 83 Amazonas, Brazil 

Wariva Guariba 136 2 Amazonas, Brazil 

Kakua Cacua 137 150 Vaupés, Colombia 

Puinave Guaipuinavi/Waipuinavi 138 2,240 Vaupés, Col.; Amazonas, Ven. 
Waviare Guaviare/Nukak Maku 139 300? Guaviare jungle region, Col. 


The name ‘Makú’ is a pejorative term of Jeral origin meaning ‘slave’ or ‘underling’. Although the term is entrenched, it is clearly unacceptable. The subgrouping of this
family or stock is still unclear. Waviare is geographically between Puinave and the other Puinavean languages. Swadesh’s figure of 57 centuries seems not to include Puinave 
(or Waviare either, which was unknown at the time). 


Katukinan family 
Katukina Catuquina 140 1 Acre, Brazil 
Dyapa [Southern Katukina] 141 2 Amazonas, Brazil 
Katawishi Katawixi/Hewadie 142 10 Amazonas, Brazil 


Since most of the ethnolinguistic groups in this language family have a name with a second element dyapa, meaning ‘people’ or ‘clan’, a better name for this family might be 
Djapa. This would help avoid the ambiguity associated with the names Katukina, Kanamari and Katawishi. As Rivet (1920) shows, the name Katukina has been applied to so
many linguistic groupings as to make it extremely unspecific and uninformative. 


Kalianan stock 


Awaké-Kaliana family 


Awaké Arutani/Uruak 143 22 Roraima, Brazil; Bolivar, Ven. 
Kaliana Kariana/Sapé 144 5 Bolivar, Venezuela 
Maku Macu 145 10? Roraima, Brazil 


Macro-Tekiraka cluster or Tekiraka-Kanichana stock
Tekiraka Tequiraca/Avishiri 146 2 Loreto, Peru
Kanichana Canichana 147 extinct Beni, Bolivia


Tukanoan stock 


Western Tukanoan branch 


Korewahe Koreguaje, Caqueta 148 2,000 Caqueta, Colombia 

Piohé Secoya 149 435 Oriente, Ec.; Putumayo, Col.; Loreto, Peru

Teteté 150 extinct Colombia/Ecuador border 

Koto Coto/Orejon 151 190 Loreto, Peru 

Jaina Yahuna 152 20? Amazonas, Colombia 


Central Tukanoan branch 
Kubewa Cuveo/Pamiwa, Hehenawa 153 6,150 Vaupés, Col.; Amazonas, Br. 


Eastern Tukanoan branch 


Makuna-Erulia 154 550 Vaupés, Col.; Amazonas, Br. 
Yupua-Durina 155 extinct Amazonas, Colombia 
Kueretú Cueretu 156 extinct Amazonas, Brazil
Desana-Siriana 157 1,300 Vaupés, Col.; Amazonas, Br. 
Bara-Tuyuka 158 815 Vaupés, Col.; Amazonas, Br. 
Karapana Carapana 159 650 Vaupés, Col.; Amazonas, Br. 
Tukano Tucano 160 5,000 Vaupés, Col.; Amazonas, Br. 
Wanana-Pira Piratapuyo/Waikena 161 1,070 Vaupés, Col.; Amazonas, Br. 


A very large number of dialects in Tukanoan are reported as separate languages in Ethnologue (for example, Desana and Siriana), but the linguistic facts do not warrant this. 
According to Waltz and Wheeler (1972: 124), Tukano and Wanana ‘manifest completely identical reflexes’ of the proto-Tukanoan phonemes; taken together with 91 per cent 
cognates in basic vocabulary, any case for these being separate languages seems extremely weak. This applies to many sets of Tukanoan ‘languages’, even the ones recognized 
here. Though I am certain that there are not fourteen different languages in this family, I lack the familiarity with them necessary for arriving at an independent conclusion. 
If the cognate percentages given in Waltz and Wheeler are accurate, and taken seriously, one would want to recognize only about seven distinct Tukanoan languages. The 
whole matter requires study by someone not inclined to multiply entities needlessly. 

Of course, the fact that there is a large number of distinct ethnolinguistic groups within this family is unavoidable and uncontroversial. Swadesh gives a time depth of 23 
centuries for Western Tukanoan, and 45 centuries for Eastern Tukanoan. Waltz and Wheeler want to recognize a third branch, ‘Middle Tukanoan’, to include Kubewa, 
which might in fact be closer to Western than to Eastern Tukanoan. This leaves the position of Yupua-Durina and Kueretú ambiguous: they might be Western, Central or
Eastern; since they are dead, they are not discussed in either Waltz and Wheeler (1972) or Ethnologue. I find, from the meagre data cited in Loukotka (1968), that they are 
most like Desana, and I therefore place them in Eastern Tukanoan. Comparative work has been done on Tukanoan: Waltz and Wheeler (1972) reconstruct proto-Tukanoan 
phonology using data from seven languages, namely Buhagana (Barasano), Bara, Kubewa, Desana, Wanana, Tukano and Siona, with about 300 cognate sets and reconstructions.
There are 278 numbered sets, but often the same etymon is found in several different sets, while some numbered sets contain more than one etymology. Judging from the 
reconstruction offered in Malone (1986), the phonological reconstruction could have been refined considerably from the point actually attained in this article. 


Juri-Tikuna stock (46 centuries) 


Tikuna Ticuna 162 24,000 Amazonas, Br., Col.; NE Peru 
Juri Yuri 163 extinct Amazonas, Br., Col. 
Munichi Muniche 164 3 Loreto, Peru 


3.4.3 Northern foothills region 


Language Alternative name/autonym Number Population Location 


Takame-Jarúroan stock


Takame Tacame 165 extinct Manabi, Ecuador 
Jaruro Yaruro/Yuapin 166 2,000 Amazonas, Apure, Venezuela 
Kofan Cofan 167 1,300 Putumayo, Col.; Napo, Ec. 


Macro-Andean cluster 
Macro-Andean is the intersection of Greenberg’s more inclusive ‘Andean’, Swadesh’s more inclusive ‘macro-Jibaro’ and Suárez’s ‘Jivaro-Cahuapana’, or agreement between
any two classifiers. No attempts to validate macro-Andean as such are known to have been made. Puelche is dead. See Kaufman (1990) for a slightly different formulation. 


Hivaro-Kawapánan stock (Jivaro-Cahuapana, 19 centuries)


Hivaro language area 
Hivaro em. lang. Jivaro/Achuar-Shiwiar 168a 5,000 Loreto, Peru; Oriente, Ec. 
Awaruna em. lang. Aguaruna/Awahun 168b 35,000 Amazonas, Peru 


Hivaro is one of the populous languages of lowland South America. The name Hivaro is the same as Shuara, and is a Hispanicization of an earlier indigenous pronunciation. 
Some linguists and anthropologists claim that all Hivaro is a single language, while acknowledging that Awaruna is the most divergent dialect. The regional folk classification 
also distinguishes Awaruna from Hivaro. A defunct language and culture, Palta, is thought by Loukotka to have belonged within Hivaro, but it is poorly documented, and
shows little resemblance. 


Kawapánan family
Chayawita Chayahuita 169 10,000 Loreto, Peru 
Hevero Jebero 170 2,300 Jeberos, Peru 
Urarina /Shimaku 171 2,000 Urarinas, Peru 
Puelche 172 extinct Patagonia, Argentina 


Sáparo-Yawan stock (60 centuries)


Sáparo family: Sáparo group


Sáparo-Konambo Zaparo 173 5 Pastaza, Ec.; extinct in Peru
Arabela-Andoa 174 extinct Pastaza, Ec./Peru border 
Ikito-Kawarano Iquito-Cahuarano 175 150 (Kawarano dialect extinct) Loreto, Peru


Yawan family 
Yawa Yagua 176 5,000 Loreto, Peru 
Peva 177 extinct Loreto, Peru 
Yaméo 178 extinct Loreto, Peru 
Taushiro /Pinchi 179 7 Tigre River, Peru 
Omurano 180 extinct Loreto, Peru 


On the basis of a certain amount of lexical evidence, it seems that Taushiro might be related to both Omurano and Kandoshi, and more closely to the former. 
Sabela Waorani 181 800 E Ecuador 


Bora-Witotoan stock (Bora-Huitoto, 54 centuries) 


Boran family 
Bora-Miranya 182 2,500 Putumayo, Peru; Amazonas, Br./Col.
Bora-Muinane 183 150 Amazonas, Colombia 

Witotoan family 
Andokero Andoquero 184 extinct Amazonas, Colombia 
Koeruna Coeruna 185 extinct Amazonas, Brazil 
Okaina Ocaina 186 165 Loreto, Peru; Amazonas, Col. 
Nonuya 187 extinct? Loreto, Peru 
Murui(-Witoto) Huitoto 188 2,900 Putumayo, P.; Amazonas, Br./C. 
Koihoma Orejon/Koto 189 190 Loreto, Peru 
Meneka(-Witoto) Minica 190 1,700 Amazonas, Col.; Peru; Amazonas, Br.
Andoke Andoque 191 518 Caqueta, Amazonas, Colombia (extinct in Peru)


3.4.4 Andes region 


Language Alternative name/autonym Number Population Location 

Chimuan family 
Mochika 192 extinct Lambayeque, Libertad, Peru 
Kanyari 193 extinct Cañar, Azuay, Ecuador
Puruwa 194 extinct Chimborazo, Bolivar, Ecuador 


Mochika was one of the major languages of pre-Columbian South America and was fairly well documented by Carrera (seventeenth century) and Middendorff (nineteenth 
century). For Kanyari and Puruwá, only a few words each are known.


Macro-Kulyi-Cholonan cluster 
Both Swadesh (‘macro-Páez’) and Greenberg (Northern Andean) agree that these languages belong to a fairly low-level higher genetic grouping. The hypothesis has not
been systematically tested, and all the constituent languages are dead and poorly documented. 


Cholonan family 
Cholón 195 extinct Huallaga valley, Peru
Hibito Jibito 196 extinct Huallaga valley, Peru 
Kulyi 197 extinct Huaylas, Peru 


Macro-Lekoan cluster 
Swadesh groups Sechura and Leko in ‘macro-Leco’; he does not discuss Katakaoan. Greenberg groups all three together as part of his Northern Andean. The hypothesis has 
not been systematically tested, and all the constituent languages are dead and poorly documented. 


Sechura-Katakáoan stock
Sechura 198 extinct Piura, Peru 
Katakáoan family

Katakao Catacao 199 extinct Piura, Peru 

Kolán Colan 200 extinct Piura, Peru

Leko Leco 201 few? La Paz, Bolivia 


Kechumaran (Quechumara) stock 
Kechua (Quechua) language complex 
Central Kechua branch 

North Central Kechua language area 


Ancash Kechua em. lang. 202a 333,000 Ancash, Peru 
Huánuco Kechua em. lang. 202b 700,000 Huánuco, Peru
Pasco Kechua em. lang. 202c 30,500 Pasco, Peru 
Junin-Lima Kechua em. lang. 202d 76,525 Junin, Lima, Peru 

Wanka (Huanca) language area 
Yaqa em. lang. Huaylla 203a 300,000 Junin, Huancayo, Concepcion, P. 
Nyuqa Jauja, Shausha 203b 14,500 Junin (Jauja), Peru 

Peripheral Kechua 

Yunqay language area 
Cajamarca Kechua em. lang. 204a 35,000 Cajamarca, Chetilla, Los Baños, P.
Lambayeque Kechua em. lang. 204b 20,000 Lambayeque, Peru 

Kichua (Quichua) language area 
Lamista Kichua em. lang. 205a 40,000 San Martin, Sisa, Lamas, Peru 
Chachapoyas Kichua em. lang. 205b 3,000 Chachapoyas, Luya, Peru 
Lowland Kichua em. lang. 205c 6,000 Pastaza, Ec./Peru border 
Highland Kichua em. lang. 205d 1,417,000 Cañar, Chimborazo, Imbabura, Loja, Tungurahua, Ecuador


Southern Kechua language area 


Ayacucho Kechua em. lang. 206a 1,000,000 Ayacucho, Peru 

Imperial Kechua em. lang. 206b 3,500,000 Cuzco, Arequipa, P.; E Bol., Salta, Arg.

Santiago del Estero Kechua em. lang. 206c 60,000 Santiago del Estero, Arg. 

Chilean Kechua em. lang. 206d 2 Second Region, Chile 

Yauyos Kechua 207 18,950 Lima, Yauyos, Peru 


Kechua is a language complex like Romance, Misteko or Sapoteko. There is a great deal of internal diversification. 


Haki (Jaqui) language complex (13 centuries) 


Hakaru-Kauki Jaqaru 208 2,000 Lima, Yauyos, Peru 

Aimara Aymara 209 2,200,000 Altiplano, Bol., Peru, Chile, Arg. 
Uru-Chipaya language area 

Uru 210a extinct? Oruro, Atahuallpa, Bolivia 

Chipaya 210b 2,000 Oruro, Atahuallpa, Bolivia 

Pukina 211 extinct E Altiplano, Bolivia 


The Qalyawaya jargon (Callahuaya) of Bolivia is used by about ten Kechua speakers who (apparently) used to speak Pukina, or in any event a language related to Pukina. 
Qalyawaya is also called Pohena. It is not known where all its components come from. 


3.4.5 Southern foothills region 


Language Alternative name/autonym Number Population Location 


Yurakare [isolate] Yuracare 212 500 Beni, Cochabamba, Bolivia 


Macro-Panoan cluster 
Macro-Panoan is the intersection of Greenberg’s more inclusive ‘macro-Panoan’, Swadesh’s more inclusive ‘Quechuachón’ and Suárez’s more inclusive ‘Macro-Panoan’,
or agreement between any two. The hypothesis seems promising. Greenberg does not link Chon to Mosetén, but both Swadesh and Suárez do.


Pano-Takanan stock (47 centuries) 


Panoan family: outliers 
Kasharari Kaxarari 213 220 Rondonia, Brazil 
Kulino 214 extinct Amazonas, Brazil 


Panoan family: main branch 


Kashibo group 
Nokaman Nocaman 215 extinct Loreto, Peru 
Kashibo Cashibo-Cacataibo 216 1,500 Loreto, Peru 
Pano language area 
Panobo em. lang. 217a extinct Loreto, Peru 
Wariapano em. lang. 217b extinct Peru (reportedly) 
Shipibo group 
Shipibo Shipibo-Conibo 218 30,000 Ucayali River, Peru 
Kapanawa Capanahua 219 350 Loreto, Peru 
Marubo 220 594 Amazonas, Brazil 
Wanninawa Katukina 221 196 Acre, Brazil 


(distinct from the non-Panoan Katukina language in the same province) 
Remo 222 extinct Amazonas, Peru 
Tushinawa Tuxinaua 223 extinct Acre, Brazil 
Tri-State group 
Amawaka language area 
Amawaka em. lang. Amahuaca 224a 520 Loreto, Peru; Acre, Brazil 
Iskonawa em. lang. Isconahua 224b 28 Callaria River, Peru 
Kashinawa Cashinahua/Kaxinaua 225 1,600 Amazonas, Br.; Curanja and Purus 
rivers, Peru 
Sharanawa Sharanahua 226 850 Acre, Brazil; Loreto, Peru 
Yaminawa Yaminahua 227 1,700 Acre, Brazil; Huacapishtea, Mapuya, 
Peru; Pando, Bolivia 
Atsawaka-Yamiaka Atsahuaca-Yamiaca 228 extinct Madre de Dios, Peru 
Parannawa 229 extinct Acre, Brazil 
Poyanawa Puinahua 230 310 Acre, Brazil 
Shipinawa Xipinawa 231 extinct Amazonas, Acre, Brazil 
Bolivian branch 
Karipuna Caripuna 232 12 Rondonia, Brazil 
Pakawara Pacahuara 233 one family Beni, Bolivia 
Chakobo Chacobo 234 520 Beni, Bolivia 
Shaninawa Xaninaua 235 extinct Acre, Brazil 
Sensi 236 extinct Loreto, Peru 
Majoruna-Matsés Mayoruna 237 2,000 Amazonas, Brazil, Peru 


This language may be a mixture of various refugee Panoan languages. 
Comparative work has been done on Panoan: Shell (1975) reconstructs proto-Panoan phonology using data from seven languages — Shipibo, Kapanawa, Kashibo, 
Kashinawa, Amawaka, Marinawa and Chakobo — and supplementary data from several others. There are 512 cognate sets with reconstructions. 


Takanan family 


Takana group 
Takana 
Reyesano 
Araona 
Kavinenya 


Chama group 
Ese'eha 
Toromona 


Tacana 


Cavineña


Ese Ejja, Ese Exa 


238 
239 
240 
241 


242 
243 


3,500 

few? extinct? 
2,000


850 
extinct 


Beni, Bolivia 
Beni, Bolivia 
Colonia, Bolivia 
Beni River, Bolivia 


Beni River, Bol.; Maldonado, P. 
NW Bolivia 


The names Ese'eha and Chama are not very satisfactory. Chama is a regional pejorative name for these indigenes. Comparative work has been done on Takanan: Girard
(1971a) reconstructs proto-Takanan phonology using data from three languages (Takana, Kavinenya and Ese'eha) and supplementary data from two others. There are 504
cognate sets with reconstructions.


Mosetén-Chon stock (51 centuries) 


Mosetén language area 


Mosetén em. lang. 244a 500 Beni, Bolivia 
Chimane em. lang. Tsimané 244b 5,000 Beni, Bolivia 
Chon family 
Tewelche Tehuelche 245 30 Patagonia, Argentina 
Island Chon language area 
Ona em. lang. 246a extinct Tierra del Fuego, Ch./Arg. 
Haush em. lang. 246b extinct Tierra del Fuego, 
Ch./Arg. 
3.4.6 The Cone 
Language Alternative name/autonym Number Population Location 
Yamana 247 3 Patagonia, Chile 
Kaweskar language area (Qawaskar) 
Aksana em. lang. 248a extinct S Chile 
Hekaine em. lang. 248b 20 Magallanes, Chile 
Mapudungu 249 440,000 central Chile, Argentina 


Macro-Warpean cluster 


Swadesh and Greenberg agree that Warpe and Mura-Matanawian belong in the same higher grouping. No systematic study of this specific connection has so far been made. 


Warpe language area
Alyentiyak em. lang. Allentiac 250a extinct Mendoza, Argentina
Milykayak em. lang. Millcayac 250b extinct San Luis, Argentina


Mura-Matanawian stock/family


Except for Loukotka, there is general agreement that these languages all form a family.


Muran family
Mura 251 few? Amazonas, Brazil
Pirahén Piraha 252 150 Amazonas, Brazil
Bohura 253 extinct Amazonas, Brazil
Yahahi 254 extinct Amazonas, Brazil


This classification is based on information from Ethnologue, Aryon Rodrigues and Dan Everett.


Matanawi 255 extinct Amazonas, Brazil
3.4.7 Chaco region


Language Alternative name/autonym Number Population Location


Macro-Waikuruan cluster 


Swadesh (1959) places all these entities within his SW group (Matakoan, Waikuruan, Charruan, Maskoian); Greenberg places them all within his VIB group, which he calls
‘macro-Panoan’, and which (according to Greenberg) includes other genetic entities as well. Suarez (1974) links Waikuruan and Charruan. This higher grouping deserves to
be explored and tested.


Matakoan family (17 centuries) 


Matako Mataco, Wichi 256 25,000 Salta, Arg.; Tarija, Bol. 
Chorote 257 2,800 Salta, Arg.; Pilcomayo, Par. 
Chulupi Churupi, Nivaclé 258 18,200 Salta, Arg.; Chaco, Presidente Hayes, 
Boqueron, Paraguay 
Maka Maca 259 1,000 Chaco, Presidente Hayes, Par. 
Waikuruan family (35 centuries) 
Waikurú branch
Kadiwéu Caduveo/Mbaya 260 1,200 Mato Grosso, Br.; Gran Chaco, Par. 
Southern branch 
Pilaga 261 2,000 Formosa, Chaco, Salta, Arg. 
Toba 262 15,500 Chaco, Arg., Par.; Gran Chaco, Bol. 
Mokovi Mocovi 263 3,000 Chaco, Argentina 
Abipón 264 extinct Chaco, Argentina/Paraguay
Eastern branch 
Wachi Guachi 265 extinct Mato Grosso, Brazil 
Payawa Payagua 266 extinct Colonia Capitan Bado, Par. 
Charruan family 
Charrúa 267 extinct Uruguay; Entre Rios, Arg.; Rio
Grande do Sul, Br. 
Chana 268 extinct Uruguay 
Maskoian family 
Kaskiha Guana 269 500 Chaco, Boqueron, Paraguay 
Sanapana 270 6,900 Chaco, Presidente Hayes, Boquerón,
Par.
Lengua Enxet 271 10,000 Chaco, Presidente Hayes, Boquerón,
Par.
Maskoi Mascoy/Emok 272 extinct Chaco, Paraguay 
Lule-Vilelan stock 
Lule 273 extinct Chaco, Argentina 
Vilela 274 5 families Chaco, Argentina 
Samukoan family (Zamuco) 
Ayoreo 275 4,000 Chaco, Bol.; Chaco, Alto Paraguay 
Chamakoko Chamacoco 276 1,800 Chaco, Alto Paraguay 
3.4.8 Eastern Brazil 
Language Alternative name/autonym Number Population Location 


Macro-Je cluster 


The macro-Je cluster is the best supported of all South American clusters. There is good reason for believing all the languages listed below to be related, and detailed 
comparative work should be carried out. The time depth may not be as great as Swadesh’s estimates might suggest. Davis and Rodrigues both point to connections with 
Tupian, while Swadesh includes Chikitano-Bororoan in his more inclusive ‘macro-Tupi’. Comparative work has been done on ‘macro-Je’; Davis (1968) assembles sixty-seven
cognate sets between proto-Je, Mashakali and Karaja, and extracts sound correspondences from them, but does not propose a proto-‘macro-Je’ phonological system.
Davis makes about ten comparisons between proto-Je and proto-Tupi. 


Chikitano-Bororoan stock
Chikitano Chiquit(an)o 277 42,000 Chaco, Bolivia
Bororoan family
Bororo 278 850 Mato Grosso, Brazil
Umutina Umotina 279 extinct Mato Grosso, Brazil
Otuké Otuqué 280 extinct Chaco, Bolivia
Aimoré language complex (16 centuries)
Krenak virtual language 281 80 Minas Gerais, Brazil
Nakrehé virtual language 282 extinct Minas Gerais, Brazil
Guéren virtual language 283 extinct Bahia, Brazil
Rikbaktsa 284 800 Mato Grosso, Brazil
Je stock (54 centuries)
North(east)ern branch
Timbira Krikati-Timbira 285 420 Maranhao, Para, Goias, Brazil
Ipewi Kreen-Akarore 286 122 Xingu Park, Mato Grosso, Br.
Apinayé 287 800 Tocantins, Goiás, Brazil
Kayapó 288 4,000 Xingu Park, Mato Grosso, Br.
Suyá 289 378 Xingu Park, Mato Grosso, Br.
Central (Akwe) branch 
Shavante Xavante 290 8,000 Mato Grosso, Brazil 
Akroa Koroa 291 extinct Bahia, Brazil 
Sherente Xerente 292 1,200 Tocantins, Goiás, Brazil
Shakriaba Xakriaba 293 extinct Minas Gerais, Goias, Brazil 
Southern branch 
Kaingang 294 18,000 Parana, Rio Grande do Sul, Sao 
Paulo, Brazil 
Shokleng Xokleng/Aweikoma 295 250 Santa Catarina, Brazil 
Wayana Guayana 296 extinct Rio Grande do Sul, Brazil 


Comparative work has been done on Je; Davis (1966) offers a reconstruction of proto-Je phonology based on data from Apinayé, the Kanela dialect of Timbira, Suyá,
Shavante and Kaingang. There are 112 cognate sets with reconstructions.


Jeikó 297 extinct Piaui, Brazil
Kamakanan family (32 centuries)
Kamakán language area/complex
Kamakán em. lang. 298a extinct Bahia, Brazil
Mangalo em. lang. 298b extinct Bahia, Minas Gerais, Brazil
Kutashó em. lang. Kutaxo 298c extinct Bahia, Minas Gerais, Brazil
Menyén 299 extinct Bahia, Minas Gerais, Brazil 
Masakara 300 extinct Bahia, Brazil 
Mashakalian family (38 centuries) 
Malali 301 extinct Minas Gerais, Brazil 
Patasho-Hanhanhain Pataxo-Hahahae 302 extinct Minas Gerais, Espirito Santo, Br. 
Mashakali Maxakali 303 728 Minas Gerais, Brazil 
Purian family (32 centuries) 
Koropó 304 extinct Minas Gerais, Rio de Janeiro, Br.
Puri 305 extinct Espirito Santo, Brazil 
Fulnio language 
Fulnio 306 2,788 Pernambuco 
Karaja language area 
Karaja-Shambioa em. lang. Xambioa 307a 1,300 Goiás, Para, Mato Grosso, Br.
Javaé em. lang. 307b 383 Goias, Brazil 
Ofayé language 
Ofayé Opayé 308 extinct Mato Grosso, Brazil 
Guató language
Guató 309 40 Brazil/Bolivia border
Isolates, unclassified, possibly macro-Je 
Oti 310 extinct Sao Paulo, Brazil 
Baenan Baena 311 extinct Bahia, Brazil 
Kukura 312 extinct Mato Grosso, Brazil 
3.4.9 North-eastern Brazil 
Language Alternative name/autonym Number Population Location 


Macro-Katembri-Taruma cluster 


The Macro-Katembri-Taruma cluster is set up on the basis of the relatively close relationship recognized by both Swadesh and Greenberg. The hypothesis is untested by 
systematic comparison, and both languages are dead and poorly documented. 


Katembri 313 extinct Bahia, Brazil 
Taruma 314 extinct Amazonas, Brazil; Guyana 
Unclassified 
Kariri 315 extinct Paraiba, Pernambuco, Br. 
Tusha Tuxa 316 extinct Pernambuco, Brazil 
Pankararú 317 extinct Pernambuco, Brazil
Natú 318 extinct Pernambuco, Alagoas, Brazil
Shukurú Xukurú 319 extinct Pernambuco, Paraiba, Brazil
Gamela 320 extinct Maranhao, Brazil
Wamoé Uamué 321 extinct Pernambuco, Brazil
Tarairiú 322 extinct Rio Grande do Norte, Brazil
Shokó Xokó 323 extinct Pernambuco, Brazil
3.4.10 Central Amazonia 
Language Alternative name/autonym Number Population Location 
Jabutian family 
Jabuti 324 5 Rondonia, Brazil 
Arikapa 325 6 Rondonia, Brazil 
Mashubi 326 extinct Rondonia, Brazil 
Unclassified 

Koaya 327 few? Rondonia, Brazil 

Aikana 328 few? Rondonia, Brazil 
Nambikuaran family 

Northern Nambikuara 329 126 Mato Grosso, Brazil 
Southern Nambikuara language area 

Nambikuara em. lang. 330a 900 Mato Grosso, Brazil 

Sararé emergent language 330b 150 Mato Grosso, Brazil 

Kabishi 330c 100 Mato Grosso, Brazil 

Sabané 331 60 Mato Grosso, Brazil 
Unclassified 

Iranshe Iranxe 332 191 Mato Grosso, Brazil 

Movima 333 few? Beni, Bolivia 

Kayuvava Cayubaba 334 extinct Beni, Bolivia 


Macro-Tupi-Karibe cluster 


The Macro-Tupi-Karibe cluster is set up on the basis of evidence collected by Aryon Rodrigues, with additional evidence found by Doris Payne and Spike Gildea. 


Tupian stock (55 centuries) 
Tupi-Guarani family 
Guarani group 
Guarani language area 
Kaingwa dialect 
Bolivian Guarani dialect 
Paraguayan Guarani dialect (Avañe’e)
Chiripá-Nyandeva dialect
Chiriguano em. lang. 
Mbya Guarani dialect 
Sheta Xeta 
Guayaki Guaiaqui/Aché 
Guarayu group 
Guarayu 
Pauserna 
Sirionó
Tupi group 
Tupi language area 
Tupinamba dialect 
Southern Tupi dialect 
Jeral dialect Nhengatu 


Potiguara 


Kokama subgroup 
Kokama-Kokamilya Cocama-Cocamilla 


Omawa-Kampeva Omagua-Campeva 


Araweté 
Araweté 


Tenetehara group 
Tapirapé 
Akuawa Parakana-Asurini 

Ava Canoeiro 


Tenetehara Guajajara, Tembé 
Wayampi group 
Amanayé language area 
Anambé 
Guaja 
Urubu 
Wayampi language area 
Amapari Wayampi 
Oiampi 
Emerillon 
Takunyapé 
Kayabi group 
Kayabi Parua 
Asurini (do Xingu)
(distinct from Asurini do Tocantins, another Tupi language) 
Kawahib group 
Parintintin 
Uruewauwau 
Makiri 
Apiaka Apiaca 
Unclassified 
Kamayura 
Jo'é 
Aweti 
Mawé Sateré-Mawé 


Tenharim 
Uru-eu-uau-uau 


335a 
335b 
335c
335d 
335e 
335f 
336 

337 


338 
339 
340 


341a
341b 
341c 


341d 


342 


343 


344 


345 
346 
347 
348 


349a 
349b 
349c 


350a 
350b 
350c 
351 


352 
353 


354 
355 
356 
357 


358a 
358b 
359 
360 


5,000? 
5,000 
5,000,000 
11,900 
32,000 
5,000 


extinct 
extinct 
8,000 


extinct 


200 


102 


184 


208 
642 
56 
10,100 


200 
extinct 


800 
63 


345 
100 
extinct 
40 


279 
200? 

36 

9,000 


Mato Grosso do Sul, Brazil 
Tarija, Chiquisaca, Bolivia 
Paraguay 

Paraguay; Brazil; Argentina 
Paraguay, Bolivia, Argentina 
Brazil; Paraguay; Argentina 
Parana, Brazil 

Alto Parana, Paraguay 


Santa Cruz, Bol.; W Par. 
Beni, Bolivia 
Beni, Santa Cruz, Bolivia 


N coast of Brazil 

Brazil 

Alto Maranhao, Br.; Vaupés, Col.; 
Amazonas, Ven. 

Paraiba, Brazil 


Marañón-Huallaga, Peru; Putumayo,
Col. 
Omaguyas, Peru 


Amazonas, Brazil 


Mato Grosso, Brazil 
Xingu Park, Parana, Br. 
Goias, Brazil 
Maranhao, Brazil 


Para, Brazil 
Maranhao, Brazil 
Maranhao, Brazil 


Amapa, Brazil 
Para, Brazil 

S French Guiana 
Para, Brazil 


Mato Grosso, Brazil 
Para, Brazil 


Amazonas, Brazil 
Rondonia, Brazil 
Mato Grosso, Brazil 
Mato Grosso, Brazil 


Xingu Pk., Mato Grosso, Br.
Para, Brazil
Xingu Pk., Mato Grosso, Br.
Para, Brazil 
Munduruku group 
Munduruku 361 2,000 Para, Amazonas, Brazil 
Kuruaya Curuaia 362 52 Para, Brazil 
Juruna group 
Juruna 363 181 Xingu Pk., Mato Grosso, Br. 
Shipaya Xipaia 364 extinct Xingu River, Brazil
Manitsawa Maritsaua 365 extinct Xingu Pk., Mato Grosso, Br. 
Karitiana group 
Arikem Ariquemes 366 extinct Mato Grosso, Brazil 
Karitiana 367 150 Rondonia, Brazil 
Kabishiana 368 ig Rondonia, Brazil 
[Note: The existence of Kabishiana as a language is dubious] 
Tupari group 
Tupari 369 56 Rondonia, Brazil 
Amniapé Kanoé, Canoé 370 extinct Rondonia, Brazil 
Wayoró 371 40? Rondonia, Brazil
Makurap 372 114 Rondonia, Brazil 
Kepkiriwat 373 extinct 2, Brazil 
Ramarama group 
Ramarama-Urumi 374 extinct Mato Grosso, Brazil 
Arara (do Jiparana) Uruki 375 92 Rondonia, Brazil 
(distinct from Carib language of the same name spoken in Para) 
Itogapuk 376 95 Rondonia, Brazil 
Mondé group
Mondé Sanamai 377 30 Rondonia, Brazil 
Surui (do Jiparana) 378 800 Rondonia/Mato Grosso, Br. 
(distinct from other Tupi language of same name spoken in Para) 
Arua 379 12 Rondonia, Brazil 
(but the ‘Gaviao do Jiparana’ language may be identical)
Unclassified 
Purubora 380 50 Rondonia, Brazil 


Comparative work on Tupian has been done by Hanke, Swadesh and Rodrigues. Rodrigues (1958) reconstructs proto-Tupian phonology, and provides sound
correspondence charts and about seventy-five proto-Tupian reconstructions in annotation to a vocabulary of Amniapé.

Lemle (1971) reconstructs proto-Tupi-Guarani phonology using data from seven languages (Asurini, Guajajara, Parintintin, Kamayura, Urubú, Guarani and Kokama),
with supplementary data from three others. There are 221 cognate sets with reconstructions.

Rodrigues (1966) classifies the language of the ‘Cinta Larga’ people, whose language is Arua, in the Mondé group (or family) of the Tupian stock, with reference to 
proto-Tupian reconstructions. Rodrigues (1980) classifies Munduruku within Tupian with reference to proto-Tupian and proto-Tupi-Guarani reconstructions. Rodrigues 
(1979) classifies Sheta (Xeta) within the Tupi-Guarani family with reference to proto-Tupi-Guarani reconstructions. Rodrigues (1985) classifies the languages of the
Tupi-Guarani family according to common phonological innovations from proto-Tupi-Guarani, using data from all languages in the family.

Jensen (1984) traces the historical development of Wayampi from proto-Tupi-Guarani, and deals with both phonology and grammar. This is the major work in 
comparative Tupi-Guarani studies. Jensen (1987) surveys the evidence for pronominal agreement prefixes on verbs in Tupi-Guarani languages, and provides a historical 
account. Jensen (1990) surveys the pronominal agreement markers on verbs and predicate nouns and adjectives, and the interaction of these markers with categories of
aspect and subordination, and develops a hypothesis about the structure of proto-Tupi-Guarani, and its development into something rather different in some of the 
daughter languages. 

Dietrich (1987) classifies the Tupi-Guarani languages on the basis of both phonological and grammatical developments. It seems to equate Tupi-Guarani with 
seventeenth-century Tupinamba. 


3.4.11 Northern Amazonia 


Languages Alternative name/autonym Number Population Location 


Kariban family (37 centuries) 


Opon-Karare 381 extinct Santander, Colombia 
Yukpa group 
Yukpa-Japreria language area 
Yukpa 382a 3,000 Col./Ven. border 
Japreria 382b 80 Zulia, Venezuela 
Koyaima Coyaima 383 extinct Magdalena, Colombia 


Kari'nya group 
Kari'nya Carib, Galibi 384 10,000 Monagas, Anzoategui, Bolivar, Ven.; 
Suriname coast; Fr. Guiana coast; 
NW Guyana; Amapa, Brazil 


Tiriyó group: Tiriyó subgroup


Akuriyo Tiriyometesem 385 44 SE Suriname 

Tiriyó 386 1,150 Suriname; Para, Brazil
Tiriyó group: Karihona subgroup

Hianakoto Jianacoto 387 2 Vaupés, Caqueta, Col. 

Karihona Carijona 388 1402 Vaupés, Caqueta, Col. 
Tiriyó group: Salumá subgroup

Saluma 389 165 Para, Brazil 
Kashuyana group 

Kashuyana-Warikyana Kaxuiana 390 434 Para, Brazil 

Shikuyana Sikiana 391 33 Para, Br./Sur./Ven. Border 
Waiwai group 
Waiwai Uaiuai 392 770 Amazonas, Para, Roraima, Br.; 
Guyana 
Hishkariana Hixkaryana 393 550 Amazonas, Brazil 
Yawaperi group 
Boanari 394 extinct Amazonas, Brazil 
Yawaperi Atruahi 395 350 Amazonas, Roraima, Alto Maranhao, 
Brazil 


Paravilyana group: Sapará subgroup
Sapara 396 extinct Roraima, Brazil 
Paravilyana group: Paravilyana subgroup 
Pawishiana Pauixiana 397 extinct Roraima, Brazil 
Paravilyana Paravilhana 398 extinct Roraima, Brazil 


Pemong group: Pemong subgroup 


Makushi Macuxi, Teweya, Teueia 399 3,800 Roraima, Br.; Guyana; E Ven. 

Pemong 400 6,004 Bolivar, Ven.; Roraima, Br.; Guyana 
Kapong language area 

Patamona 401a 4,700 Guyana

Akawaio 401b 500 Roraima, Rio Branco, Br. 


(The languages in the Pemong subgroup are almost mutually intelligible.) 


Pemong group: Purukotó subgroup
Purukotó 402 extinct Ven./Br. Border


Central Cariban: Kumaná group
Kumana Chayma 403 few? Anzoategui, Sucre, Ven. 


Central Cariban: Yao group 


Tiverikoto Tivericoto 404 extinct Monagas, Ven. 
Yao 405 extinct Trinidad, Fr. Guiana 
Central Cariban: Wayana group 
Wayana 406 750 Suriname; Fr. Guiana; Para, Br. 
Arakaji 407 extinct Para, Brazil 


Central Cariban: Apalai group 
Apalai 408 450 Para, Brazil 


Central Cariban: Mapoyo-Yavarana group 
Mapoyo-Yavarana Wanai 409 22 Amazonas, Venezuela 


Central Cariban: Makiritare group 
Makiritare Maguiritari 410 270 Roraima, Br./Ven. border 
Wajumara 411 extinct Roraima, Brazil


South Amazonian Cariban: Bakairi group 
Bakairi Bacairi 412 570 Mato Grosso, Brazil 
Amonap Matipu, Kuikuro, Kalapalo 413 566 Mato Grosso, Brazil 


South Amazonian Cariban: Arara group 
Arara-Pariri Arara do Para 414 110 Para, Brazil 
(distinct from Arara do Jiparana, a Tupi language) 
Apiaká-Apingi 415 extinct Para, Brazil
(distinct from Apiaka in Mato Grosso, a surviving Tupi language) 
Juma 416 extinct Rondonia, Brazil 


(distinct from Juma in Amazonas, a possibly surviving Tupi language) 


Yaruma 417 extinct Xingu Pk., Mato Grosso, Br. 

Chikaon Txikao 418 146 Xingu Pk., Mato Grosso, Br.
Ungrouped 

Palmela 419 extinct Rondonia, Brazil 

Pimenteira 420 extinct Piaui, Brazil 

Panare 421 1,200 Bolivar, Venezuela 


Except for Opón, Yukpa, Pimenteira and Palmela (and possibly Panare), the Kariban languages are not very diverse phonologically and lexically (though more so than
Romance, for example). The groups recognized here reflect either a consensus of the opinions of the various attempts at classification, or evidence I have found that 
supports one classifier against all the others. In each case the languages included in a group are probably quite closely related. Where one or another scholar has expressed a 
different opinion about the affiliation of a particular language or dialect, it is usually because of scanty, faulty, inauthentic or misnamed data. I have ordered the groups with 
respect to each other according to similarities in phonological patterning and development, and lexical sharing, but on the basis of very limited data, principally that found 
in Loukotka (1968).

Panare appears to be more like Hishkaryana than Yukpa is. Grammatically, Panare has similarities with Kapong-Pemong-Makushi. Villalon (1991) finds Panare to be 
lexicostatistically peripheral in comparison to a fairly tight group made up of Makiritare, Kari'nya, Wayana, Bakairi, Makushi, Pemong and Kumana, and also to form a 
separate branch by comparison with the other peripheral branches Yukpa, Hianakoto and Yavarana. These groupings are hard to reconcile with the groupings that seem 
likely on phonological grounds; clearly the diversification of this not very deep family has been very complicated, possibly with contact re-established between groups that 
had become effectively separate languages. 

Further comparative work has been done on Kariban: Shafer (1963) recognizes ten or eleven consonant correspondences and three vowel correspondences to data 
from languages of the whole family. Shafer provides only ten explicit reconstructed forms, but many cognate sets. 

Girard (1971) reconstructs proto-Kariban phonology using primary data from Karib, Hishkaryana, Yukpa, Tiriyó and Waiwai, and supplementary data from most of
the forty remaining languages. There are 153 cognate sets with reconstructions. 

Gildea (1992) is a study in comparative morphosyntax, wherein the development of ergativity from accusativity is postulated, along with the extension of nominalized 
verb forms from dependent to main clauses where they function like finite verb forms. Languages are divided into three batches from the point of view of their structural 
state, but not enough languages have been studied to be able to classify the whole family. 


Language Alternative name/autonym Number Population Location 

Yanomaman family (21 centuries) 

Yanomaman language area 
Nanomam em. lang. 422a 1,500 Roraima, Amazonas, Br. 
Yanomam em. lang. 422b 12,000 Orinoco-Mavaca, Ven.
Yanam-Ninam 423 566 Roraima, Br.; Bolivar, Ven. 
Sanema Sanuma 424 1,500 Ven.; Roraima, Br. 


Comparative work has been done on Yanomaman: Migliazza (1972) reconstructs proto-Yanomaman phonology using data from the four languages of the family. There are
about 700 cognate sets with reconstructions. Migliazza (1987) compares proto-Yanomaman with proto-Panoan and proposes a genetic connection between these families,
and, by implication, a connection with Takanan as well.


Language Alternative name/autonym Number Population Location 
Sálivan family
Saliva Saliba 425 2,250 Meta, Casanare rivers, Col.; Cedeño,
Ven.
Piaroa-Mako 426 12,000 Meta, Casanare rivers, Col.;
Amazonas, Ven. 
(Piaroa and Mako may be distinct languages with mutual bilingualism; investigation needed.) 
Unclassified language 
Hoti Yuwana, Waruwaru 427 300 Bolivar, Amazonas, Ven. 
Australasia and the Pacific


4.1 Introduction 


The area covered by the maps in this chapter (Maps 
24-44) is linguistically one of the most complex 
regions in the world. The greater part of the 
Australia—Pacific area also exhibits the highest 
linguistic density in the world, for the maps that 
illustrate this chapter contain approximately 2,000 
languages, with one single area, the great island of 
New Guinea, being home to well over 1,000 distinct 
languages. In addition, most of the languages covered 
by these maps have numerous dialects and
sub-dialects, not included here for reasons of space and
legibility. 


4.2 Genetic relationships of Asia-Pacific languages


In the geographical area covered by this chapter, there
are three major language families, Austronesian,
Papuan and Australian, and two minor families,
Austro-Asiatic and Thai-Kadai, both of which are
found only on the map of the Malay peninsula.


4.2.1 The Austronesian languages 


The Austronesian language family (see Figure 4.1),
formerly known as Malayo-Polynesian, is perhaps the
largest language family in the world. Certainly it
extends over the largest geographical area of any
language family today other than Indo-European. There
are between 1,000 and 1,200 distinct Austronesian
languages, depending on the categorial criteria
employed (Tryon, ed., 1995).

Austronesian languages are spoken in most 
regions of Island South-East Asia and Oceania. They 
are spoken by the indigenous population of Taiwan 
(Formosa) and throughout the Philippines; they are 
also spoken across nearly all of Indonesia, where 
Papuan languages (see 4.2.2 below) are spoken in 
parts of Halmahera, Alor, Pantar and Papua (West 
Irian). Austronesian languages are also spoken in East 
Timor, Malaysia and Singapore, and in parts of 
southern Vietnam and Cambodia. The language or 
languages of Madagascar is/are also Austronesian. 
Austronesian languages are also spoken in the Mergui 
archipelago, off the coast of Myanmar (Burma), in 
the Andaman Islands and on Hainan Island in 
southern China. 

In Oceania, Austronesian languages are spoken 
in the coastal areas of greater New Guinea (though 
not generally in the interior, where Papuan languages 
are spoken almost exclusively), in New Britain and 
New Ireland and right down the Melanesian chain, 
through the Solomon Islands and Vanuatu, as far 
south as New Caledonia and Fiji. From there they 
extend eastwards to include all of the languages 
of Polynesia and northwards to take in all of the 
languages of Micronesia. 
Figure 4.1 Austronesian and Papuan languages (after Tryon, ed., 1995)
The total number of Austronesian speakers today
is approximately 280 million. In demographic terms,
the distribution of the Austronesian-speaking
population is quite uneven, with all but 2-3 million
speakers living west of a north-south line at about 
130° east longitude, extending from just west of the 
Caroline Islands and just east of the Bird’s Head 
on the island of New Guinea. On either side of 
this north-south line there are approximately 500 
languages spoken. 

The Austronesian language with the greatest 
number of speakers is Javanese, with 75,000,000 
speakers, followed by Malay/Indonesian with 
52,000,000 and Sundanese with 27,000,000. There are 
twenty-five Austronesian languages with over one 
million speakers. These languages account for roughly 
87 per cent of all Austronesian language speakers. On 
the other hand, the majority of the very numerous 
Austronesian languages are spoken by very small 
populations, often only a few hundred souls,
especially in the Oceania region. In Vanuatu, for example,
the ‘largest’ language has no more than 10,000 
speakers, while there are twenty languages with fewer 
than 500 speakers. 
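
As a quick check on the figures quoted above, the combined share of the three largest languages can be worked out directly. This is only an illustrative sketch: the variable names are assumptions, and the speaker numbers are simply those given in the text, in millions.

```python
# Speaker figures as quoted in the text, in millions.
total_speakers = 280
largest = {"Javanese": 75, "Malay/Indonesian": 52, "Sundanese": 27}

# Combined speakers of the three largest languages and their share of the total.
combined = sum(largest.values())
share = combined / total_speakers

print(f"{combined} million speakers, about {share:.0%} of the total")
```

On these figures the three largest languages alone account for somewhat more than half of all Austronesian speakers, which is consistent with the very uneven distribution described above.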

What of the genetic relationships of the
Austronesian languages? It is now widely accepted that the
Austronesian languages had their homeland in the 
south of mainland China and that their first landfall 
outside the mainland was on the island of Taiwan, 
some 5,000 years ago (Bellwood, 2002). From there 
they eventually dispersed southwards through the 
Philippines to Indonesia, and onward to Madagascar
and parts of the Asian mainland. Another, easterly
route out into Oceania emanated from the southern 
Philippines and eastern Indonesia, crossing the 
north coast of New Guinea and settling first in the 
New Ireland/New Britain area of eastern Papua New 
Guinea before moving rapidly south-eastwards to 
colonize the South Pacific, as far east as Easter 
Island. 

The most recent research on the genetic
relationships of the Austronesian languages basically confirms
what has been established for some time, namely that 
there exists a group of Austronesian languages known 
as Malayo-Polynesian, which embraces all of the 
Austronesian languages spoken outside Taiwan (Dahl 
1973, Blust 1977, 1978, 1982). The Malayo-Polynesian 
languages, according to the Blust model, divided into
a Western and a Central-Eastern subgroup, the latter of
which in turn split into a Central and an Eastern
subgroup. This Eastern subgroup ultimately divided
into a South Halmahera/West New Guinea subgroup, 
and a huge Oceanic subgroup, covering nearly all of 
Oceania. The Austronesian languages of Taiwan 
would then constitute a co-ordinate higher-order 
subgroup of Austronesian, as represented in the 
family tree model developed by Blust (1978), as 
is shown in Figure 4.2. 


In Figure 4.3, the Austronesian languages fall into ten 
highest-order subgroups, nine of which consist of the 
indigenous language groups of Taiwan (Formosa), the 
tenth being the huge Malayo-Polynesian subgroup, 
which encompasses all the Austronesian languages 
spoken outside Taiwan. The present distribution of 
the Austronesian languages of Taiwan is to be found 
in Map 50 (Section 5). 

The Malayo-Polynesian subgroup consists of a 
Central-Eastern Malayo-Polynesian subgroup, whose 
period of separate development was apparently quite 
short, and a loose assemblage of twenty to twenty-five 
groups known collectively as Western
Malayo-Polynesian. Western Malayo-Polynesian is not a
subgroup definable by shared innovation or by any of the
other criteria recognized by the comparative method. 

Lower down the tree, Central Malayo-Polynesian 
consists of a Central Malayo-Polynesian linkage 
(Blust 1993), defined as what remained of the 
Central-Eastern Malayo-Polynesian subgroup after 
the Eastern Malayo-Polynesian subgroup split off. 
Here again, the period of separate development 
of proto-Eastern Malayo-Polynesian was very short. 
Proto-Eastern Malayo-Polynesian was probably 
spoken somewhere around the Bird’s Head (Vogelkop) 
of New Guinea. The shared innovations upon which 
this subgroup is based are entirely lexical items. 


Austronesian
    Formosan
    Malayo-Polynesian
        Western
        Central-Eastern
            Central
            Eastern
                South Halmahera–West New Guinea
                Oceanic


Figure 4.2 Austronesian languages (after Blust 1978)
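
The branching in Figure 4.2, as described in the text, can be written down as a small parent-to-children table. The sketch below is purely illustrative: the names `TREE` and `lineage` are assumptions introduced here, and the structure encodes only the splits the text itself states for the Blust (1978) model.

```python
# Blust (1978) family tree as described in the text: parent -> children.
TREE = {
    "Austronesian": ["Formosan", "Malayo-Polynesian"],
    "Malayo-Polynesian": ["Western", "Central-Eastern"],
    "Central-Eastern": ["Central", "Eastern"],
    "Eastern": ["South Halmahera-West New Guinea", "Oceanic"],
}

def lineage(node, tree=TREE, root="Austronesian"):
    """Return the chain of subgroups from the root down to `node`."""
    # Invert the table so each child points back to its parent.
    parent_of = {child: parent for parent, kids in tree.items() for child in kids}
    chain = [node]
    while chain[-1] != root:
        chain.append(parent_of[chain[-1]])  # KeyError if node is not in the tree
    return list(reversed(chain))

print(" > ".join(lineage("Oceanic")))
```

Calling `lineage("Oceanic")` walks upward from Oceanic through Eastern, Central-Eastern and Malayo-Polynesian to the root, which makes explicit how deeply nested the Oceanic subgroup is in this model.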


The most recent research (Dyen 1995, Wolff 1995,
Blust 1999, Ross in preparation) has questioned the
validity of the highest-order subgroups of the
Austronesian family, although the integrity of the family
itself has not been challenged. Dyen and Wolff, for 
example, have advanced a Formosan-Philippines 
hypothesis, claiming that the languages of Taiwan and 
the Philippines look so similar that they must form a 
discrete subgroup of Austronesian. However, such 
hypotheses are not convincing, as they provide no 
answer to the Malayo-Polynesian hypothesis, which is 
based on the merger of proto-Austronesian *C and *t 
as proto-Malayo-Polynesian *t. Moreover,
the Formosan-Philippines hypothesis appears to be
based on similarities due to shared retentions rather
than shared innovations. Shared retentions from
the proto-language have no value as subgrouping 
evidence. 
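
The difference between a shared innovation and a shared retention can be made concrete with a toy example. The sketch below is an illustration only: the four languages and their reflexes are invented placeholders, not real data, and the function names are assumptions. It treats the merger of *C and *t as the innovation and groups only the languages that show it.

```python
# Hypothetical reflexes of proto-Austronesian *C and *t in four invented
# languages (placeholders for illustration, not real data).
REFLEXES = {
    "Lang A": {"*C": "ts", "*t": "t"},  # contrast retained
    "Lang B": {"*C": "c", "*t": "t"},   # contrast retained
    "Lang C": {"*C": "t", "*t": "t"},   # merged
    "Lang D": {"*C": "t", "*t": "t"},   # merged
}

def shares_merger(reflexes):
    """True if *C and *t have the same reflex, i.e. the merger has applied."""
    return reflexes["*C"] == reflexes["*t"]

def innovation_group(data):
    """Languages showing the merger: a candidate subgroup.

    Languages that merely keep the inherited contrast are not returned as a
    group, because a shared retention is not subgrouping evidence.
    """
    return sorted(name for name, r in data.items() if shares_merger(r))

print(innovation_group(REFLEXES))
```

In this toy data, Lang A and Lang B resemble each other in keeping the old contrast, but that resemblance is inherited and says nothing about a common intermediate ancestor; only Lang C and Lang D share a change.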

The currently favoured Austronesian family-tree
model, based on the work of Blust (1999) and Ross
(in preparation), is a modification of the Blust (1978)
family tree, as is shown in Figure 4.3.


South Halmahera-West New Guinea is a clearly 
defined subgroup (Ross 1995a), although Blust 
(1993) prefers to regard it as only a linkage. 
The status of Oceanic as a subgroup, however, is 
indisputable (Dempwolff 1937); it comprises half 
of all Austronesian languages. Proto-Oceanic was 
spoken in the Bismarck Archipelago (the group 
of islands to the east of the Papua New Guinea 
mainland). 

The dispersal of the Austronesian languages from 
Taiwan, through the islands of South-East Asia and 
out into the Pacific, was remarkably rapid, taking as 
little as 5,000 years from beginning to end. Indeed 
Ross (in press) considers that the Central-Eastern 
and Eastern Malayo-Polynesian subgroups were but 
tiny blips in the rapid eastward movement of the 
Austronesians. 


Figure 4.3 Austronesian languages (after Blust 1999 and Ross) 


4.2.2 The Papuan languages 


There are some 750 Papuan languages spoken in the 
Asia-Pacific region, nearly all of which are located in 
the Greater New Guinea area; that is, in Papua New 
Guinea and the province of Indonesia known today 
as Papua (formerly Irian Jaya, and earlier still Dutch 
New Guinea). Papuan languages are also spoken on 
the islands of Timor, Alor and Pantar, and in the 
northern half of Halmahera in Indonesia, and also in 
parts of the newly independent state of East Timor. 
To the east of the great island of New Guinea, they are 
also spoken in parts of New Ireland, New Britain 
and Bougainville (all in Papua New Guinea), and in 
pockets in the Solomon Islands, where the languages 
are predominantly Austronesian. 

The Papuan languages constitute a number of 
separate and unrelated language groups. Papuan is a 
cover term for some 750 languages of the New 
Guinea region that are not Austronesian or Aus- 
tralian. The label ‘Papuan’ says nothing about the 
genealogical unity of the languages, and no one 
has ever claimed that they constitute a single genea- 
logical unit, although it is not impossible that in the 
future linguistic research may demonstrate such a 
unity. 

McElhanon and Voorhoeve (1970) found a set 
of 91 lexical resemblances indicating a distant 
genealogical relationship between the geographically 
separated Central and South New Guinea and 
Finisterre-Huon groups. They named the combined 
group the Trans-New Guinea phylum. They did make 
some observations about sound correspondences 
between the two groups, but not systematically. The 
true application of the comparative method, required 
to establish reliable subgrouping, did not take place 
until recently (Pawley 1995, 1998, 2001). 

Wurm (ed. 1975, 1982) contains the first over- 
view of Papuan languages and an extensive set of 
hypotheses about possible families and subgroups. In 
his 1975 volume, Wurm proposes an expanded Trans- 
New Guinea phylum, containing nearly two-thirds of 
all Papuan languages, spread right along the central 
cordillera of New Guinea and extending out to the 
easternmost islands of Indonesia and to the islands 
to the east of today’s Papua New Guinea, referred 
to above. The major exclusions were groups of 
languages in many of the valleys to the north of the 
central divide and much of the north coast. These 
languages were considered as separate phylic groups. 
Wurm recognized ten groups, including the Trans- 
New Guinea phylum. 

Foley (1986) remained unconvinced by 
Wurm’s expanded Trans-New Guinea hypothesis, 
stating that it lacked the proof that only the 
comparative method could provide. He accepted 
(Foley 1986: 13-14) only the low-level family groupings in Wurm’s 
classification, recognizing up to 60 distinct low-level 
groups. 

Wurm’s classification was mainly lexicostatisti- 
cally based, supplemented by typological similarities. 
However, such similarities are commonly contact- 
induced and so are of no diagnostic value for sub- 
grouping purposes. He also invoked substratum 
influence, the outcome of imperfect learning of a lan- 
guage. However, Ross considers that a more common 
contact-induced change is metatypy, or a change in 
grammatical type (in the original language). 

Ross (1995b) carried out a survey of pronouns 
and says that ‘it was easy to recognize forms from 
across a large area which reflected a single set of Proto 
TNG forms. This area was basically the main cordil- 
lera of New Guinea, much of the territory to its south, 
and an area in the north-east that included the Huon 
Peninsula and Umboi Island. It was smaller than 
Wurm’s Trans-New Guinea phylum, but nonetheless 
occupied a large portion of New Guinea.’ 

Ross performed a bottom-up reconstruction, 
beginning with microgroups, tabulating both free and 
bound pronominal forms. The bound forms are pos- 
sessor or object forms. These bound forms are the 
outcome of the cliticization and affixation of the free 
forms. 

In the course of his survey, Ross examined data 
from 605 Papuan languages, and provisionally 
assigned 359 of them to the Trans-New Guinea 
family, the major Papuan language grouping by far. 

Figure 4.4 Trans-New Guinea family (after Ross, in press) 


Ross (in press) distinguishes twenty-three fam- 
ilies of Papuan languages. He defines a family as a set 
of languages that appear to be genealogically related 
to one another; that is, they are descended from a 
common ancestor, a protolanguage, but are unrelated 
on current evidence to the languages of any other 
family. A subgroup is a genealogically related set 
of languages, but one that forms part of a family. 
Its members are considered on present evidence to 
be more closely related to each other than to 
other members of the family. That is, they share a 
common ancestor not shared by other members of 
the family. 
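
These definitions amount to statements about a tree. As a minimal sketch, the Austronesian tree of Figure 4.2 (after Blust 1978) can be stored as child-to-parent links and queried; the function names are hypothetical, and the Austronesian data are used here only because that tree is given in full earlier in the chapter.

```python
# Parent links follow the family tree of Figure 4.2 (after Blust 1978).
PARENT = {
    "Formosan": "Austronesian",
    "Malayo-Polynesian": "Austronesian",
    "Western": "Malayo-Polynesian",
    "Central-Eastern": "Malayo-Polynesian",
    "Central": "Central-Eastern",
    "Eastern": "Central-Eastern",
    "South Halmahera-West New Guinea": "Eastern",
    "Oceanic": "Eastern",
}


def ancestors(node):
    """Return the chain from the node up to the root of its family, node first."""
    chain = [node]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def common_ancestor(a, b):
    """Lowest node dominating both a and b, or None if they lie in different families."""
    seen = set(ancestors(a))
    for node in ancestors(b):
        if node in seen:
            return node
    return None


def more_closely_related(a, b, c):
    """True if a and b share an ancestor that is not also an ancestor of c."""
    shared = common_ancestor(a, b)
    return shared is not None and shared not in ancestors(c)
```

On this representation a family is everything under one root, and a subgroup is any node below the root: `more_closely_related("Oceanic", "South Halmahera-West New Guinea", "Western")` holds because the first two share Eastern, which does not dominate Western. A name absent from the table, such as "Trans-New Guinea", is treated as the root of a separate family, so `common_ancestor` returns `None` for it.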

The twenty-three families of Papuan languages 
recognized by Ross (in press), based on the applica- 
tion of the comparative method to pronominal sys- 
tems, are shown in the following tentative revised list: 


1 ‘extended West Papuan’ (?) 
  1a West Papuan 
  1b East Bird’s Head, Sentani, Burmeso, Tause 
  1c Yava 
2 Mairasi 
3 East Cenderawasih Bay 
4 Lakes Plain 
5 Orya-Mawes-Tor-Kwerba 
6 Nimboran 
7 Skou 
8 Border 
9 Left May-Kwomtari 
  9a Kwomtari 
  9b Left May 
10 Senagi 
11 Torricelli (three separate areas) 
12 Sepik 
13 Ramu-Lower Sepik 
  13a Lower Sepik 
  13b Ramu 
14 Yuat 
15 Piawi 
16 South-Central Papuan 
  16a Yelmek-Maklew 
  16b Morehead-Upper Maro 
  16c Pahoturi 
17 Eastern Trans-Fly 
18 Trans-New Guinea 
19 Yele-West New Britain (?) (Yele, Aném, Ata) 
20 East New Britain (Baining, Taulil, Butam) 
21 North Bougainville (Konua, Rotokas) 
22 South Bougainville (Nagovisi, Nasioi, Motuna, Buin) 
23 Central Solomons (Bilua, Baniata, Lavukaleve, Savosavo) 


Isolates are identified in four areas: 


• In the north of West Papua: the isolates are 
Abinomn and Isirawa. Abinomn is a previously 
unrecorded isolate reported by Mark Donohue. 
Isirawa was classified by Voorhoeve (1975) as a 
family-level isolate within the Dani-Kwerba stock 
of the Trans-New Guinea phylum. 

• In the Sandaun Province of Papua New Guinea: 
the isolates Karkar, Busa and Yalé are also isolates 
in Wurm (1975). 

• Near the mouth of the Sepik: there is one isolate, 
Taiap (=Gapun), classified by Laycock and 
Zgraggen (1975) as a ‘sub-phylum-level isolate’ 
of the Sepik-Ramu phylum. 

• In North-West Island Melanesia: there are three 
isolates, Sulka and Kol (New Britain) and Kuot 
(New Ireland), classified by Wurm (1975) as 
members of his East Papuan phylum. 


Ross (in press) also proposes a revised classification of 
the very large Trans-New Guinea family, with groups, 
subgroups and isolates as follows (from west to east): 


West Trans-New Guinea linkage 
West Timor-Alor-Pantar 
East Timor 
West Bomberai 
Wissel Lakes 
Dani 


Tanah Merah, Mor, Dem, Uhunduni 
Mek 

Kaure (?) 

Pauwasi (?) 

Kayagar 

Kolopom 

Moraori 

Kiwai-Porome 

Marind 




Asmat 

Awyu-Dumut 

Mombum 

Ok 

Oksapmin 

Gogodala-Suki 

Tirio 

Eleman 

Inland Gulf 

Turama-Kikori 

Teberan (?) 

Pawaian 

Angan 
Angaataha 
Nuclear Angan 

West Kutubu 

East Kutubu 

Duna-Pogaia 

Awin-Pa 

East Strickland 

Bosavi 


Kamula 
Engan 
Wiru 
Chimbu-Wahgi 
Kainantu-Goroka 
Gorokan 
Kainantu 
Madang (=Madang-Adelbert Range) 
Southern Adelbert Range-Korak-Waskia 
Rai Coast 
Kalam-Kobon 
Croisilles 
Finisterre-Huon 
Finisterre 
Huon 
Central and Southeast Papuan 
Goilalan (?) 
Koiarian 
Kwalean 
Manubaran 
Yareban 
Mailuan 
Dagan 
Binanderean 


4.2.2.1 Conclusions 


The linguistic geographies of the New Guinea main- 
land and of North-West Island Melanesia differ 
sharply. On the mainland (see Figure 4.1) the overall 
pattern is of a single large group, the Trans-New 
Guinea family, whose members are spoken along 
the length of the main cordillera and in much of 
the region to the south of it, as well as in parts of 
Timor, and on Alor and Pantar. Most of the region 
to the north of the cordillera, as well as northern 
Halmahera, is occupied by non-Trans-New Guinea 
families. Apparently non-Trans-New Guinea families 
also occupy a discontinuous region of the south coast 
of New Guinea. 

In North-West Island Melanesia, east of the New 
Guinea mainland, the situation is very different. In 
this area there is a thin scattering of individual 
Papuan languages surrounded by Austronesian 
speakers. Before the arrival of the Austronesians, the 
region as far east as the central Solomons, and 
perhaps even as far as the Santa Cruz group in the far 
south-east of the Solomons archipelago, was 
peopled by speakers of Papuan languages. Thus 
today’s east Papuan languages are linguistic survivals 
from this early period, in a region where Papuan lan- 
guages have generally been displaced by Austronesian 
languages. It should be noted, however, that the status 
of some of the languages of the Santa Cruz group 
as Papuan is not universally accepted (Wurm 1978, 
Lincoln 1978). 

No east Papuan language family has more than 
four members. Typological similarities suggest that 
this is a result of pre-Austronesian contact among the 
Papuan languages of the North-West Island Melanesia 
area. 

The New Guinea area has been settled for 
40,000-60,000 years, according to archaeological evi- 
dence (Spriggs 1997). This enormous time-depth 
accounts for the amazing number and diversity of 
New Guinea languages, especially the Papuan lan- 
guages, and is a serious obstacle to detecting language 
relationships. 

The confirmation of the existence of the 
Trans-New Guinea family (see Figure 4.4) and its 
hundreds of member languages through the rigorous 
application of the comparative method has been a 
major achievement in the past decade, even if the 
classification is at present based only on the 
historical reconstruction of pronominal forms. 

So where did the Trans-New Guinea family 
originate and when did it begin its spread across the 
huge area that it occupies today? The current view is 
that the Trans-New Guinea family had its origins in 
the east-central highlands area, spreading along the 
cordillera both east and west, and down into the 
southern highlands, moving into areas inhabited by 
non-Trans-New Guinea family speakers. The 
dispersal of the Trans-New Guinea family is associated 
with the domestication of the food-plant taro, which 
may have begun as early as 9000 BP. For a number 
of reasons, however, it is considered more plausible 
that the Trans-New Guinea dispersal began about 
6000 BP. 

Much of the western part of what became New 
Guinea, south of the cordillera, was under water 
about that time; the inundation peaked about 6000 BP 
and the waters have been retreating ever since. 
This would imply that groups such as 
the Marind, Awyu-Dumut and Asmat expanded into 
the area as the waters retreated. 


4.2.3 The Australian languages 


There are approximately 150 Australian Aboriginal 
languages extant today, a significant reduction from 
the 300 languages estimated at the time of first Euro- 
pean contact in 1770 (Dixon 1980, Wurm 1982, 
Dixon 2002). Many of these languages are on the 
point of extinction, spoken by only a handful of 
elderly people, with no possibility of transmission to 
younger generations. Almost inevitably, in the face of 
the advance of modern communication technology, 
half of these languages will have disappeared within a 
generation or two. The Aboriginal population of 
Australia does not exceed 300,000, with only half that 
number at best using their indigenous language on a 
daily basis. 

The indigenous languages of Australia are 
unrelated to any other neighbouring language group 
in South-East Asia or Oceania, including greater New 
Guinea. Indeed, it has not been demonstrated by the 
systematic application of the comparative method 
that all Australian languages descend from a common 
proto-Australian, although many leading Australian 
specialists assume the existence of such a language. 
The position of the indigenous languages of Tasmania 
is unclear, as the last speaker of a Tasmanian language 
died in 1877. The language data surviving suggest 
that the Tasmanian languages were probably related 
to the languages of the Australian mainland, though 
they are so fragmentary as to render a clear assign- 
ment unlikely. 

Australia has been settled for some 60,000 years, 
so it is not surprising that the task of reconstructing 
the higher-order language subgroups is one of con- 
siderable difficulty. Indeed Dixon (2002: xix) claims 
that ‘the question of whether all Australian languages 
go back to a single ancestor is not answerable, because 
of the great time-depth involved’. However, the 
younger generation of Australian specialists, led by 
Evans (2003), are making significant progress with a 
systematic bottom-up application of the comparative 
method to the quite diverse non-Pama-Nyungan 
languages of northern Australia. 

The subgroups of the languages of Australia, 
if one assumes a common ancestor, have a remarkably 
uneven distribution. One single family (or major 
subgroup), called Pama-Nyungan, occupies roughly 
seven-eighths of the continent, while the remaining 
twenty-five families are crowded into the remaining 
one-eighth, the area extending from the north-west 
of Western Australia and across the top end of 
the Northern Territory as far east as the Gulf of 
Carpentaria (see Map 35). 

The languages of Australia, following the recent 
work of Evans (ed. 2003), fall into the following 
families: 


1 Nyulnyulan family 
2 Worrorran family 
3 Bunaban family 
4 Jarrakan family 
5 Mindi family 
6 Southern Daly family 
7 Western Daly family 
8 Eastern Daly family 
9 Northern Daly family 
10 Anson Bay family 
11 Kungarakany family 
12 Larrakiyan family 
13 Tiwi family 
14 Limilngan family 
15 Umbugarla/Ngumbur family 
16 Gaagudju family 
17 Iwaidjan family 
18 Giimbiyu family 
19 Gunwinyguan family 
20 Maningrida family 
21 Maran family 
22 Anindilyakwa family 
23 Wardaman/Wagoman family 
24 Garrwan family 
25 Tangkic family 
26 Pama-Nyungan family 


At present there have been no serious attempts 
to group these families into higher-order 
subgroups, although some suggestions are currently 
being followed up (Evans ed., op. cit.). 

The huge Pama-Nyungan family, which extends 
over seven-eighths of the Australian continent, is 
currently considered (Bowern and Koch eds., 2004; 
Koch personal communication) to consist of the 
following sub-groups: 

Yolngu 

Djinang 
Dhaangu 


Dhuwal 
Dhuwala 
Dhuwaya 
Djinba 
Ritharrngu 
Dhay'yi 
Marrngu 
Karajarri 
Mangarla 
Nyangumarta 
Ngumpin-Yapa (McConvell and Laughren 2004) 
Ngumpin 
Juwarliny 
Walmajarri 
Jaru 
Gurindji 
Mudburra 
Ngarinyman 
Yapa 
Warlmanpa 
Warlpiri 
Kartangaruru 
Ngarti 
Ngayarta (Dench 1998a, O’Grady 1966) 
Ngarla 
Nyamal 
Panyjima 
Jurruru 
Yinhawangka 
Ngarlawangka 
Kariyarra 
Ngarluma 
Martuthunira 
Nhuwala 
Yindjibarndi 
Kurrama 
Kanyara-Mantharta (Austin 1988, Austin 1981) 
Mantharta 
Warriyangka 
Thiin 
Jiwarli 
Tharrgari 
Kanyara 
Pinikura 
Thalanyji 
Payungu 
Burduna 
Kardu 
Yingkarta 
Maya 
Malgana 
Pulinya 
Wajarri (Marmion 1996) 
Wirtimay 
Patimaya (Dunn 1988) 
Nhanda (Blevins 2001) 
Wati/Western Desert (cf. Thieberger 1993, 
ch. 4.6) 
Gugadja 
Wangkajunga 
Pintupi 
(Yumu) 
Kukatja 
Pitjantjatjara 
Yankuntjatjara 
Yulparija 
Nyangatjarra 
Yiltawangka? 
Warnman 
Manjiljarra 
Kartutjarra 
Putitjarra 
Kiyatjarra 
Ngaatatjarra 
Nhanyatjarra 
Pitungu 
Ngaanyatjarra 
Tjalkatjarra 
Kuwarra 
Nyanganyatjarra 
Mantjintja 
Nakaku 
Wirrtjarakantja 
Antakarrinya 
Kukata 
Ngaliya 
Pintini 
Murunitja 
Wangkayi 
Ngurlu 
Martu Wangka 
Nyungar (Douglas 1976) 
Nankarti 
Watjanmay 
Tjapanmay 
Karlamay 
Pipelman 
Kwetjman 
Mirnong 
Kaniyang 
Mirniny (Brandenstein 1980, Nash 2002) 
Kalaaku 
Ngadyumaya 
Wudjari 
Mirniny 
Thura-Yura (Simpson and Hercus 2004) 
Kuyani 
Barngarla 
Nawo 
Nukunu 
Kaurna 
Narangga 
Adnyamathanha 
Ngadjuri 
Wirangu 
Warumungu 
Arandic (Koch 2004) 
Kaytetye 
Aranda 
Anmatyerr 
Alyawerr 
Ayerrereng 
Antekerrepenh 
Western Arrernte 
Eastern Arrernte 
Lower Arrernte 
Warluwarric (Koch 1989, Brammall 1991, 
Carew 1993, Breen 2004) 
Wakaya 
Warluwarra-Bularnu 
Yanyuwa 
Yinjilanji 
?Kalkatungic 
?Wakapunga 
Kalkatungu 
Yalarnnga 
Galimbamu 
Paman 
Northern Paman (Black 2004) 
Yumukuntyi: dialect of Uradhi 
Yaraytyana: dialect of Uradhi 
Atampaya: dialect of Uradhi 
Angkamuthi: dialect of Uradhi 
Utyangikwathiya: dialect of Uradhi 
Mutyanthi: dialect of Uradhi 
Uradhi (Crowley 1983) 
Wudhadhi: dialect of Uradhi 
Mpakwithi (Crowley 1981) 
Tjungundji (Crowley 1981: 149) 
Mpalityanh 

Yupngayth (Crowley 1981: 149) 
Alngith 

Luthigh 

Linngithigh 

Ngkoth 

Yinwum 

Aritinngithigh 

Awngthim 

Ndra'ngith (Crowley 1981: 149) 
Mbiywom 


Middle Paman (Smith and Johnson 2000, Black 2004) 
Wik Ayangenycha 
Wik-Ep = Wik Me'anh 
Wik Me'anh 
Wik Ompon 
Wik Paach 
Wik Ngathan (patrilect of Mungkan) 
Wik Ngatharra (patrilect of Mungkan) 
Wik Mungkan 
Kugu Nganhcara (Smith and Johnson 2000) 
Kugu Ugbanh (patrilect of Nganhcara) 
Kugu Muminh (patrilect of Nganhcara) 
Kugu Uwanh (patrilect of Nganhcara) 
Kugu Mu'inh (patrilect of Nganhcara) 
Kugu Mangk = Kugu Yi'anh (patrilect of Nganhcara) 
Wik Iyanh (patrilect of Nganhcara) 
Wik Keyangan 
Pakanh 
Ayapathu 
Kuuku Ya'u 
Kaantju 
Umpila 


Southwest Paman (Black 2004, Alpher 2004, Alpher 1972) 
Kuuk Yak 
Kuuk Thaayorre 
Yir-Thangedl 
Yir-Yoront (Alpher 1991) 
Koko-Bera 
Kwanthar 
Kok-Nar 
Uw Oykangand 
Olkola 
?Walangama 
Norman Paman (Black 1980) 
Kurrtjar 
Areba 
Kuthant 
?Kukatj (Breen 1976) 
Rarmul Paman 
Thaypan, + Rarmul dialect 
(Rigsby 1976) 
Aghu-Tharnggal 
?Lamalamic 
Umbindhamu 
Umbuygamu 
Mbariman-Gudhinma 
Lama-Lama 
Gugu Warra 
Southern Paman 
Agwamin 
Mbabaram (Dixon 1991) 
Ngaygungu: dialect of Warrungu? 
(Dixon 1991) 
Dagalag 
Wamin 
Mbara 
Flinders Island 
Barrow Point 
Guugu Yimidhirr (Haviland 1979) 
Yalanjic 
Kuku Yalanji (Patz 2002) 
Kuku Jangkun (dialect of Kuku 
Yalanji) 
Muluriji (dialect of Kuku Yalanji) 
Yidinyic 
Dyabugay (Patz 1991) 
Yidiny (Dixon 1977) 
Kala Lagaw Ya 
Dyirbal 
Bandyi 
Warrgamay 
Nyawaygi 
Wulgur 
Bindal 
Yuru 
Maric (Breen 1990: ch. 4, Terrill 1998) 
Gugu Badhun 
Warrungu 
Gudjal 
Yilba 


Biri 
Miyan 
Giya 
Yiningayi 
Yangga 
Wiri 
Yuwibara 
Baradha 
Yambina 
Yagalingu 
Wangan 
Barnya 
Gabalbara 
Gayiri 
Gangulu 
Wadjigu 
Wadjabangayi + Yandjibara 
(Breen 1990: 65) 
Garingbal 
Bidyara 
Wadjalang (Breen 1990: 68) 
Nguri 
Gungabula 
Yiman 
Margany 
Gunya 
Mandandanyi 
Guwamu 
Gunggari 
Guwa (Breen 1990: ch. 5) 
Yanda (Breen 1990: ch. 6) 
Yirandhali 
Kungkari (Breen 1990: ch. 3) 
Kulumali 
Kungatutji 
Pirriya/Birria (Breen 1990: ch. 2) 
Mayi (Breen 1981) 
Mayi-Kutuna 
Mayi-Kulan 
Mayi-Yapi 
Ngawun 
Mayi-Thakurti 
Wunumara 
Karnic (Bowern 2001, Bowern 1998) 
North-western 
Wangkamadla 
Rangwa 
Pitta-Pitta (Blake 1979) 
Wangka-Yutjurru 
Ringu-Ringu 
Rakaya 
Ngulupulu 
Karanya 
Yurlayurlanya 
Kunkalanya 
Lhanima 
Mayawarli 
South-western (Hercus 1994) 
Wangkangurru 
Arabana 
Central 
Mithaka 


?Karruwali (Breen 1990: vii) 
?Marrulha (Breen 1990: vii) 


Yarluyandi 
Ngamini 
Yawarrawarrka 
Yandruwandha 
Ngurawarla 
Diyari 
Dhirrari 
?Pirlatapa 
Eastern 
?Karendala 
?Yarumarra 
Wangkumara 
Punthamara 
?Tereila 
Garlali 
?Mambangura 
(Breen 1990: vii) 
Bitjara 
Mingabari 
Yarli (Hercus 2004) 


Wadikali 
Malyangapa 
Yardliyawara 


Muruwari (Oates 1988) 
Central New South Wales 


Bayali 


Gamilaraay-Yuwaalaraay-Yuwaalayaay (Ash et al. 2003) 
Wangaybuwan-Ngiyambaa (Donaldson 1980) 
Wiradhuri 


Dharumbal (Terrill 2002) 


Koinjmal 


Waka-Kabic (Laffan 2003) 


Goreng 


Kabi 


Wakka 


Yagara 
?Durubulic 


Bandjalang 
Gumbaynggiric 


?Yuin-Kuri 
?Kuri 
?Iyora 


Yuin 


Goreng-Goreng 
(Daribalang > 
Goeng-Goeng) 


Kabi-Kabi, + Badjala 
(Fraser Island) 


Wuli-Wuli 

Wakka-Wakka, + 
Duungidjawu 
(southernmost dialect) 

Gayabara 

Barunggam 

Muringam 


Guwar 
Nunungal 
Djendewal 


Gumbaynggir 
Yaygir 


Yugambal 
Nganyaywana 
Ngagu 
Ngambaa 
Dyangadi 
Kathang 
Awabakal 


Darkinyung 
Dharuk 


Ngunawal 
Ngarigu 
Thawa 
Dyirringany 
Dhurga 
Dharawal 
Gundungura 


Paakantyi (Hercus 1982) 


Lower Murray 


Bandjigali 
Baarundji 

Gurnu 

Nabarlgu 
Wanyubarlgu 
Wilyagali 
Dhanggagarli 
Southern Paakantyi 
Barundji 

Marawara 


Ngarinyeri 
Peramangk 
Ngayawung 
Yuyu 
Keramin 
Yitha-Yitha 


Kulin (Blake and Reid 1998) 


Wemba Wemba 
Nari Nari 
Wathawurung 


Kolakngat 
Wuywurrung 
Bungandidjan 
Bungandidj (Blake 2003a) 
Kuurn Kopan Noot 
(Warrnambool language 
in Blake 2003b) 
Yorta-Yorta (Bowe and Morey 1999) 
Yorta-Yorta 
Yabula-Yabula 
Pallanganmiddang (Blake and Reid 1999) 
Dhudhuroa (Blake and Reid 2002) 
Ganay: Muk-Thang 


4.3 Languages and language roles in Australasia and the Pacific 


While Map 25 provides an overview of the pidgins, 
creoles and languages of wider communication in the 
region, the great majority of the languages shown 
on Maps 26-44 are local languages with no wider 
currency beyond their immediate areas. 

In a few areas, local languages have been used as 
regional lingua francas and trade languages. Malay, 
for instance, has been adopted as the national lan- 
guage of Malaysia. In Indonesia, Indonesian (see 
below) is the national language and paramount 
lingua franca. Similarly, ten of the Philippine lan- 
guages have been languages of wider currency, with 
one of them, Tagalog, constituting the basis of 
Pilipino (Filipino), the national language of the 
Philippines, which is taught in schools throughout 
the country. However, in most of Melanesia, including 
the New Guinea area, English-based pidgin languages 
are the paramount lingua francas, with one, Bislama, 
the national language of Vanuatu. In bygone days, a 
similar situation prevailed in Micronesia and parts of 
Polynesia. 


4.3.1 Micronesia and Polynesia 


Forms of Pidgin English used to be current in Micro- 
nesia and Polynesia, but have yielded to English as a 
lingua franca, except for some remnants in the Hawaiian 
Islands, Sapuahfik (Caroline Islands) and Nauru. In 
French Polynesia, Tahitian is increasingly becoming 
the general lingua franca of the indigenous popula- 
tion, and has been gradually replacing the dialects 
of the Austral Islands, and Pa'umotu in the western 
islands of the Tuamotu Archipelago, and even 
threatens Mangarevan in the Gambier archipelago. 
It is now the native language of about 100,000 people 
and a second language for about 15,000. A pidginized 
form of Tahitian, called Parau Tinito, is also used, 
mainly for intercommunication between Chinese and 
Tahitians. Its speakers may number around 3,000 
today. French is the normal language of education 
and broader social interaction. 


4.3.2 Vanuatu 


In Vanuatu, the former New Hebrides, the one 
hundred plus indigenous languages have only local 
significance. The only exception is Mota, the 
language of one island in the Banks Islands, which was 
used by the Melanesian Mission as a church language 
from the late nineteenth century until 1931 in what is 
today Vanuatu, and in the south-eastern Solomons to 
the north. Many older islanders still have some know- 
ledge of it today in those areas. The lingua franca and 
national language of Vanuatu is Bislama, another 
elaborate English-based pidgin language sharing 
features with Solomon Pijin and also with Tok Pisin. 
It has close to 200,000 speakers. English and French 
are official languages and languages of education. 


4.3.3 New Caledonia 


In New Caledonia and the Loyalty Islands, most of 
the twenty-eight indigenous languages have only local 
significance. Only one indigenous language, Wailu 
(Houailou, Ajié), was adopted as a church lingua 
franca in the Protestant Church on the east coast of 
the mainland. It still has some currency beyond its 
boundaries, although the usual lingua franca and 
official language of New Caledonia is French. 


4.3.4 Solomon Islands 


In the Solomon Islands there are more than sixty 
local vernaculars. Of these a few missionary and 
church languages are used as regional lingua francas, 
such as Babatana on Choiseul, Roviana and Marovo 
on New Georgia, and Gari on Guadalcanal. The 
Kwara'ae language of Malaita has diffused into many 
parts of the Solomon Islands as a result of Malaitan 
expansion and immigration, with many Solomon 
Islanders having some knowledge of it. The only 
lingua franca of wider intercommunication is 
Solomon Pijin, another English-based pidgin lan- 
guage, and close cousin of Bislama (Vanuatu) and 
Tok Pisin (Papua New Guinea). A larger percentage of 
its vocabulary is derived from English than is the case 
with Tok Pisin. The language of education is English, 
which plays a more important role in the Solomons 
than in Papua New Guinea. 


4.3.5 Papua New Guinea 


In Papua New Guinea, most of the over 750 indigen- 
ous languages have only local significance, though 
active and passive bi- and multi-lingualism among 
speakers of neighbouring languages is widespread. 
Some of the few numerically comparatively large 
languages have limited broader functions in being 
used as local government council languages in local 
administration instead of Tok Pisin (see below), 
at least orally. Such languages are, for instance, the 
highlands languages Enga (240,000 speakers), Wahgi 
(60,000 speakers), Kuman (or Chimbu proper, 80,000 
speakers) and Hagen (100,000 speakers). A few of the 
originally numerically quite small languages, which 
had been adopted by missions as missionary and 
church languages, have become regional lingua 
francas for general use and are now used as local 
government council languages instead of Tok Pisin, 
for example Kate and Yabim in the Morobe Province, 
and Tolai in northern New Britain and adjacent 
areas. The same has been the case with some trade 
languages existing in pre-European times, such as 
Suau in the Milne Bay Province. There are also a few 
other native lingua francas of limited currency, such 
as Gogodala in the Western Province. Some of these 
are pidginized languages, such as Pidgin Dobu in the 
D’Entrecasteaux Islands in the Milne Bay Province. 
The large lingua francas of wide intercommunication 
in Papua New Guinea are Police Motu (or Hiri Motu) 
and Tok Pisin. The former, which has over 200,000 
speakers in much of southern Papua New Guinea 
(in what was formerly the Territory of Papua), is a 
pidginized form of the Austronesian Motu language 
of the Port Moresby area. It was first used by the 
indigenous police force and spread by them — hence 
its name. The most important lingua franca, Tok 
Pisin, which is often referred to as the unofficial 
national language of the country, has over 2 million 
speakers. It is an English-based pidgin which has 
developed and been used largely as a language of 
Papua-New Guinean inter-group communication, 
and is a highly developed, quite complex language, a 
so-called expanded pidgin. It has elevated functions 
such as being the primary language of debate in the 
Papua New Guinea Parliament, as well as a media 
language. English, which is the third official language 
of the country, has little more than 200,000 speakers 
and is more an élite language than a language of 
wider communication. 


4.3.6 Australia 


In Australia, the surviving languages of the Abori- 
ginals have mostly only local significance, except that 
there is widespread active and passive bi- and 
multi-lingualism which allows for quite wide 
intercommunication between speakers of different 
languages. A few Australian languages had been adopted 
by missions as missionary and church lingua francas, 
with some of these expanding into general regional 
lingua francas. The most important is the Western 
Desert language in north-western South Australia 
and southern Western Australia, with approximately 
5,000 speakers. Others are, for instance, Dhuwal- 
Dhuwala in the north-east corner of Arnhem Land, 
with 3,500 first- and second-language speakers, as 
well as Arrernte (3,500) and Warlpiri (3,000) in central 
Australia. Some Aboriginal lingua francas had not 
been missionary lingua francas before, for example 
Walmajarri in a northern portion of Western 
Australia (1,300 native speakers) and Kuku Yalanji in 
the south-eastern part of Cape York Peninsula (300 
native speakers). Of course the major languages of 
wider communication are the two English-based 
creoles, Torres Strait Creole, with 20,000 speakers in 
the Torres Strait Islands and in north Queensland, 
and Kriol, also with 20,000 speakers, across northern 
Western Australia, the Northern Territory and western 
Queensland. 


4.3.7 Indonesia 


In Indonesia, there is a somewhat unusual situation. 
There are many languages, especially in Irian Jaya and 
Eastern Indonesia, that have relatively small to very 
small numbers of speakers. These all have only local 
currency. However, in the western part of Indonesia, 
there are a number of languages with large numbers 
of speakers, such as Javanese with about 75 million 
speakers, Sundanese with 27 million and Madurese 
with 15 million. Nevertheless, these languages also 
have only local significance. The reason for this is that 
there is a single language functioning as the general 
lingua franca and as the national language of 
Indonesia; that is, Indonesian. Today it has 
approximately 30 million mother-tongue speakers and 
140 million second-language users. This language, 
based on standard Malay, but expanded and subjected 
to extensive language-planning procedures, had been 
chosen by the Indonesian leaders long before the 
political independence of the Republic in 1949. 
Indonesian was to become the national language and 
general lingua franca of Indonesia, because it was not 
the mother tongue of any Indonesian citizen, a fact 
that the authorities hoped would forestall — or at 
least lessen — language rivalries and the danger of 
language-based separation movements. This situation 
would have certainly become acute if one of the large 
languages in Indonesia, such as Javanese, had been 
chosen as the national language and general lingua 
franca of the country. Various forms of Malay, the 
best known of which is Bazaar Malay, had been used 
as trade languages for centuries in what is today 
Indonesia, and Malay had been employed as the 
administrative language by the Dutch in colonial 
times. Malay, in its various forms, then, had been 
familiar to Indonesians as a language of wider com- 
munication for generations preceding independence. 

In addition to Indonesian, some regional lingua 
francas do exist in Indonesia: namely, Ngaju Dayak in 
the south-eastern part of Kalimantan (Borneo), 
Tetum on much of the island of Timor, Ternate and 
Tidore in the Halmahera Islands to the west of Irian 
Jaya, and Biak in Irian Jaya itself. 


4.3.8 Philippines 


There are approximately 150 languages spoken in 
the Philippines, all of them Austronesian. With the 
exception of ten major languages, all of these have 
only local significance. Of the ten languages, Tagalog 
is the dominant language, with over 17 million 
first-language speakers, mainly in central and south- 
western Luzon. A standardized variety of Tagalog, 
called Filipino, is the major language of the media in 
the Philippines, in the movies and in comics through- 
out the country. It is spoken and/or understood by 
well over 50 per cent of the total population and has 
been the national language since 1946. Cebuano, with 
15 million speakers, is used as a widespread lingua 
franca in the central Philippines and on Mindanao in 
the south. Native speakers of these two languages 
make up almost 40 per cent of the population of the 
Philippines. 

Two more languages — Ilokano, which is widely 
spoken as a lingua franca throughout northern 
Luzon, and Hiligaynon (Ilonggo), another lingua 
franca in the central and southern Philippines — have 
over 7 million speakers each. The native speakers 
of these four languages make up over 70 per cent 
of the population. The six other major 
languages (Standard Bikol, Samar-Leyte, Kapampan- 
gan, Pangasinan, Magindanao and Maranao) have 
numbers of speakers varying from over 3 million for 
standard Bikol to over 1 million for Magindanao, 
with the total number of native speakers of these 
six languages amounting to close to 10 million. In 
addition, all of them are used as lingua francas in 
varying degrees. Native speakers of the ten major 
languages make up about 90 per cent of the 
population of the Philippines. 


4.3.9 Malaysia 


In peninsular Malaysia, Malay is the general language, 
spoken by almost all members of the population in a 
variety of dialectal forms as their first or second lan- 
guage. Standard Malay is the national language and 
most important lingua franca. Bazaar Malay is also 
widespread, but receding before standard Malay. The 
special dialectal forms of Malay known as Para-Malay 
and Aboriginal Malay are used as second languages 
and regional lingua francas by speakers of the Austro- 
Asiatic Aslian languages in many areas. These Aslian 
languages themselves have only local significance. In 
northern Borneo (in Malaysian Sarawak and Sabah, 
and also in Brunei) standard Malay is the national 
language and lingua franca, with Bazaar Malay 
present, but receding. The other languages have only 
local significance, except that with speakers of some 
of the languages of the Kayan-Kenyah and Rejang- 
Baram groups, Iban dialects play a role as languages 
of wider communication. 


4.3.10 Singapore 


The languages of Singapore, where three-quarters 
of the population speak one or another variety of 
Chinese, are discussed in Section 5.1.1.3. 


4.4 Pidgin languages, trade languages and lingua francas 


4.4.1 Philippines, and mainland and insular South-East Asia 


4.4.1.1 Non-indigenous languages 


Chinese is spoken as a first language and as a contact 
language in Singapore and many areas of Malaysia and 
Indonesia. In Singapore and Malaysia, most of the 
Chinese are descendants of immigrants who came 
there from the late eighteenth century onwards. They 
speak Hokkien, Hakka, Cantonese, Teochew or Hain- 
anese as their mother tongue. Hokkien commonly 
became a contact language among Chinese of different 
linguistic backgrounds in parts of Malaysia and Singa- 
pore. However, Cantonese became the contact lan- 
guage in Kuala Lumpur, Ipoh, Seremban and Kuantan. 
Hakka is the contact language in Sabah. More recently, 
Mandarin has become a widespread contact language 
among Chinese in Malaysia and Singapore. 

From the thirteenth century onwards, the 
Chinese traded with what is now Indonesia and some 
of them settled there. Until the mid-nineteenth cen- 
tury, these traders were mainly Hokkien-speaking 
men; many married local women, resulting in 
‘Peranakan’ communities. From the 1860s, there were 
increasing migrations of Teochew, Hakka, Cantonese 
and Hokkien speakers. Hokkien represents about 65 
per cent of Chinese speakers in Java and Madura, and 
47 per cent of the Chinese in Indonesia as a whole, 
the next largest group being Hakka with 13 per cent. 
Hokkien became a lingua franca among Chinese of 
different language groups. It is estimated that there 
are at least 10 million Chinese speakers (as mother 
tongue or contact language) in Malaysia, Singapore 
and Indonesia. The spread of these languages is 
shown in Figure 4.5. 

See also the discussion of the Min/Hokkien and Kejia/ 
Hakka varieties of Chinese in Section 5.1.1 (pages 
160-1). 


4.4.1.2 Non-indigenous-based pidgins and creoles 


Creole Portuguese (Kristang) 


The Portuguese became established in Malacca in 
1511, retaining it until their defeat by the Dutch in 
1641. Creole Portuguese (Kristang or Papia Kristang) 
has survived until the present in two areas of Malacca, 
West Malaysia: Trankera and Hilir. It has some 1,000 
speakers, mostly in Hilir where it is also spoken as a 
second language by some Chinese shopkeepers. Prior 
to the Second World War it was used in services in 
the local Catholic church. Today the use of Papia 
Kristang within the Portuguese Eurasian community 
is declining. 

From Malacca, Creole Portuguese speakers have 
migrated to other parts of Malaysia. Related varieties 
of Creole Portuguese survive in cosmopolitan 
communities in Kuala Lumpur and Singapore with 
several hundred speakers. A further variety existed in 
Penang, at Pulau Tikus, spoken by descendants of 
Portuguese Eurasians from Phuket (Thailand) and 
migrants from Malacca. It is now virtually extinct. 

In Java, Creole Portuguese became the lingua 
franca of seventeenth- and eighteenth-century Batavia 
(Jakarta) owing to the presence of numerous Pidgin/ 
Creole Portuguese-speaking freemen and slaves from 
former Portuguese territories. Creole Portuguese 
survived in Jakarta until recent times at Tugu, but is 
now virtually extinct. 


Creole Spanish 


Three main dialects of Creole Spanish (Chavacano) 
are spoken in the Philippines. The language has more 
than 300,000 speakers. Two dialects of Chavacano are 
spoken in Cavite Province, principally at Cavite (where 
the dialect is known as Caviteño) and Ternate (this 
dialect sometimes being referred to as Ternateño). 

In the southern Philippines a further dialect of 
Chavacano is spoken in the Zamboanga-Basilan area 
(where the dialect is known as Zamboangueño), with 
offshoots in Cotabato and Davao. 


4.4.1.3 Indigenous-based pidgins and creoles 


Bazaar (Pasar, Bazar) Malay 


Malay in various pidginized, creolized and simplified 
varieties, generally known as Bazaar Malay (Melayu 
Pasar), has been in wide use as a trade language 
throughout Malaysia and Indonesia for nearly five 
centuries. It is spoken by many millions. This lan- 
guage is widespread in Irian Jaya, though rapidly 
giving way to Indonesian. It extends into north- 
western New Guinea. The language is based on Malay 
and was formerly used throughout the Dutch East 
Indies, with a very large number of speakers. The 
number of its speakers in Irian Jaya today may run 
into well over 100,000. It may have about 2,000 
speakers in north-western Papua New Guinea today, 
though around the turn of the century this figure was 
considerably higher. 


Baba Malay (Straits Malay, Chinese Malay) 


Baba Malay is the term for the creole Malay language 
spoken by the Babas, the Straits-born Chinese, of 
Malacca, Penang and Singapore. The Babas are the 
descendants of the earliest Chinese settlers to the 
Malay peninsula who came largely from the southern 
Chinese province of Fukien. These migrants began 
arriving in the fifteenth century and in the absence of 
Chinese women they married Malays. The first major 
Chinese settlements were in Malacca and today it is 
still the most concentrated pocket of Babas in the 
Malay peninsula. 

Figure 4.5 Chinese speakers in Island South-East Asia 

Baba Malay has its basis in Hokkien and Malay, 
and is spoken by some 5,000 people, principally in 
Malacca. Today, however, it is in decline as a result of 
outmarriage from the Baba community and extended 
use of standard Malay. 


Chinese Indonesian 


Chinese Indonesian (Peranakan Indonesian, Baba 
Indonesian) is a creole Indonesian spoken by the Per- 
anakan, descendants of Chinese traders who settled in 
Java from the beginning of the seventeenth century 
and married Javanese women. These traders came 
largely from the Fukien Province and appear to have 
spoken a form of Bazaar Malay. Chinese Indonesian 
has its basis in Indonesian and Javanese. It has many 
thousands of speakers in East and Central Java. 


Davavenyo 


This language, according to Whinnom (1956: 16), is a 
synthesis of Tagalog, Cebuano and other Visayan 
dialects. It is different from Davaweño, which is a 
dialect of Chavacano (see Creole Spanish, p. 103). It 
is reported to contain some Spanish lexical items. The 
1990 Census records nearly 150,000 speakers. 


4.4.1.4 Non-pidginized indigenous lingua francas 


Ternatese (Papuan) 


The language of Ternate Island in the western part of 
the Halmahera region, which today has 42,000 native 
speakers, is used as a lingua franca in the northern 
and north-eastern parts of the Halmahera area. It is 
spoken by at least 20,000 people as a second language. 


Tidorese (Papuan) 


The language of Tidore Island in the western part of 
the Halmahera region, which today has 26,000 native 
speakers, is used as a lingua franca in the southern 
and eastern parts of the Halmahera area. It is spoken 
by at least 20,000 people as a second language. 


Tetun (Tetum) (Austronesian) 


Tetun, an Austronesian language, which has over 
300,000 native speakers, is used as a lingua franca 
throughout East Timor [Timor Leste] (except in the 
extreme east of the island) and in frontier areas of 
West Timor. It is spoken by many thousands as a 
second language. 


Ngaju (Austronesian) 


Owing to its use by the administration and the 
church missions, Ngaju (Ngaju Dayak), a Barito 
language of Borneo, is widespread in southern Kali- 
mantan in a large area between the Barito and Sampit 
rivers as a lingua franca, and is understood even to 
the east of the Barito, and to the north in the Melawi 
River region. It has 250,000 native speakers. 


Biak (Austronesian) 


The language of the Biak and Numfor Islands in the 
Geelvink Bay of north-western Irian Jaya, which 
today has 100,000 native speakers, was in use as a 
lingua franca in that area in pre-contact days and has 
since extended its currency to the northern part of 
the Vogelkop Peninsula, the Raja Empat Islands to the 
west of that peninsula, and even further to the south- 
west. It is spoken by quite a few thousand people as a 
second language. 


4.4.2 Oceania and Australia 


This commentary is based on the sources quoted in 
the bibliography, numerous other sources, and per- 
sonal observations. 


4.4.2.1 General remarks 


On Map 25, metropolitan languages (English, French, 
Indonesian, German, Japanese), used in varying 
degrees in the present or past as lingua francas, have 
not been shown. 

In the light of this, only three different types 
of languages are shown on this map: (1) non-Pacific- 
based pidgins (and creoles), English-, French-, Hindi- 
and Japanese-based; (2) native-based pidgins; (3) 
missionary and church lingua francas and non- 
pidginized indigenous lingua francas. Only languages 
still living today have been shown and discussed, with 
the important exceptions of New Caledonian Pidgin 
French and the Toaripi Hiri and Koriki Hiri trade 
pidgins in southern Papua New Guinea. 


4.4.2.2 New Guinea area 


Non-Pacific-based pidgins and creoles 


New Guinea Pidgin, officially called Tok Pisin 
(English-based) 

This was formerly also known as Neo-Melanesian. Its 
forerunner stabilized from an English-based contact 
jargon of the Pacific Pidgin English type on planta- 
tions in Samoa for which labour was recruited from 
New Britain in the early 1880s, and later repatriated 
there. The language spread rapidly through German 
New Guinea and today is widespread throughout 
Papua New Guinea and extends some way into north- 
eastern Irian Jaya. It is by far the more widely used of 
the two major general languages of Papua New Guinea 
(disregarding English). First-language speakers of it 
are estimated to number approximately 50,000, with 
well over 2,000,000 second-language speakers. Tok 
Pisin is very highly developed and has all the features 
of a creole rather than those of a pidgin language. 
It has two main sociolects: rural and urban. The 
earlier regional dialect differences between Highlands, 
Lowlands and Islands New Guinea Pidgin have now 
largely disappeared. 


Native-based pidgins 


Pidgins based on Austronesian languages 


Hiri Motu (Police Motu) This language is wide- 
spread in the southern and south-eastern part of 
Papua New Guinea. Based on the Austronesian Motu 
language of the Port Moresby area, it developed late 
in the nineteenth century as the language of the 
police force in British New Guinea (later the Territory 
of Papua) and today is the less widely used of the 
two major general languages of Papua New Guinea 
(disregarding English). It has under 200,000 speakers. 


Pidgin Dobu (Gosiagu Talk) In recent decades, a 
pidginized form of the Dobu language of the 
D’Entrecasteaux Islands has come to be used as the 
language of the labour line in the Milne Bay Province 
and in areas adjacent to it. It is known as Gosiagu 
Talk. The number of its speakers may be estimated at 
several thousand. 


Pidgins based on Papuan languages 


The Hiri trading languages of the Motu. The Motu, 
who carried out annual trading expeditions, called 
hiri, to the Papuan Gulf from the Port Moresby area, 
used two different pidgin and contact languages in 
their contacts with the Papuan speakers of the Gulf 
area. One was used with the speakers of Toaripi and 
other languages of the Eleman family (see Map 32) 
and was based on those languages; the other was used 
with speakers of the Purari (Koriki, Namau) language 
(see Map 32). It constitutes essentially a simplified 
form of that language. Today both these languages are 
virtually extinct. 

The Yimas Pidgins are based on the Yimas language 
of the Pondo Family in the East Sepik Province, and 
are used by Yimas speakers as a means of wider 
communication with their Karawari-, Alamblak- and 
Arafundi-speaking neighbours in three corresponding forms. 

The Iatmul jargon is a widespread pidgin lan- 
guage used as a contact language in the Middle Sepik 
area of the East Sepik Province. 

Dani Police Talk is a Dani Pidgin used pre- 
dominantly by the police force in the central 
highlands area of Irian Jaya. 


Missionary and church lingua francas, and non-missionary indigenous lingua francas 

Kate (Papuan) 

The Wemo dialect of the Kate language of the eastern 
part of the Huon Peninsula was adopted by the 
Lutheran Mission as a missionary and church lan- 
guage in the last decade of the nineteenth century. 
Subsequently, its use spread through the Huon 
Peninsula and was introduced into the highlands 
areas from the 1930s. It has largely disappeared from 
the highlands areas today, and has been receding 
elsewhere before New Guinea Pidgin and English. 
The number of the original speakers of the Wemo 
dialect of Kate at the end of the nineteenth century 
was approximately 600. Today, the number of indi- 
genes with an active command of Kate is estimated 
to be about 75,000, with a further 40,000 having a 
passive knowledge of it. It has also become a non- 
missionary indigenous lingua franca in eastern parts 
of the Huon Peninsula. 


Yabém (Austronesian) 


The Yabém language was originally spoken in two 
eastern coastal villages on the Huon Peninsula by 
about 900 people. In the mid-1880s it was adopted as 
a missionary and church language by the Lutheran 
Mission, and its use later spread through Austro- 
nesian-speaking areas of the Huon Gulf region. Its 
use has been rapidly receding in recent decades before 
New Guinea Pidgin and English, though the number 
of indigenes with an active knowledge of it is estimated 
to be still about 25,000, with many more possessing a 
passive knowledge of it. 


Bel (Gedaged) (Austronesian) 


Missionaries of the Lutheran Mission first started 
working in the Gedaged language of the Madang 
Harbour area in 1889 (the name Bel has been gener- 
ally applied to the language only since 1967). How- 
ever, it was not until 1924 that it was adopted as the 
Lutheran missionary and church lingua franca for the 
Austronesian-speaking parts of the area; in 1930, its 
use was extended to the Papuan-speaking parts of the 
area for which the Papuan Amele had been adopted 
as the missionary and church lingua franca in 1923. 
In 1956, the use of Gedaged was abandoned in 
teacher training in favour of New Guinea Pidgin, 
though its use in theological training continued 
until 1962. In the mid-1950s, the estimated number 
of indigenes who had an active command of it 
was about 20,000. Today, it may not be more than 
12,000. 


Tolai (Kuanua) (Austronesian) 


The Kuanua language (today called Tolai) of the 
Gazelle Peninsula and surrounding areas of New 
Britain was adopted as a missionary and church lingua 
franca by the Methodist (then Wesleyan) Mission 
soon after the establishment of the mission there in 
1875-80. Its use spread throughout New Ireland as far 
as the islands of Lihir, Tabar and Lavongai, through 
the Duke of York Group, and also to the Nakanai area 
on the north-eastern coast of New Britain. During the 
height of its use, about 15,000 indigenes used Kuanua 
as a second language (its native speakers at that time 
numbered about 45,000). Its use as a missionary and 
church lingua franca has since receded before Pidgin 
and English, but it has gained importance as a non- 
missionary indigenous lingua franca. 


Gogodala (Papuan) 


In 1935, the Gogodala language spoken to the north 
of the Lower Fly River and on the Aramia river in 
south-western Papua New Guinea was studied and 
reduced to writing by missionary linguists of the 
Unevangelized Fields Mission (now Asia Pacific 
Christian Mission). All pastoral training at Balimo in 
the centre of the Gogodala area was carried out in 
Gogodala, and men from neighbouring tribes who 
came to be trained learned the language and took it 
back home with them. Also, Gogodala pastors went to 
the Bituri Creek area in the north-east of the Trans- 
Fly area and spread the knowledge of the language 
there. All this resulted in several thousand non- 
Gogodala learning Gogodala as a second language 
(the number of its native speakers today is about 
10,000). With the opening of more and more pastor 
training centres using local languages, the importance 
of Gogodala as a missionary and church lingua franca 
has been decreasing, but it has gained importance as 
a non-missionary indigenous lingua franca in the 
Lower Fly area. 


Motu (Austronesian) 


The Motu language of the Port Moresby and nearby 
areas, which today has about 15,000 native speakers, 
was adopted as a missionary and church language by 
the London Missionary Society in the 1870s. At first, 
the Motu people taught the missionaries Pidgin Motu 
as this was their language for use with outsiders. 
However, this was gradually replaced by Motu proper. 
With the early expansion of missionary work outside 
Motu-speaking villages, with Motu as the medium, 
the language became a missionary and church lingua 
franca. It continued to be used for theological train- 
ing until 1970, when it was replaced by English. Its 
use as a missionary and church lingua franca has been 
receding before English in recent years. 

The Jehovah’s Witnesses also adopted Motu as a 
missionary and church lingua franca, but replaced it 
by Hiri Motu (Police Motu) in 1966-7. 

Motu has also become a non-missionary 
indigenous lingua franca in the area around that of 
the Motu tribe. People who have an active command 
of it as a second language are estimated to number 
10,000-12,000, with several thousand more having a 
passive knowledge of it. 


Kiwai (Papuan) 

The Island Kiwai dialect of the Southern Kiwai 
language — in the Fly Delta, and eastern and south- 
eastern coastal areas of the Trans-Fly — was adopted as 
a ‘high Kiwai’ by the London Missionary Society 
around 1900 for missionary and church use in the 
areas of other dialects of Southern Kiwai, and of 
neighbouring Kiwaian languages to the north. With 
Scripture translations and other missionary literature 
becoming available in the 1940s and 1950s in some 
other Kiwaian languages to the north, the use of 
Island Kiwai as a missionary and church language 
ceased in those areas. The emergence of the Southern 
Coastal Kiwai and Daru Kiwai dialects of Southern 
Kiwai as indigenous lingua francas amongst Kiwaian- 
and non-Kiwaian speakers of the area made further 
inroads into the currency of Island Kiwai as a mis- 
sionary and church language. Before the 1940s, there 
may have been over 5,000 non-Island Kiwai dialect 
speakers who had at least a full passive knowledge of 
the Island Kiwai dialect, as well as about 4,500 native 
speakers, but the first figure is now likely to be con- 
siderably lower. 


Dobu (Austronesian) 


The Dobu language of the D’Entrecasteaux Islands 
area — to the north-east of the tail-end of the main- 
land of New Guinea — was, in pre-contact days, in 
limited use as an indigenous lingua franca in the 
framework of the Kula cultural trade activity, and 
its knowledge, on the part of some members of 
the indigenous population, extended through the 
D’Entrecasteaux Archipelago and as far as the 
Trobriand Islands. It was adopted as a missionary and 
church lingua franca by the Methodist Mission in the 
last decade of the nineteenth century. However, it 
never really achieved that status against the pressure 
of local languages, and has, as a missionary and 
church language, been receding before English even 
in the Dobu area itself in recent years. It is, however, 
still used as a non-missionary lingua franca to a 
limited extent. The number of indigenes having an 
active command of Dobu as a second language may 
never have exceeded a few thousand; the language 
itself has today about 8,000 native speakers. 


Wedau (Austronesian) 


The Wedau language, which is spoken on the north 
coast of the eastern tail-end of the mainland of New 
Guinea and today has 3,000 native speakers, was 
adopted as a missionary and church lingua franca by 
the Anglican Mission in the 1890s. It has been 
functioning as such in some of the Austronesian language 
areas near the Wedau area, but has receded before 
other local languages and especially before English. 
The number of indigenes having an active command 
of it as a second language may never have exceeded a 
few thousand. 


Binandere (Papuan) 


The Binandere language, with 3,300 speakers in 
northern coastal areas of the south-eastern part of 
Papua New Guinea, was first used by the Anglican 
Mission as a missionary and church language in the 
first decades of the twentieth century. Translations of 
Scripture into it — and into a closely related language, 
Notu or Ewage, spoken further east — were used by 
speakers of most Binanderean languages, which gave 
these languages the status of missionary and church 
languages amongst them. Quite a few thousand 
indigenes have at least a passive knowledge of one of 
them as a second language. 


Suau (Austronesian) 

In pre-contact days the Suau language, on the south 
coast of the eastern tail-end of the mainland of 
New Guinea and on near-coastal islands, had about 
4,000-5,000 native speakers and was used as a trade 
language by perhaps 2,000-3,000 more. It was 
adopted as a missionary and church lingua franca by 
the London Missionary Society in the 1890s and it 
was widely used. In recent years, it has receded 
before English, but it also plays a role as a non- 
missionary lingua franca. Today, it is estimated to 
have roughly 20,000 first- and second-language 
speakers. 


Toaripi (Papuan) 

The Toaripi language of the Papuan Gulf area of 
Papua New Guinea currently has 23,000 native 
speakers, with 15,000 speaking the Toaripi (proper) 
dialect. The latter was adopted as the missionary 
and church lingua franca for the area by the 
London Missionary Society in the 1880s. The lan- 
guage was used successfully amongst the Eastern 
Eleman people, whose dialects and languages are 
close to the Toaripi (proper) dialect. Its introduction 
amongst the Western Eleman people was not 
very successful, and its use gradually diminished 
amongst them, with translations of Scripture 
becoming available in Orokolo, the major Western 
Eleman language, from the mid-1920s. In its hey- 
day, the number of indigenes who had at least a 
passive command of Toaripi as a second language 
may have been well over 5,000. In the last decade 
or so, Toaripi and Orokolo have also become non- 
missionary indigenous lingua francas in the Eastern 
and Western Eleman areas. 


4.4.2.3 Solomon Islands area 


Non-Pacific-based pidgins (and creoles) 


Solomon Islands Pidgin (Pijin) (English-based) 


Solomon Islands Pidgin, a variety of Melanesian 
Pidgin English, became stabilized in its present form 
in the early 1920s, based on an earlier plantation 
pidgin of the Queensland/South Pacific Pidgin 
English type. It is today in use throughout the 
Solomon Islands and has about 250,000 speakers. 
Creolization is still at a very low level: only 1,300 
Solomon Islanders were known to speak Solomon 
Islands Pijin as their first language at independence in 
1978. 


Missionary and church lingua francas (all Austronesian) 


Babatana (Choiseul Island) 


This language, which is actually a dialect of East 
Choiseul and today has 5,000 native speakers, was 
adopted by the United Church (Methodist) around 
1915 for use in the New Georgia Archipelago and on 
Vella Lavella. About 6,000 Solomon Islanders have a 
knowledge of Babatana as a second language. 


Roviana (New Georgia Archipelago) 


This language, which today has 5,500 native speakers, 
was adopted by the United Church (Methodist) 
around 1915 for use in the New Georgia Archipelago. 
About 16,000 Solomon Islanders have a knowledge of 
Roviana as a second language. 


Marovo (New Georgia, Marovo Lagoon Area) 


This language, which now has 5,000 native speakers, 
was adopted by the Seventh Day Adventists in the 
1920s for use in parts of the New Georgia 
Archipelago. About 3,000 Solomon Islanders have a 
knowledge of it as a second language. 


Maringe (Santa Isabel) 

This language, which has 5,000 native speakers, was 
adopted by the Melanesian Mission (now the Church 
of Melanesia, Anglican) for use in parts of Santa 
Isabel. About 2,000 Solomon Islanders have a know- 
ledge of it as a second language. 


Gela (Florida) 


This language, which now has 6,000 native speakers, 
was adopted by the Melanesian Mission (now the 
Church of Melanesia, Anglican) in the 1880s for 
use in Florida, Savo and parts of Santa Isabel and 
Guadalcanal. About 1,000 indigenes have a know- 
ledge of it as a second language. 


Ghari (Guadalcanal) 


As a dialect of West Guadalcanal, this language today 
has 5,000 native speakers; it was adopted by the 
Roman Catholic Church in the 1900s for use in 
Guadalcanal and Malaita. It is estimated that about 
5,000 Solomon Islanders have a knowledge of it as a 
second language. 


Kahua (San Cristobal) 


The Kahua dialect of this language was adopted by 
the Roman Catholic Church as the standard form of 
Kahua in the various dialect areas of Kahua, which 
has a total of 4,000 speakers, with the Kahua dialect 
proper spoken by 1,700 indigenes. 


Mota (north-eastern Vanuatu) 


The Mota language — of the island of Mota in the 
Banks Islands in north-eastern Vanuatu — which 
today has 700 native speakers, was adopted by the 
Melanesian Mission (now the Church of Melanesia, 
Anglican) in the 1860s as a general missionary 
and church lingua franca for use in the then New 
Hebrides and the south-eastern and eastern Solo- 
mons. Its use continued until 1931, when it was 
replaced by English. At its peak, about 10,000 
indigenes in the then New Hebrides and in the 
south-eastern Solomon Islands had a knowledge of 
it as a second language. Today their number is 
dwindling. 


Non-missionary lingua franca 


Kwara'ae (Malaita) 


This language, which today has 33,000 native 
speakers, has been carried diffusely through most of 
the Solomon Islands area with Malaitan expansion 
and emigration. Many other Solomon Islanders have 
acquired some knowledge of it as a second language, 
and they may number 6,000-8,000 today. 


4.4.2.4 Vanuatu, Loyalty Islands and New Caledonia 


Non-Pacific-based pidgins (and creoles) 


Bislama (Bichelamar) (English-based) 


Pacific Pidgin English was first introduced into what 
was then the New Hebrides (now Vanuatu) in the 
1830s and its use increased through the decades 
until it became stabilized in the expanded form 
known as Bislama (Bichelamar) between 1890 and 
1910 in a number of regional dialectal forms which 
persisted strongly until the 1960s, but became less 
and less marked from that time onwards. The lan- 
guage is widespread throughout Vanuatu as the 
major general language and today has about 190,000 
speakers. Its creolization has begun in urban centres, 
but is at a very low level, with between 5,000 and 
10,000 speakers. 


New Caledonian Pidgin French 


It has been thought that New Caledonian Pidgin 
French, which developed in New Caledonia after the 
establishment of French rule in 1853, owed its origin 
to a relexification of New Caledonian Pidgin English, 
but this view has been challenged, and the two lan- 
guages are known to have co-existed in different parts 
of New Caledonia for many years. This Pidgin French 
gradually fell into disuse and is now extinct, replaced 
by New Caledonian French, itself a variety of metro- 
politan French. 


Missionary lingua francas (all Austronesian) 


Mota 


The use of the Mota language by the Melanesian 
Mission (now the Church of Melanesia, Anglican) has 
been mentioned in Section 4.4.2.3 on the Solomon 
Islands. A major area of its use was in what today is 
Vanuatu. There is still a large number of ni-Vanuatu 
there who have a knowledge of it as a second lan- 
guage, though it has not been in use as a missionary 
and church lingua franca since 1931. 


Wailu (Houailou, Ajié) 

The Wailu language of New Caledonia, which used to 
be called Houailou and is now correctly named Ajié, 
was adopted by the Eglise Protestante as a missionary 
and church lingua franca about 1900. It has 5,000 
native speakers, and about 2,000 New Caledonian 
indigenes have a knowledge of it as a second 
language. 


4.4.3 Fiji 


4.4.3.1 Non-Pacific-based pidgins (and creoles) 


Pidgin Hindi 

A pidginized form of Hindi is used in Fiji today for 
communication between some Indians, and also for 
communication between Indians and Fijians, some 
of whom have a knowledge of this language. The 
number of its speakers may be around 50,000. 


4.4.3.2 Native-based pidgins 


Modern Pidgin Fijian 


A modern pidginized form of Fijian is used in Fiji 
today for intercommunication between Indians and 
Fijians. The number of its speakers may be around 
50,000. 


4.4.4 Polynesia 
4.4.4.1 Non-Pacific-based pidgins (and creoles) 


Hawaiian Creole English (Hawaiian Pidgin) 


In the 1870s, an English-based pidgin developed in 
Hawaii. It became the lingua franca of the Hawaiian 
Islands for about half a century, developing towards 
creolization and also expanding in the direction of 
English. It constitutes a group of co-existent systems: 
an English-based pidgin which is now little used, an 
English-based creole and a non-standard dialect of 
English. If all these are included, it has about 600,000 
speakers. 


Hawaiian Japanese Pidgin English 


A now dying Japanese Pidgin English is spoken by 
immigrant workers on Hawaiian plantations. 


4.4.5 Tahiti area 
4.4.5.1 Native-based pidgins 


Pidgin Tahitian 


A pidginized form of Tahitian (called Parau Tinito) 
is used mainly for intercommunication between 
Chinese and Tahitians. The number of speakers is 
rapidly dwindling and is perhaps no more than 2,000 today. 


4.4.5.2 Non-missionary lingua franca 


Tahitian 


Tahitian is used as a lingua franca through much of 
the area adjacent to the Society Islands in the south 
and east. It has been gradually replacing the Tubuai 
language in the Austral Islands, and Pa'umotu in the 
Tuamotuan Archipelago. It has about 117,000 native 
speakers, and about 15,000 people may have a knowledge 
of it as a second language. 
4.4.6 Pitcairn Island, Norfolk Island 


4.4.6.1 Non-Pacific-based pidgins (and creoles) 


Pitcairn Island, to the south-east of Mangareva in 
eastern Polynesia, was settled by HMS Bounty 
mutineers in 1790 together with their Polynesian 
wives. A special form of English creole developed, 
which contains a number of Polynesian elements. It 
has under 100 speakers today on Pitcairn Island. 

In 1856, the Pitcairn Islanders were all moved 
to Norfolk Island, between New Zealand and New 
Caledonia. Some returned to Pitcairn Island in 1858, 
followed by others in 1862. There the Pitcairn language 
still survives today, moving in the direction of a 
non-standard English. 

Population numbers on Norfolk were swelled by 
later settlers from Australia and New Zealand. Today, 
‘Norfolk’ is moribund on Norfolk Island, in spite 
of recent attempts to revive it and introduce it in 
schools. 


4.4.7 Australia (and Tasmania) 
4.4.7.1 Non-Pacific-based pidgins and creoles 


Australian Pidgin English 


From the early days of European settlement in 
Australia, a form of Pidgin English was used for 
communication between the settlers and Australian 
Aborigines, with eastern and western Australian local 
differences. A version of it may also have been used in 
Tasmania before the extinction of the aboriginal 
population in the second half of the nineteenth century, 
and on Flinders Island in Bass Strait, where 
the last surviving Tasmanian Aborigines had been 
taken. In recent decades, the use of this language has 
declined rapidly with the spread of the knowledge of 
English amongst the aboriginal population. It is still 
used extensively in the north of Australia and in some 
isolated areas elsewhere. At the same time, forms of 
Australian Pidgin English and Australian Aboriginal 
English have become stabilized and creolized in some 
areas (see below). 


Australian Creole 


Australian Pidgin English (and Australian Aboriginal 
English) has become creolized in an expanded form 
in a number of centres during the last twenty years or 
so. A number of varieties exist, some of them (especially 
in the north) close to Australian Pidgin; others 
(especially in the south) have moved in the direction 
of non-standard English. Eastern and Western Australian 
varieties can be distinguished. The number of 
speakers of varieties of Australian creoles is well over 
20,000. 


Queensland Canefields English 


This language developed from Pacific Pidgin English 
amongst indentured labourers from the south-western 
Pacific on the Queensland sugar-cane plantations in 
the second half of the nineteenth century, when it had 
many thousands of speakers. Most of the islanders 
were repatriated after 1900, but a few stayed behind 
and continued using their language, referred to as 
Kanaka English, for some time. It is now moribund, 
remembered only by a few families in Bowen and 
Mackay, in north Queensland. 


Torres Strait Pidgin English 


This, also called Broken, has been in use since the 
nineteenth century and has several thousand speakers. 
It differs from Pacific Pidgin English and apparently 
also from some forms of Papuan Pidgin English. 


Japanese Pidgin English 


A variety of Japanese Pidgin English is spoken in the 
Japanese pearling area at Broome, in north-western 
Australia. 


4.4.7.2 Missionary and church lingua francas and non-missionary indigenous lingua francas 


Gunwinygu 


The Gunwinygu language of northern central Arnhem 
Land was adopted by the Oenpelli Mission for use in 
the Gunwinygu and adjacent areas. It has also become 
used to some extent as a non-missionary lingua 
franca in the region. It now has 900 native speakers, 
and about 1,000 Aborigines have a knowledge of it as 
a second language. 


Gubabuyngu (Dhuwal) 


Gubabuyngu, a form of Dhuwal, in the north-eastern 
corner of Arnhem Land, was adopted by the 
Milingimbi Mission for use in that area, which is 
occupied by very closely related languages. It has, at 
the same time, become a non-missionary lingua 
franca in coastal areas further west, where many 
speakers of it have moved. It now has 500 native 
speakers, and about 1,000 Aborigines have a knowledge 
of it as a second language. 


Pitjantjatjara 


Pitjantjatjara, one of the major languages of the 
Western Desert area of the Australian continent, was, 
in a southern form, adopted by the Ernabella Mission 
as a lingua franca over a considerable area. It has since 
gained much added importance as a non-missionary 
lingua franca over a large portion of the western part 
of Australia. It has over 2,000 native speakers, and 
about 2,000 Aborigines have a knowledge of it as a 
second language. 


Murrinhpatha 


Since 1935, when the Port Keats Mission was 
established, Murrinhpatha has been used as a lingua 
franca in this linguistically diverse area, with some 
encouragement from local missionaries. 


Walmadjari 


This lingua franca in the northern portion of the 
western part of Australia now has 1,000 native 
speakers, and about 1,000 Aborigines have a knowledge 
of it as a second language. 


Kuku Yalandji (also spelled as Gugu Yalandyi) 


A lingua franca in the south-eastern part of Cape 
York Peninsula, it now has about 300 native speakers, 
and about 200 Aborigines have a knowledge of it as a 
second language. 


Wik Munkan 


This lingua franca in western central Cape York 
Peninsula now has about 600 native speakers, and 
about 200 Aborigines have a knowledge of it as a 
second language. 


4.5 Indigenous languages 


4.5.1 Polynesia 


The Polynesian languages (Map 26) constitute a 
special type of Oceanic Austronesian in which 
particles play a major role. Their possession marking 
differs from that of Micronesian and other Oceanic 
Austronesian languages and is based on a dichotomy 
of close versus not-close possession, with two contrasting 
particles. Their phonologies are very simple. 
Samoan is characterized by the co-existence of very 
different formal and casual speech styles. The Fijian 
languages are typical Oceanic Austronesian languages 
of the type found also further west. Rotuman is 
aberrant, especially in its phonological system. 

Within the Fijian and Polynesian groups, the 
differences between language groupings and languages 
decrease in an easterly direction. Hawaiian 
and Maori are languages of communities whose 
members increasingly use English. 


Table 4.1 Map 26: Polynesia — Languages and 
numbers of speakers 


Central Pacific Languages 


Rotuman 
Rotuman 9,000 
Fijian 
Western Fijian 57,000 
Eastern Fijian 333,441 
Polynesian 
Tongic 
Tongan 123,000 
Niue 14,000 
Nuclear Polynesian 
Samoic-Outlier 
Samoan 426,394 
Niuafo'ou 690 
Tokelau 4,500 
Tuvalu 11,000 
East Uvea 17,000 
East Futuna 8,000 
Pukapuka 840 
Rennell-Bellona 3,570 
Pileni 1,680 
Anuta 360 
Tikopia 3,550 
West Uvea 1,870 
Emae 200 
Mele-Fila 2,000 
Futuna-Aniwa 600 
Sikaiana 920 
Luangiua 1,800 
Nukumanu 200 
Takuu 250 
Nukuria 200 
Kapingamarangi 3,000 
Nukuoro 860 
Eastern Polynesian 
Easter Island 2,500 
Tahitian 125,000 
Tongareva 600 
Rapa 521 
Austral 8,000 
Cook Islands Maori 43,000 
Manihiki-Rakahanga 5,000 
Pa'umotu 14,400 
New Zealand Maori 100,000 
Moriori (extinct) 
Mangareva 1,600 
North Marquesas 3,400 
South Marquesas 2,100 
Hawaiian 1,000 


4.5.2 New Caledonia 


The languages of New Caledonia (Map 27) are 
Oceanic Austronesian, but are quite aberrant in their 
phonologies, vocabularies and grammars. Their 
phonological systems are exceedingly complex, and 
some are tonal. 


4.5.3 Vanuatu (New Hebrides) 


The languages of Vanuatu (Map 27), which are all 
Austronesian, are in part typical Oceanic Austronesian 
languages showing the characteristic dichotomy 
of different possessive marking for inalienable 
and alienable possession, with the latter subdivided 
into several classes. Some languages, especially in 
eastern Espiritu Santo, Malekula and on the southern 
islands, are aberrant in their complex phonological 
systems, their vocabularies and in part also in 
features of their grammars. The Polynesian Outliers 
are Nuclear Polynesian languages of the Samoan 
type. 
Table 4.2 Map 27: Vanuatu and New Caledonia — 
Languages and numbers of speakers 


Austronesian Languages 
North and Central Vanuatu 
North Vanuatu 
Hiw 120 
Loh 315 
Lehali 150 
Lehalurup 100 
Motlav 1,275 
Mota 450 
Vera'a 275 
Vurés 1,050 
Mésén 20 
Lemerig 10 
Nume 450 
Koro 105 
Wetamut 100 
Lakona 300 
Merlav 1,350 
Marino 180 
Central Maewo 350 
Baetora 550 
North-east Ambae 5,000 
Nduindui 4,500 
Sakao 1,500 
Lorediakarkar (extinct) 
Shark Bay 225 
Butmas-Tur 525 
Polonombauk 225 
Mafea 50 
Tutuba 150 
Aore (extinct) 
Tamambo 4,000 
Tambotalo 75 
Narango 160 
Tangoa 375 
Araki 105 
Wailapa 100 
Amblong 150 
Akei 650 
Kiai 450 
Roria 150 
Morouas 150 
Merei 150 
Navut 525 
Malmariv 150 
Wusi 300 
Tasmate 160 
Nokuku 160 
Cape Cumberland 2,400 
Vunapu 375 
Piamatsina 150 
Tolomako 450 
Central Vanuatu 
Raga 2,300 
Apma 7,800 
Sowa (extinct) 
Ske 600 
Sa 2,500 
North Ambrym 5,250 
Lonwolwol 1,200 
Dakaka 600 
Port Vato 750 
South-east Ambrym 3,700 
Paama 6,000 
Lendamboi 800 
Nioleien 90 
Aveteian 10 
Nasarian 20 
Ninde 1,100 
Naati 18 
Sinesip 600 
Naha'ai 1,100 
Nasvang 275 
Axamb 750 
Avok 500 
Maskelynes 1,100 
Port Sandwich 1,200 
Nisvai 100 
Banam Bay 900 
Aulua 750 
Pangkumu 400 
Unua 400 
Navava 675 
Nevwervwer 1,250 
Neve'ei 750 
Naman 20 
Larevat 675 


Marakhus (Tape) 15 
Big Nambas 3,350 
Uripiv-Wala-Rano-Atchin 9,000 
Dirak 900 
Vao 1,900 
Mpotovoro 430 
Vovo 475 
Malua Bay (Petarmul) 500 
Lamen 850 
Lewo 2,200 
Bierbo 800 
Baki 350 
Maii 175 
Bieria 25 
Namakir 3,750 
Nakanamanga 9,500 
South Efate 6,000 
Southern Vanuatu 
Erromanga 
Sie 1,900 
Ura 
Tanna 
Kwamera 3,500 
Whitesands 7,500 
North Tanna 5,000 
Lenakel 11,500 
South-west Tanna 5,000 
Aneityum 
Aneityum 900 
New Caledonia and Loyalty Islands 
Far North 
Yalayu 1,400 
Kumak 900 
Caac 750 
Yuaga 2,000 
North 
Jawe 900 
Pwapwa 130 
Nemi 600 
Pwaamei 325 
Voh-Kone 300 
Fuai 1,000 
Pije 100 
Central North 
Cemuhi 2,500 
Central South 
Paicî 5,500 
South 
Arho 50 
Arha 250 
Ajié 5,000 
Orowe 750 
Neku 200 
Neré 20 
Tiri 600 
Xârâcùù 3,500 
Xârâgurè 950 
Far South 
Dubea 1,400 
Numeéé 1,800 
Loyalty Islands 
Iaai 2,200 
Drehu 15,000 
Nengone 6,050 
Polynesian Outliers 
Emae 400 
Ifira-Mele 3,500 
Futuna-Aniwa 1,500 
Faga-Uvea 1,870 


4.5.4 Micronesia 


The Micronesian languages (Map 28) are nearly all 
members of the Oceanic Austronesian subgroup, of 
a special type with complex possession marking 
and complicated phonological systems. The Polynesian 
Outliers are Nuclear Polynesian languages of 
the Samoan type. The non-Oceanic Micronesian 
Austronesian languages (Chamorro and Palauan) 
tend towards the Philippine language type. 


Table 4.3 Map 28: Micronesia — Languages and 
numbers of speakers 


Oceanic Austronesian Languages 
Micronesian 
Trukic 
Western Trukic (Ulithic) 
Sonsorolese 600 


Ulithian 3,000 
Woleaian 1,631 
Mapia (extinct) 
Central Trukic (Carolinian) 
Satawalese 458 
Saipan Carolinian 3,000 
Puluwat 1,364 
Namonuito 944 
Eastern Trukic 
Trukese 38,341 
Mortlockese 5,904 
Pááfang 1,318 
Ponapeic 
Yapese 6,592 
Ponapean 27,700 
Pingilapese 2,500 
Mokilese 1,050 
Kosraean (I) 
Kosraean 6,900 
Marshallese (I) 
Marshallese 43,900 
Gilbertese 
Gilbertese 58,320 
Northern Gilbertese 2 
Nauruan (I) 
Nauruan 6,000 
Polynesian 
Nukuoro 860 
Kapingamarangi 3,000 


Non-Oceanic Austronesian Languages 
Chamorro-Palauan 


Chamorro (I) 

Chamorro 90,000 
Palauan (I) 

Palauan 15,000 


4.5.5 Solomon Islands and Bougainville 
(Papua New Guinea) 


The Austronesian languages of the area (Map 29) are 
in part fairly typical of Oceanic Austronesian, showing 
the characteristic dichotomy in possessive marking 
for inalienable and alienable possession, with the 
latter subdivided into several classes. Some languages, 
especially those of the Utupua, Vanikoro, West Isabel, 
East Isabel and Choiseul groups, and to some extent 
those of the Bougainville area, show unusual vocabularies 
and phonological systems. The Polynesian Outliers 
are Nuclear Polynesian languages of the Samoan 
type. The Papuan languages (North Bougainville, 
South Bougainville, Central Solomons and Reefs-Santa 
Cruz groups) mostly show elaborate gender 
and class systems with concordance, number marking 
with nouns and highly elaborate verb morphologies. 
The languages are very highly agglutinating. 


Table 4.4 Map 29: Solomon Islands and Bougainville — 
Languages and numbers of speakers 


Austronesian Languages 
North-west Solomonic Chain 
Nehan-North Bougainville 


Nehan (I) 
Nehan 7,000 
Buka 
Halia 20,000 
Solos 3,200 
Petats 2,000 
Saposa-Timputz 
Saposa 1,400 
Hahon 1,300 
Timputz 3,900 
Teop 5,000 
Papapana (I) 
Papapana 150 
South-west Bougainville 
Banoni 1,000 
Piva 550 
Shortlands 
Mono-Alu 9,500 
Uruava (extinct) 
Torau 605 
Choiseul 
North-west Choiseul 
Vaghua 1,650 
Varisi 7,200 
Ririo 380 
Central-eastern Choiseul (I) 
Central-eastern Choiseul 
New Georgia-Santa Ysabel 
New Georgia 
Ghanongga 
Lungga 
Simbo 
Nduke 
Roviana 
Ughele 
Kusaghe 
Hoava 
Marovo 
Vangunu 
Santa Ysabel 
Kia 
Laghu 
Kokota 
Zazao 
Blablanga 
Maringe 
Gao 


South-east Solomonic Chain 


Gela-Guadalcanal 
Bugotu (I) 
Bugotu 
Gela 
Gela 
Lengo 
Guadalcanal 
West Guadalcanal 
Talise 
Malango 
Birao 
Malaita-San Cristobal 
Malaita 
Longgu 
Lau 
North Malaita 
Kwara'ae 
Langalanga 
Kwaio 
Dori'o 
‘Are'are 
Marau 
Oroha 
South Malaita 
San Cristobal 
Arosi 
Fagani 
Bauro 
Kahua 


Eastern Outer Islands 


Utupua 
Nembao (I) 
Nembao 
Asumboa (I) 
Asumboa 
Tanimbili (I) 
Tanimbili 
Vanikoro 
Buma 
Vano 
Tanima 


Polynesian Outliers 
Polynesian Outliers 


Rennellese 
Takuu 
Nukumanu 
Luangiua 
Sikaiana 
Pileni 
Tikopia 


Papuan Languages 
North Bougainville 


Rotokas 
Eivo 
Konua 
Keriaka 


South Bougainville 


Nasioi 
Nagovisi 
Siwai 
Buin 


Central Solomons 


Kazukuru 
Bilua 
Baniata 
Lavukaleve 


75 


190 


500 
(extinct) 
(extinct) 


3,570 
250 
200 

1,800 
920 

1,680 

3,550 


4,320 
1,200 
3,500 
1,000 


10,000 
5,000 
6,600 

30,500 


(extinct) 
8,540 
1,480 
1,150 


Savosavo 2,200 
Reefs-Santa Cruz 

Aiwo 7,100 

Santa Cruz 5,000 

Nanggu 9,500 


4.5.6 New Ireland and Admiralty Islands 
(Papua New Guinea) 


The Austronesian languages on Map 30 are Oceanic 
Austronesian, but range from quite typical to rather 
aberrant. Those of the Admiralty Islands are being 
increasingly replaced by New Guinea Pidgin as the 
first language. Kuot, the Papuan language shown, has 
a gender system with concordance, number marking 
with nouns and a highly elaborate verb morphology. 
It is highly agglutinating. 


4.5.7 New Britain (Papua New Guinea) 


The Austronesian languages shown on Map 30 are 
Oceanic Austronesian, ranging from quite typical to 
quite highly aberrant. They are highly diverse, and 
some appear to have been subject to strong Papuan 
influence. The Papuan languages shown mostly have 
gender systems with concordance, number marking 
with nouns and a highly elaborate verb morphology. 
They are less agglutinating than those shown on 
the map of the Solomon Islands and Bougainville 
(Map 29). 


Table 4.5 Map 30: New Britain and New Ireland — 
Languages and numbers of speakers 


Austronesian Languages 


Admiralty 
West Admiralty 
Wuvulu-Aua 1,000 
Seimat 1,000 
Kaniet (extinct) 
Eastern Admiralty 
West Manus 
Hermit (extinct) 
Sisi/Bipi 1,200 
Sori/Harengan 570 
Lindrou 4,200 
Mondropolon 300 
Bohuai 1,400 
Levei-Tulu 1,600 
Likum 100 
Mokerang 200 
Loniu 460 
East Manus 
Ponam 420 
Andra-Hus 810 
Leipon 650 
Kuruti-Kele-Ere 2,600 
Koro (Okro) 400 
Papitalai 520 
Titan 3,850 
Nali 1,800 
Nane 300? 
E 50? 
Pak/Tong 970 
South-east Admiralty 
Baluan-Lou-Pam 1,000 
Lenkau 250 
Penchal 550 
Nauna 130 
North New Guinea Cluster 
Ngero-Vitiaz 
Ngero 
Bariai 1,380 
Kove-Kaliai 6,750 
Vitiaz 
South-west New Britain 
Lamogai 1,600 
Pulie 1,000 
Rauto 1,000 
Mok-Aria 626 
Idne 5,000 
Longa 200 
Pasismanua 3,000 
Arawe 2,200 
Moewehafen 3,700 
Gasmata 200 
Kapore 1,050 
Mangseng 2,500 


Mengen 
Uvol 4,200 
Mamusi 6,000 
Mengen 8,400 
Kilenge 
Kilenge 440 
Maleu 4,500 


Meso-Melanesian 
New Ireland 


Tomoip 
Tomoip 700 
South New Ireland/North-west Solomons 
Siar 
Kandas 480 
Duke of York 10,000 
Patpatar 7,000 
Tolai 90,000 
Konomala 800 
Sursurunga 3,000 
Tangga 5,800 
Madak 
Lavatbura-Lamusong 1,300 
Madak 3,000 
Barok 2,120 
Tabar 
Notsi 1,600 
Tabar 2,010 
Lihir 6,000 
Lavongai-Nalik 
Lavongai 12,000 
Tigak 6,000 
Kara 5,000 
Tiang 800 
Nalik 5,200 
Willaumez 
Meramera 2,000 
Nakanai 13,000 
Xarua 1,400 
Bola 6,300 
Bulu 570 
Bali-Vitu 
Bali-Vitu 20,000 
St Matthias Group 
Emira-Mussau 5,000 
Tenis 50 


Papuan Languages 
East New Britain 
Baining-Taulil 


Butam 500 
Taulil 500 
Baining 6,500 
Yele/West New Britain 
Yele 
Yele (Yela, Yeletnye) 4,000 
Wasi 
Ata 2,000 
Anem 
Anem 600 
Sulka (I) 
Sulka 2,500 
Kol (I) 
Kol 4,000 
Kuot (I) 
Kuot 2,300 


4.5.8 South-eastern Papua New Guinea 


The Austronesian languages on Map 31 are Oceanic 
Austronesian, most of which show Papuan influence 
in their structures. The Papuan language, Yele, has a 
highly elaborate verb morphology and an extremely 
complicated phonological system with very unusual 
consonants. It has no concordance class system. 


Table 4.6 Map 31: South-eastern Papua New Guinea — 
Languages and numbers of speakers 


Austronesian Languages 
Papuan Tip Cluster 
Nuclear Papuan Tip 
North Mainland-D'Entrecasteaux 


Suau 
Tubetube c. 2,000 
Kurada 940 
Wagawaga 1,020 
Bohutu 1,070 
Suau 6,800 


Kakabai 
Dawawa 1,630 
Kakabai 880 
Taupota 
Kukuya 1,230 
Wedau 2,180 
Taupota 2,660 
Garuwahi 230 
Tawala c. 10,000 
Are 
Arifama (Arifama-Miniafia) (2000) 3,470 
Ubir 912 
Doga 200 
Are (Mukawa) 1,230 
Paiwa 1,320 
Boianaki 1,180 
Anuki 
Anuki 530 
Dobu-Duau 
Dobu c. 8,000 
Mwatebu 170 
Sewa Bay 1,520 
Galeya 1,880 
Bosilewa 350 
Duau 6,050 
Bunama c. 1,500 
Bwaidoga 
Wataluma 190 
Iduna c. 6,000 
Diodio 1,170 
Bwaidoga 5,380 
Kalokalo 730 
Yamalele 2,560 
Fagululu 420 
Molima 2,770 
Gumawana 
Gumawana (2000) 470 
Maisin 
Maisin 1,770 
Peripheral Papuan Tip 
Central Papuan 
Magori 
Magori 163 
Yoba extinct 
Bina extinct 
Ouma 
Ouma extinct 
Sinagoro 
Sinagoro c. 15,000 
Keapara 16,520 
Kilivila-Louisiades 
Louisiades 
Nimoa 1,100 
Sudest (1987) 2,000 
Kilivila 
Kilivila c. 22,000 
Muyuw 3,000 
Budibud 170 
Misima (2002) 18,000 
Papuan Languages 
Trans-New Guinea 
South-east Papuan 
Koiarian 
Koiaric 
Mountain Koiari 3,730 
Baraic 
Barai 3,010 
Omie (Aomie) c. 1,100 
Managalasi c. 5,000 
Kwalean 
Kwale c. 2,000 
Manubaran 
Maria 2,110 
Doromu 840 
Yareban 
Abia 580 
Doriri 570 
Sirio c. 700 
Yareba 750 
Bariji 260 
Mailuan 
Domu 480 
Morawa 760 
Binahari 770 
Bauwaki 380 
Magi (Mailu) c. 6,000 
Laua (Labu) (1987) 1 


Dagan 
Daga (Dimuga) 6,000 
Mapena 270 
Turaka (2000) 25 
Maiwa c. 2,500 
Onjob 160 
Sona (Kanasi) c. 2,000 
Ginuman (Dime) 780 
Gwedena (Gede) 2,160 
Jimajima 540 
Binanderean 
Binanderean 
Zia c. 3,000 
Binandere 3,300 
Ambasi (Dawari) (1981) 1,200 
Aeka 2,000 
Orokaiva c. 27,000 
Hunjara 4,270 
Notu (Ewage) c. 12,000 
Yega 900 
Gaina 1,130 
Baruga 1,050 
Dogoro (Doghoro) (2000) 270 
Korafe (2003) 3,630 
Yele-West New Britain 
Yele 
Yele (Yela, Yeletnye) (1998) 4,000 


4.5.9 Southern Papua New Guinea 


The Trans-New Guinea family Papuan languages on 
Map 32, with some exceptions, show the influence of 
a substratum that manifests itself in the abundant 
presence of nasal vowels, and a verb morphology 
that is relatively simple in general, but shows a great 
proliferation of aspect markers. Some also show a 
rudimentary gender system in the verb. 


Table 4.7 Map 32: Southern Papua New Guinea — 


Languages and numbers of speakers 


Papuan Languages 
Trans-New Guinea 
Kainantu-Gorokan 
Kainantu 
Gadsup-Auyana-Awa 
Gadsup (2000) 22,000 
Auyana 8,220 
Awa 1,790 
Tairora 
Waffa 1,000 
Binumarien 300 
Tairora 13,290 
Owena (Waisara) (I) 
Owena (Waisara) 350 
Gorokan 
Siane 
Siane 23,480 
Yabiyufa (Yawiyuha) c. 2,000 
Gahuku 
Gahuku-Asaro 34,550 
Benabena 20,320 
Kamano (I) 
Kamano 85,580 
Fore 
Fore c. 17,000 
Gimi 22,460 
Chimbu-Wahgi 
Chimbu 
Chimbu 137,650 
Chuave 23,110 
Nomane 4,650 
Wahgi 
Wahgi c. 60,000 
Nii c. 12,000 
Hagen (I) 
Hagen c. 130,000 
Engan 
Enga 
Enga 164,750 
Katinya 900 
Huli (I) 
Huli c. 70,000 
Angal (Mendi)-Kewa 
Angal (Mendi) 45,450 
Kewa c. 56,000 
Sau 3,130 


Wiru (I) 
Wiru (I) 15,290 
Kenati (I) 
Kenati (Ganati, Aziana) 640 
Finisterre-Huon 
Huon 
Eastern Huon 
Momare 370 
Sene (1978) 10 
Migabac 1,050 
Kate 6,130 
Dedua c. 5,000 
Mape 5,120 
Kube 5,860 
Kosorong 1,560 
Western Huon 
Sialum 640 
Ono 5,400 
Nomu 810 
Kinalakna 220 
Kumokio 550 
Komba 12,240 
Selepet 7,000 
Timbe 11,280 
Tobo 2,890 
Yaknge 2,080 
Burum 4,090 
Mesem 1,750 
Nabak c. 12,000 
Finisterre 
Erap 
Nuk 1,720 
Nek 1,460 
Nakama 1,040 
Munkip 140 
Gusan 870 
Sauk 610 
Numanggang 2,270 
Finungura 470 
Nimi 1,560 
Mamaa 190 
Urii 2,470 
Wantoat 
Bam 390 
Yagawak 490 
Irumu c. 1,300 
Wantoat 7,320 
Uruwa 
Weliki 200 
Komutu 510 
Yau 1,700 
Som (2000) 80 
Sakam 510 
Yupna 
Isan (Yupna) (2000) 7,200 
Wandabong 520 
Nokopo 1,670 
Kewieng 820 
Nankina c. 2,500 
Gusap-Mot 
Nahu 6,000 
Ufim 550 
Rawa 7,140 
Abaga (I) 
Abaga 150 
Ok 
Southern (Metomka) Kati 4,500 
Yonggom c. 4,000 
East Strickland 
Nomad 2,900 
Agala 300 
Konai 400 
Bosavi 
Beami (Bedamini) 4,200 
Onabasulu 430 
Kaluli 2,000 
Kasua 200 
Kware (2000) 140 
Sonia 410 
Bainapi 400 
Tomu River Language 300 
Awin-Pa 
Awin 6,500 
Pa 1,500 
Awyu-Dumut 
Kaeti 10,000 
Kamula (I) 
Kamula 400 
East Kutubu 
Foe 2,800 
Fiwaga (1981) 300 
West Kutubu 
Fasu 750 
Some 150 
Namumi 300 
Boazi 
Zimakani c. 1,500 
Boazi 1,960 
Marind 
Bian Marind 900 
Marind 7,200 
Suki (I) 
Suki (2003) 3,510 
Gogodala 
Gogodala c. 10,000 
Ari-Waruna c. 3,500 
Angan 
Angaataha (I) 
Angaataha 1,200 
Main 
Hamtai (Kapau) 40,000 
Kawatsa (Kawacha) (2000) 12 
Kamasa (2003) 7 
Menya c. 15,000 
Safeyoka (Ampale) 3,120 
Yagwoia c. 9,000 
Baruya 5,190 
Simbari c. 3,000 
Ankave c. 1,600 
Ivori 400 
Lohiki 850 
South-eastern Papuan 
Goilalan 
Biangai 1,400 
Weri 4,160 
Kunimaipa c. 11,000 
Tauade c. 11,000 
Fuyuge c. 15,000 
Koiarian 
Koiaric 
Koita c. 3,000 
Koiari 1,780 
Mountain Koiari 3,730 
Baraic 
Barai 3,010 
Omie (Aomie) c. 1,100 
Managalasi c. 5,000 
Yareban 
Sirio c. 700 
Bariji 260 
Doriri 570 
Yareba 750 
Kwalean 
Mulaha extinct 
Humene 440 
Kwale c. 2,000 
Manubaran 
Maria 2,110 
Doromu 840 
Binandere 
Binanderean 
Suena 2,270 
Yekora 670 
Zia c. 3,000 
Mawae 940 
Binandere 3,300 
Ambasi (Dawari) 1,200? 
Aeka 2,000 
Orokaiva 27,000 
Hunjara 4,270 
Notu (Ewage) 12,000 
Yega 900 
Guhu-Samane (I) 
Guhu-Samane 6,290 
Madang 
Rai Coast 
Wasembo (Gusap) 590 
Teberan (F) 
Dadibi (Daribi, Mikaru) c. 10,000 
Podopa (Polopa) 2,790 
Pawaian (I) 
Pawaian c. 4,000 
Kiwai-Porome 
Kiwai 
Southern Kiwai 9,700 
Wabuda 1,700 
Bamu Kiwai 4,400 
Morigi (Turama River Kiwai) 700 
Kerewo 2,200 
North-eastern Kiwai c. 4,400 
Arigibi 300 


Porome 
Porome c. 1,100 
Tirio 
Tirio 950 
Aturu 220 
Lewada-Dewara 450 
Mutum (Paswan) 400 
Waia (I) 
Waia 1,000 
Inland Gulf (F) 
Ipiko 200 
Minanibai 300 
Tao-Suamato 500 
Turama-Kikorian (S) 
Turama-Omatian (Mena) 
Omati (Mini) 800 
Ikobi (Kasere) 350 
Mena 300 
Rumu (Kairi) (I) 
Rumu (Kairi) c. 1,000 
Eleman (S) 
Purari (I) 
Purari (Koriki, Namau) c. 7,000 
Eleman 
Eastern Eleman 
Toaripi 20,130 
Uaripi 2,470 
Western Eleman 
Opao 1,120 
Keuru (Belepa) 4,520 
Orokolo 13,000 
Tate 
Tate (Raepa-Tati) 270 
South-Central Papuan 
Morehead and Upper Maro Rivers 
Yey (I) 
Yey (1996) 900 
Nambu 
Dorro (2003) 390 
Nambu (2003) 710 
Tonda 
Lower Morehead (Peremka) (2003) 290 


Upper Morehead (Rouku, Kamindjo, Tjokwai, Tokwasa) (2002) 700 


Tonda 600 
Kanum 320 
Pahoturi River 
Agob (Dabu) 1,100 
Idi 900 


Eastern Trans-Fly 
Eastern Trans-Fly 


Bine (Kunini) c. 2,000 
Gidra (Jibu) 1,650 
Gizra (Toga) 600 
Miriam 700 
Austronesian Languages 
Oceanic 
North New Guinea Cluster 
Huon Gulf 
North Huon Gulf 
Yabem 2,080 
Bukawa 9,690 
Lae extinct 
Kela 2,750 
Markham 
Labu 1,600 
Duwet 400 
Silisili 990 
Dangal 370 
Maralango 170 
Adzera 20,680 
Sukurum 990 
Wampur 360 
Sirasira 600 
Wampar 5,150 
Sirak 150 
Musom 140 
Yalu 590 
Mari 2,000 
South Huon Gulf 
Kaiwa 920 
Hote 3,000 
Vehes 100 
Mapos Buang 6,670 
Mangga Buang 2,690 
Mumeng 6,640 
Kapin 1,670 
Piu 130 
Numbami 
Numbami (Siboma) 270 


Ngero/Vitiaz 

Vitiaz 
Sio c. 3,500 
Tami 900 
Malasanga 440 
Nenaya 320 

Ngero 
Gitua 480 


Papuan Tip Cluster 
Peripheral Papuan Tip 
Central Papuan 
West-Central Papuan 


Mekeo 12,000 
Roro 8,000 
Kuni 1,700 
Doura 800 
Lala 990 
Gabadi 1,400 
Motu 14,000 
Sinagoro-Keapara 
Sinagoro c. 15,000 
Keapara 16,520 
Australian Languages 
Pama-Nyungan 
Kala Lagaw Ya 
Kala Lagaw Ya 3,000-4,000 


4.5.10 Northern Papua New Guinea 


The Austronesian languages on Map 33 are Oceanic 
Austronesian languages that show Papuan influence 
in their structures. The Trans-New Guinea family 
languages shown have a noun-classification system 
through classificatory verbs, sentence-medial verbal 
forms, no number marking with nouns and mostly 
a highly elaborate verb morphology. A few have a 
two-gender system. Most of them have complex 
phonological systems, and a number are tonal. The 
Sepik-Ramu languages shown have simple phonologies, 
transparent and mostly simple grammatical 
structures, with two-gender systems and number 
marking in nouns. In the Upper Sepik languages, 
multiple-class systems, with concordance, are present. 
The languages of the Nor-Pondo subgroup mostly 
have complex noun-classification systems with concordance, 
though this has been lost in some instances 
in the last few decades. The Torricelli family languages 
shown have complex phonological systems, 
two- or three-gender systems in subject prefixes 
accompanied by a multi-class system with concordance, 
and complicated irregular number marking in 
nouns. The multi-class system has been lost in some 
cases over the last few decades. The languages of 
the small phyla shown are in general comparatively 
simple in their structures, except for those of the 
Skou subgroup, which are exceedingly complex, with 
some of them possessing a morphology based on 
tonal contrasts. Those of the Arai group have nasal 
vowels, tones and a large number of aspect markers. 


Table 4.8 Map 33: Northern Papua New Guinea — 
Languages and numbers of speakers 


Papuan Languages 
Trans-New Guinea 
Madang 
Southern Adelbert Range-Korak-Waskia 
Southern Adelbert Range 


Josephstaal 
Katiati 3,290 
Sileibi 260 
Osum 580 
Moresada 200 
Ikundun 1,050 
Pondoma 600 
Wadaginam 550 
Wanang 
Angaua 1,770 
Atemple (2000) 60 
Emerum 750 
Musak 260 
Paynamar 150 
Faita 
Faita (2000) 50 
Gants 
Gants 1,880 




Korak-Waskia 
Waskia c. 12,000 
Korak 210 
Rai Coast 
Yaganon 
Yabong 370 
Ganglau 150 
Dumun (Bai) (2000) 35 
Saep 580 
Minjim 
Songum 330 
Bongu 420 
Male 390 
Bom (Bogadjim, Lalok) 1,130 
Nuclear Rai 
Kolom 210 
Siroi (Suroi) 700 
Arawum (2000) 60 
Pulabu 120 
Lemio 180 
Yangulam 180 
Rerau 240 
Kwato (Waupe) 780 
Usu (2003) 272 
Sinsauru (Kow) 480 
Dumpu (Watiwa) 260 
Kesawai (Namuya) 540 
Asas (Kow) 330 
Sausi (Uya) 500 
Urigina 1,400 
Sumau (Garia) 2,510 
Danaru 120 
Usino 1,630 
Tauya (Inafosa) 350 
Biyom (Sasime) 380 
Wasembo (Gusap) (I) 
Wasembo (1980) 590 
Erima-Duduela 
Erima 410 
Duduela 470 
Jilim 
Jilim 410 
Kalam-Kobon 
Kalam c. 15,000 
Kobon c. 6,000 
Croisilles 
Dimir-Malas 
Dimir (2003) 3,820 
Malas 220 
Bunabun 500 
Kaukombar 
Pila 670 
Pay 770 
Saki (Maya) 2,440 
Tani (Miami) c. 1,500 
Mugil (Saker) 2,300 
Kumil 
Ulingan (Mawake) c. 2,000 
Bepour (2000) 50 
Moere (2000) 50 
Tibor-Omosa 
Mawak (2000) 25 
Kowaki (2000) 25 
Hinihon 1,100 
Musar 680 
Wanambre 490 
Abasakur 760 
Koguman 940 
Amaimon 
Amaimon (2003) 1,780 
Numugen-Mabuso 
Numugen 
Wanuma 1,280 
Yaben 700 
Yarawata 100 
Ukuriguma 130 
Bilakura (2000) 30 
Parawen 430 
Mabuso 
Munit 350 
Bemal (Kein) 640 
Girawa (Bagasin) 4,000 
Kare 380 
Panim 150 
Isebe 910 
Amele 5,300 
Gumalu 270 
Bau 1,790 


Sihan 310 
Yoidik 270 
Rempi 590 
Garus 2,110 
Garuh 2,280 
Kamba 1,020 
Rapting 330 
Bagupi (2000) 50 
Nake 170 
Matepi 240 
Gal 220 
Baimak 440 
Mawan 270 
Mosimo (2000) 50 
Wamas 140 
Samosa (2000) 90 
Murupi 300 
Saruga 130 
Silopi 140 
Utu 580 
Finisterre-Huon 
Huon 
Western Huon 
Sialum 640 
Ono 5,400 
Nomu 810 
Kinalakna 220 
Kumokio 550 
Komba 12,240 
Selepet 7,000 
Timbe 11,280 
Kovai (I) 
Kovai c. 4,500 
Wantoat 
Wantoat 7,320 
Uruwa 
Weliki 200 
Komutu 510 
Yau 1,700 
Som (2000) 80 
Sakam 510 
Yupna 
Domung 630 
Bonkiman (1991) 180 
Isan (Yupna) (2000) 7,200 
Wandabong 520 
Nokopo 1,670 
Kewieng 820 
Gabutamon 300 
Mebu 320 
Nankina c. 2,500 
Warup 
Yagomi 137 
Degenan 360 
Asat 660 
Forak 160 
Morafa 670 
Guiarak 130 
Dahating 950 
Bulgebi (2000) 50 
Gusap-Mot 
Gira 280 
Ngaing 1,100 
Neko 320 
Nekgini 430 
Nahu 6,000 
Ufim 550 
Rawa 7,140 
Abaga (I) 
Abaga 150 
Oksapmin (I) 
Oksapmin 6,700 
Kainantu-Gorokan 
Kainantu 
Gadsup (2000) 22,000 
Gorokan 
Isabi (Maruhia) (I) 
Isabi (Maruhia) 280 
Gende (Bundi) (I) 
Gende (Bundi) (1987) 8,000 
Siane (I) 
Siane 23,480 
Gahuku 
Gahuku-Asaro 34,550 
Benabena 20,320 
Kamano (I) 
Kamano (2000) 63,170 
Chimbu-Wahgi 
Chimbu 
Chimbu 137,650 
Chuave 23,110 


Wahgi 
Wahgi 
Nii 

Jimi 
Narak 
Ganja 
Maring 

Hagen (I) 
Hagen 

Engan 

Enga 
Enga 
Katinja 
Lembena 
Nete 
Ipili 

Huli (I) 
Huli 

Angal (Mendi)-Kewa 
Angal (Mendi) 

Ok 
Lowland Ok 


Southern Kati (Metomka) 
Northern Kati (Niinati) 


Yonggom 
Iwur 
Ninggirum 
Mountain Ok 
Telefol 
Tifal 
Faiwol 
Setaman 
Bimin 
Mianmin 
Ngalum 
Kauwol 
Duna-Pogaia 
Duna 
Pogaya (Bogaia) 
East Strickland 
Nomad 
Agala 
Konai 
Bosavi 
Beami (Bedamini) 
Awin-Pa 
Awin 
Pa 
Pauwasi 
Yafi 
Emumu 


Skou 
Vanimo 
Skou (Tumawo) 
Sangke 
Wutung (Udung) 
Vanimo (Manimo) 
Krisa 
Krisa 
Rawo 
Puari 


Warapu (Varofu, Waropu) 


Border 


Northern (Border-Tor-Lake Plain) 


Taikat 


Taikat (Arso, Abrap) 


Waris 
Simog 
Daonda 
Waris 


Manem (Jeti, Skofro, Wembi) 
Waina (Sowanda) 


Amanab 
Bewani 
Ninggera 
Kilmeri 
Pagi (Pagei) 
Kwomtari-Left May 
Kwomtari 
Kwomtari 
Ifas (Bembi) 
Kwomtari 
Baibai 
Baibai 
Biaka 
Pyu (I) 
Pyu 
Arai (Left May) 
Rocky Peak (Yinibu) 
Iteri (Yinibu) 


c. 60,000 
c. 12,000 


c. 5,000 
3,570 
c. 8,000 


c. 130,000 


164,750 
900 

600 

200 
7,760 


c. 70,000 


45,450 


4,500 
8,500 
c. 4,000 
1,100 
4,000 


c. 4,800 
c. 3,200
c. 4,500 
200 

c. 2,000 
2,200 
18,000 
500 


c. 11,000 
(1981) 300 


2,900 
300 
400 


4,200 


8,000 
6,500 


170 
1,100 


350 
200 
410 

c. 2,200 


350 
510 
370 
2,990 


c. 600 


270 
140 
3,160 
400 
1,100 
6,800 


250 


1,790 
990 


1,600 
c. 900 


270 
450 


(1978) 100 


280 
(2003) 475 
Bo (Po)

Ama (Waniabu) 
Nimo 

Owiniga 


Senagi (Anggor) 


Anggor (Senagi, Watapor) 
Dera (Dra, Kamberatoro) 


Torricelli 


West Wapei 
One 
Seta 
Seti 
Wapei-Paiei 
Wapei 
Yis 
Olo (Wapi) 
Yau 
Elkei 


Alu 
Ningil 
Gnau 
Valman (Koroko) 
Galu 
Yapunda 
Palei 
Nambi 
Agi 
Kayik 
Aruop 
Bragat 
Aiku 
Aru 
Urat 
Urim 
Mamai 
Laeko-Libuat (I) 
Laeko-Libuat 
Beli (I)
Beli 
Maimai 
Wiaki 
Siliput 
Yahang 
Heyo 
Kombio 
Kombio 
Aruek (Djang) 
Lou (Torricelli) 
Eitiep 
Kombio 
Yambes 
Arapesh 
Southern Arapesh 


(Muhiang, Mufwian) 


Mountain Arapesh 
(Bukiyip, Kavu) 


Bumbita (Bumbita Arapesh) 


Wom (I) 
Wom (Wam) 
Marienberg 
Mandi 
Muniwara (Tumara) 
Urimo 
Bungain 
Kamasau (Sagi) 
Elepi (Sampi) 
Buna 
Monumbo 
Monumbo 
Lilau (Ngaimbom) 


Sepik 


Biksi (I) 
Biksi 
Upper Sepik 
Abau (I) 
Abau 
Iwam 
Iwam
Amal 
Wogamusin 


Chenapian (Tsenap, Zenap) 


Wogamusin 
Ram 
Karawa 
Bouye 
Autu 
Tama 
Kalou 


175 
c. 400 
410 
220 


2,560 
c. 1,700 


2,210 
160 
110 


490 

c. 12,000 
140 
1,430 

c. 5,000 
2,130 
1,880 
520 

980 

700 

210 
(2000) 60 


480 

670 

770 

330 

340 

820 

130 

c. 6,000 
(2003) 3,740 


500 


1,240 


560 
220 
1,000 
1,870 


610 
c. 600 
390 
2,150 
860 


10,650 


10,300 
c. 2,500 


1,890 


160 
830 
840 
2,450 
790 
150 
1,260 


460 


450 


200 


4,550 


3,200 
390 


190 
370 


(2003) 63 
520 
380 


820 


Sepik Hill 


Pasi 
Pahi 


Mehek (Driafleisuma) 

Mayo (Mayo-Yesan, Yasyin, Yasi, 
Yessan-Mayo) 

Yellow River 

Namia 


Ak 


Awun 
Middle Sepik 
Nukuma 


Kwoma (Kuome, Washkuk) 
Kwanga (Gawanga, Womsak) 


Yerakai (I) 


Ndu 


Yerakai 


Ngala (Kara, Sogap) 

Kaunga 

Manambu 

Kwasengen 

Abelam (Abulas, Ambelas) 

Boiken 

Iatmul

Sawos (Gaikunti, Kwomgal, 
Sepik Plains, Tshuosh) 


Sanio 


Sanio 

Paka 

Gabiano 

Piame (Biami) 

Bikaru (Bugalu) 

Hewa (Sisimin, Umairof) 


Bahinemo 


Bitara (Apowasi) 
Bahinemo (Wogu) 
Mari 

Bisis 

Watakataui 
Kapriman 
Sumariup (Sogoba) 


Alamblak 


Kaningara 
Alamblak 


Leonhard Schultze 
Walio 


Papi 


Walio 

Pai 

Yabio 

Tuwari (Akiapmin) 


Papi (Paupe) 
Duranmin 


Ramu 
Mongol-Langam 


Lower Sepik-Ramu 


Langam 
Mongol 
Yaul 


Porapora (Grass) 


Kambot (Keram) 
Adjora (Adjoria) 
Gorovu (Yerani) 
Aion 


Banaro (I) 


Banaro 


Arafundi 


Alfendio (Arafundi) 
Meakambut 


Rao (I) 


Aian 


Rao 


Anor 
Aiome 


Ottilien 


Watam 

Gamei 

Kaian 

Bosman (Bosngun) 
Awar 


Misegian 


Giri (Kire) 
Sepen 
Mikarew (Ariawia, Gumasi) 


Ataitan 


Andarum 
Igom 
Tangu 
Tanguat 


Tamolan 


Romkun 


160 
540 
3,340 


1,200 


3,500 
(2000) 75 
380 


2,870 
13,310 


390 


140 
230 
2,060 
6,010 
44,000 
35,200 
12,000 


9,000 


640 
300 
100 
100 
100 
2,150 


260 

430 

810 

400 

160 
1,440 
(1993) 80 


360 
1,500 


140 
210 
100 
120 


(2000) 70 
(2000) 140 


250 
340 
310 


6,110 
2,380 
(2000) 15 
860 


2,480 


630 
100 


5,000 


570 
750 


380 
1,140 
320 
720 
570 


2,000 
430 
8,000 


1,080 
1,080 

c. 3,000 
510 


390 


Breri 

Kominimung 

Igana 

Akrukay 

Itutang 

Midsivindi 
Lower Sepik 

Pondo 


Chambri (Tchambuli) 


Yimas 


Karawari (Tabriak) 
Angoram (Olem, Pondo) 


Nor 
Murik (Nor) 
Kopar 
Gapun (I) 
Gapun 
Yuat 
Yuat 
Changriwa 
Mekmek 


Biwat (Mundokuma, 


Mundugumor) 
Bun 
Miyak 
Maramba (I) 
Maramba 
Piawi 
Waibuk (Piawi) 
Waibuk (Wiyaw) 
Aramo (Aramaue) 
Pinai (Pinaye) 
Wapi 
Erem? 
Amto-Musian 
Amto (Ki) 
Musian (Musa) 
Yuri (I)
Yuri 
Busa (I) 
Busa 
Nagatman (I) 
Nagatman 


Austronesian Languages 
North New Guinea 
Ngero/Vitiaz 
Ngero 
Mutu 
Gitua 


Malamalai (Bonga) 


Vitiaz 
Korap 
Barim 
Lukep 


Malasanga 


Mangap 
Mangap 
Sio 
Sio 
Bel 
Sepa 
Ham 
Bilibil 


Gedaged (Bel) 


Takia 
Megiar 
Matukar 


Wab (Som) 
Biliau (Sengam) 


Mindiri 


Roinji-Nenaya 


Roinji 


Nenaya (Nineia) 


Huon Gulf 
Markham 
Adzera 
Mari 
Schouten 
Manam 
Medebur 
Manam 
Bam 
Wogeo 
Kis 
Kairiru 
Kairiru 
Kaiep (Samap) 
Siau 
Sera (Ssia) 
Sissano 


1,070 
328 
110 
190 
220 
810 


1,700 

350 
1,300 
6,200 


1,480 
230 


(2000) 80 


500 
1,040 


1,640 
190 
550 


300 


1,000 
300 

c. 1,500 
1,000 


(2000) 200 
(2000) 70 


740 


300 


500 


2,000 
480 
340 


470 
1,770 
440 


(1991) 2,500 


(1987) 3,500 


(2003) 690 
(1998) 3,500 
700 

2,760 

c. 12,000 
860 

220 

140 

(2000) 1,150 
(2000) 80 


227 
(2002) 580 


20,680 
(2000) 810 


(2003) 510 
c. 6,500 
1,460 
1,240 
(2000) 220 


2,810 
300 


430 
(2000) 300 
Table 4.8 continued


Ali 2,118 
Tumleo 680 
Ulau-Suain 1,370 


4.5.11 Papua (Indonesia) 


Of the Austronesian languages shown on Map 34, 
those of the South Halmahera—North-West New 
Guinea group are transitional between Eastern 
Indonesian and Oceanic Austronesian, whereas those 
of the Central Maluku group are Eastern Indonesian 
languages of a relatively simple type. The Micronesian 
family of languages are Oceanic Austronesian of a 
special type, with complex possession marking 
and complicated phonological systems. The Trans- 
New Guinea family languages shown have a noun- 
classification system through classificatory verbs, 
sentence-medial verbal forms, no number marking 
with nouns and mostly a highly elaborate verb 
morphology. Most of them have complex phono- 
logical systems, and a few of them are tonal. Some of 
those of the South Bird’s Head group show strong 
West Papuan group influence in their structures, 
especially in the appearance of West Papuan type 
subject prefixes. The languages of the West Papuan 
group shown have subject and object prefixes with a
two-gender system, number marking with some nouns
and a comparatively simple verb morphology. The
word order is Subject-Verb-Object, by contrast with
the order SOV, which is the norm in the languages of 
all other Papuan families. In the Cenderawasih Bay 
languages, those of the Cenderawasih Bay subgroup 
have a complex verb structure with a large range of 
aspects and indications of the directions of actions, 
while the Yava isolate shows also a two-gender system 
in pronouns, nouns and person marking in the verb. 
Number marking in nouns is also present in it. 


4.5.12 North-eastern Papua (Indonesia) 


Of the Austronesian languages on Map 34, those of 
the South Halmahera—North-West New Guinea 
group are transitional between Eastern Indonesian 
and Oceanic Austronesian. Those of the Yotafa- 
Sarmi group are more Oceanic Austronesian. The 
Trans-New Guinea languages shown have a noun- 
classification system through classificatory verbs, 
sentence-medial verbal forms and mostly an elabor- 
ate verb morphology. Those of the Nimboran sub- 
group, and of the Tor and Lake Plain groups, have 
some features reminiscent of those attributable to the 
influence of the substratum affecting Trans-New 
Guinea family languages in south-western Papua New 
Guinea. They also show a rudimentary two-gender 
system in the verb, and some number marking with 
nouns. Most of the Trans-New Guinea family lan- 
guages have complex phonological systems, and 
several are tonal. Some of the languages of the Tor 
and Lake Plain groups have an elaborate tonal
morphology. The languages of the Skou group are 
exceedingly complex, and some of them also have 
a morphology based on tonal contrasts. Those of 
the Kwomtari group have a comparatively simple 
structure, whereas those of the Cenderawasih Bay 
group have a complex verb structure with a large 
range of aspects and indications of the directions of 
actions. Some of the isolates have noun classification
by affixes.


4.5.13 South-eastern Papua (Indonesia) 


Of the Austronesian languages on Map 34, those of 
the South Halmahera—North-West New Guinea 
group are transitional between Eastern Indonesian 
and Oceanic Austronesian, whereas those of the other 
two groups are Eastern Indonesian languages of a 
relatively simple type. Among the Papuan languages, 
the Trans-New Guinea family languages shown have a 
noun-classification system through classificatory
verbs, sentence-medial verbal forms, no number
marking with nouns and mostly a highly elaborate
verb morphology. A few have a two-gender system. 
Most of them have complex phonological systems, 
and a few of them are tonal. Some of the languages of 
the Tor and Lake Plain group have an elaborate tonal 
morphology, and some number marking with nouns 
and a rudimentary gender distinction in the verb are
present in some of them. Trans-New Guinea family 
languages in the south-eastern corner of Map 34 
show the influence of a substratum that manifests 
itself in the abundant presence of nasal vowels, and 
a verb morphology which is relatively simple in 
general but shows a great proliferation of aspect 
markers. They also have a two-gender system. The 
languages of the Cenderawasih Bay stock have a 
complex verb structure with a large range of aspects 
and indications of the directions of actions. 


Table 4.9 Map 34: Irian Jaya — Languages and numbers 
of speakers 


Papuan Languages 
Trans-New Guinea 


Ok 
Lowland Ok 
Southern Kati c. 4,000 
Northern Kati c. 8,000
Yonggom c. 17,000 
Iwur c. 1,000 
Ninggirum 4,000 
Mountain Ok 
Mianmin 2,200 
Ngalum 18,000 
Kauwol 500 
Awyu-Dumut-Asmat-Kamoro 
Awyu-Dumut
Dumut 
Wanggom 1,000 
Wambon c. 3,000 
Kaeti c. 10,000 
Awyu 
Kotogut 1,000 
Aghu 3,000 
Pisa 3,500 
Airo-Sumaghaghe 2,000 
Siagha-Yenimu 3,000 
Sawuy 
Sawuy 2,500 
Unclassified Awyu-Dumut Languages 
Mapi? 3,300?
Ederah (1991) 500 
Kia? 2,700? 
Upper Digul (1991) 500 
Upper Kaeme? 200? 
Mombum 
Mombum 250 
Koneraw 300 
Asmat-Kamoro 
Central Asmat 50,000 
Casuarina Coast Asmat c. 8,000 
Citak Asmat c. 8,000 
North Asmat 700 
Sempan 1,000 
Kamoro c. 8,000 
Iria (1993) 1,570
Asienara (2000) 1,000 
Somahai 
Somahai (2000) 2,000 
Marind 
Boazi 
Boazi 1,960 
Marind 
Bian Marind 900 
Marind 7,200 
Yaqay 
Yaqay c. 10,000 
Warkay-Bipim 250 
Kayagar 
Tamagario 3,500 
Kaygir 7,200 
Kaugat 700 
West Trans-New Guinea Linkage 
Dani 
Great Dani 
Ngalik-Nduga
North Ngalik 35,000 
South Ngalik 5,000 
Nduga 10,000 


Dani 


Grand Valley Dani 
Western Dani 


Wano (I) 
Wano 
Saberi (I) 
Saberi 
Samarokena (I) 
Samarokena 
Wissel Lakes-Kemandoga 
Uhunduni (I)
Uhunduni 
Ekagi-Wodani-Moni 
Moni 
Wodani 
Ekagi 
West Bomberai 
West Bomberai 


Baham 
Iha
Karas (I) 
Karas 
Mor (I) 
Mor 
Dem (I) 
Dem 
Mek (Goliath) 
Western 
Kosarek 
Nipsan 
Nalca 
Korapun 
Goliath 
Eipo-Mek 
Eastern 
Ketengban 
Sirkai? 
Kinome? 


South Bird's Head 
South Bird's Head 
Eastern 
Arandai 
Barau 
Central 
Tarof 
Kasuweri 
Western 
Puragi 
Kampong Baru 
Inanwatan 
Inanwatan 
Duriankere 
Konda-Yahadian 
Yahadian 
Konda 
Pauwasi 
Eastern 
Yafi 
Emumu 
Western 
Dubu 
Towei 
Molof (I) 
Molof 
Usku (I) 
Usku 
Tofamna (I) 
Tofamna 
Kaure 
Kaure 
Kaure 
Narau 
Kosare 
Sause (I) 
Sause 
Kapori (I)
Kapori 
Kolopom 
Kimaghama 
Riantana 
Ndom 
Moraori (I)
Moraori 
Tanah Merah (I) 
Tanah Merah 


Border 
Northern (Border-Tor-Lake Plain) 


Bewani 
Kilmeri 
Pagi 


c. 100,000 
c. 150,000 


c. 3,500 


1,500 


400 


12,000 


c. 25,000 
c. 10,000 
c. 100,000 


1,100 
5,500 


200 


700 


c. 1,000 


(1993) 2,300
(1993) 2,500
(2000) 11,090
(1996) 8,000
(1991) 4,000
(1987) 3,000


(2000) 9,970


c. 1,000 
150 


c. 1,200 
c. 2,500 


900 
250 


1,100 
100 


700 
500 
170 


1,100 


130 
120 


(1978) 200 
(2000) 20 
100 

4,600 

(2000) 80-90 
(1993) 250 
500 

(2000) 30-40 
3,000 

1,100 

450 


(1998) 50 


(1978) 500 


1,790 
990 


114 




Waris 
Waris 3,160 
Manem 400 
Waina 1,100 
Senggi 120 
Taikat 
Taikat c. 600 
Awyi c. 400 
Morwap (I) 
Morwap 300 
Senagi 
Anggor 
Dera 1,700 
Extended West Papuan 
West Papuan 
West Bird's Head 
Seget 
Seget (1988) 1,200 
Moi 
Moi 4,000 
Moraid 
Moraid (1988) 1,000 
Tehit 
Kalabra (2000) 3,290 
Tehit (2000) 10,000 
Kuwani 1,000 
North and Central Bird's Head 
Karon Pantai 2,500 
Madik 1,000 
Karon Dori 5,000 
Brat 15,000 
Amberbaken (I) 
Amberbaken 5,000 
Borai-Hattam 
Hattam 12,000 
Borai 1,000 
East Bird's Head 
Meax 
Meax (2000) 14,780 
Meninggo (1996) 8,000 
Mantion (I) 
Mantion 12,000 
Sentani 
Sentani 
Nafri 1,630 
Sentani 25,000 
Tanah Merah 500 
Demta (I) 
Demta 700 
Yava (I) 
Yava 6,000 
Mairasi 
North-eastern Mairasi (pop. inc. with Mairasi) 
Semimi (1991) 1,000 
Mairasi (1996) 3,300 
East Cenderawasih Bay 
Tarunggare (1993) 500 
Baropasi (1995) 2,500 
Nisa (1987) 500 
Bauzi (1991) 1,500 
Bapu (1993) 2,000 
Lakes Plain 
East Lakes Plain 
Foau 230 
Dabra (app. = Taworta) 140 
Taworta (2000) 140 
Central Lakes Plain 
Papasena 400 
Aikwakai (1993) 800 
Weretai (2000) 150 
Taori-Kei (2000) 220 
Taogwe (2000) 15-50 
Taori-So (2000) 70-100 
Baburiwa (2000) 530 
Turu (I)
Turu (2000) 600 
Orya-Mawes-Tor-Kwerba 
Uria (I) 
Uria 1,100 
Mawes (I) 
Mawes 690 
Tor 
Manemgi (2000) 40 
Kwesten (1987) 2,000 
Itik (2000) 80 
Bonerif 100?
Wanes (1993) 200 
Mander (1991) 20 
Berik (1994) 1,200 
Kwerba 
Kwerba 1,500 


Bedoanas 250 
Erokwanus 250 
Central Maluku 
East Central Maluku 
East Seram 
Masiwang c. 1,000 
Seti 5,000 
Seram “Laut” 
Geser-Goram c. 36,500 
Watubela 4,000 
Banda (I) 
Banda (1987) 3,000 
Teor-Kur 
Teor-Kur c. 3,100 
South-east Maluku 
Aru-Kei-Tanimbar 
Aru 
Ujir 680 
Kola 6,800 
Dobel 5,200 
Barakai 2,300 
Wokam-Tarangan 24,500 
Kei-Fordata 
Kei-Fordata(Fordata) (2000) 85,000 
(Kei), 50,000 
Micronesian 
Trukic 
Mapia extinct 


Sasawa 190 
Airoran 350 
Nimboran 
Nimboran 
Mekwei-Gresi-Kwansu 4,050 
Kemtuk 2,500
Nimboran 3,500
Skou 
Vanimo 
Skou 350 
Sangke 250 
Wutung 410 
Vanimo c. 2,200 
Kwomtari 
Pyu (I)
Pyu 100 
Sepik 
Biksi (I) 
Biksi (Yetfa) (1992, 1996) 1,200 
Abau (I) 
Abau (2000) 7,270 
South-central Papuan 
Morehead and Upper Maro Rivers 
Tonda 
Tonda 600 
Kanum 320 
Yey 
Yey 1,000 
Yelmek-Maklew 
Maklew 120 
Yelmek 400 
Taurap (I) 
Taurap 500 
Pauwi (I)
Pauwi 100 
Massep (I) 
Massep (2000) 25 
Yuri (I) 
Yuri 740 
Austronesian Languages 
Yotafa-Sarmi 
Sarmi 
Sobei 1,850 
Bongo 430 
Tarpia 560 
Yotafa 
Yotafa 2,460 
Ormu 750 
Kayupulau 570 
South Halmahera-North-west New Guinea 
Cenderawasih Bay 
Warenbori 300 
Kurudu 1,100 
Waropen 6,000 
Mor 700 
Iresim 100
Yaur 350 
Yeretuar 250 
Tandia 350 
Nabi 500 
Irahutu (1987) 4,000
Wandamen 4,000 
Ron (1993) 1,100 
Meoswar (1993) 250 
Biak (2000) 30,000 
Ansus (1987) 4,600 
Woi 1,300 
Pom (1987) 2,000 
Aibondeni 150 
Marau (1987) 1,700 
Munggui 800 
Papuma 600 
Busami 600 
Serui-Laut (1987) 1,200 
Wadapi-Laut 150 
Ambai 7,100 
Wabo 900 
Raja Ampat
Maya (2001) 4,000 
Maden (2001) 600 
Palamul (2001) 600 
Matbat (2001) 1,250 
Laganyan (2000) 250 
Kawe (2001) 600 
Bomberai 
Kaiwai 600 
Uruangnirin 250 
Onin 600 
Sekar 450 
Arguni 200 


4.5.14 Areas in Australia in which 
Aboriginal languages are still viable 


Map 36 shows those areas of Australia in which 
Australian Aboriginal languages are fully, or to some 
limited extent, viable today, having at least thirty or so 
speakers each. The areas indicated on the map are not 
necessarily occupied by speakers of the languages 
originally located in them. There has been strong 
expansion of the areas of major languages into other 
areas where the local languages have become extinct, 
and the lingua franca Pitjantjatjara is still used in 
areas extending far beyond the outer limits of regions 
where original languages still survive. 


4.5.15 Australia 


The language picture on the map of Australia reflects 
the situation around the time of the arrival of the 
Europeans. Aboriginal languages have since disap- 
peared from many areas, and are on the verge of 
immediate extinction in many more. In parts of
northern, western and central Australia, Aboriginal 
languages are still viable, and these areas are indicated 
on Map 36. The numbers of speakers of the indi- 
vidual languages, as shown in Table 4.10, give a clear 
indication of the state of Australian languages. 

The Australian Aboriginal languages shown on 
Map 35, with the exception of those of the Djingili- 
Wambayan, Garawan and Pama-Nyungan groups, are 
prefixing languages in which the subject and object 
person markers are prefixed to the verb. The great 
majority of the prefixing languages have a two-to-multiple
noun-classifying system with often elaborate concordance.
In the non-prefixing languages, the subject
and object person is marked by suffixes, though some 
languages lack person suffixes and mark the person 
only by personal pronouns. However, person suffixes 
are, in a large number of languages, not suffixed to 
the verb itself, but either to the first word of an utter- 
ance, or to auxiliaries accompanying the main verb, 
or to particles which gravitate towards the beginning 
of a sentence. This is known as affix-transferring. 
Suffixing languages usually lack class systems with 
concordance; only a few, such as those of the Djingili- 
Wambayan group and of the Warlu-Warric group 
of the Pama-Nyungan languages, have classifying 
systems. An ergative suffix appears with nouns in 
many languages. In the Tangic group (non-Pama- 
Nyungan), tense is indicated in non-verbal elements 
in a sentence, such as the direct object. The phono- 
logical structure of most Australian languages is fairly 
similar. Stop consonants have four to seven points 
of articulation and their number in a language is 
matched by the number of points of articulation with 
nasals and, for the tongue-tip points, with laterals.
Fricatives are usually absent, as is a voiced-voiceless
contrast in stops in many languages. There are usually 
two ‘r’ sounds. Very many languages have only three 
vowels. Some languages are tonal. 

The phonologies of languages in parts of the 
Cape York Peninsula are very different from the gen- 
eral Australian pattern, but have been found to be 
derived from the latter through extensive sound 
changes. The Kala Lagaw Ya group has an un- 
Australian phonology closely similar to that of neigh- 
bouring Papuan languages, and has strong Papuan 
elements, especially in its vocabulary, from the neigh- 
bouring Eastern Trans-Fly languages of the Papuan 
Trans-New Guinea family. 


Table 4.10 Map 35: Australia — Languages and 
numbers of speakers 


Tiwi 
Tiwi (1983) 1,500 
Laragiyan 
Laragiya 
Laragiya (1983) 6 
Wulna 
Wulna (1981) 1 
Gungaragany 
Gungaragany extinct 
Limilngan 
Limilngan (1981) 3 
Umbugarla-Ngurmbur 
Umbugarla (1981) 3 
Ngunbudj extinct? 
Ngurmbur (1981) 1 
Bugunidja extinct? 
Ngarduk extinct? 
Gagudju 
Gagudju (1981) 6 
Giimbiyu 
Mangerri 
Mangerri (1981) 1 
Urninganggic 
Erre (1981) 1 
Urningangg (1983) 10 
Iwaidjan 
Garig 
Garig (2003) 4 
Margu 
Margu (2000) 1 
Iwaidjic 
Iwaidja (Wargib) 
Iwaidja 380
Mawng 
Mawng 200 
Amurag 
Amurag (1983) may be extinct 
Anson Bay 
Batyamal 20 
Pungu-Pungu 10? 
Western Daly 
Emmi 30 
Manhthe 25? 
Maranunggu (1983) 15-20 
Marri Ammu 10? 
Marrithiyel (1983) 25
Marrityabin 50? 
Madige 30? 
Marridan (1981) 20
Marrimanindji (1983) 15
Marringarr (1983) 30-40 
Northern Daly 
Tyeraity (1983) 10
Malakmalak 10 
Eastern Daly 
Yunggor (1983) may be extinct 
Matngele (1983) 15-20 
Kamu (1967) 2 
Southern Daly 
Ngenkikurrunggur 240 
Ngenkityemeri 150 
Murrinhpatha (1990) 900 
Mindi 
Djamindjungan 
Djamindjung (1990) 30 
Nungali (1981) 2 
Ngaliwuru 100? 
Djingili-Wambayan 
Djingili 
Djingili (1997) 10 


Wambayan 


Binbinga 
Gudandji 
Ngarndji 
Wambaya 


Wardaman 
Wagiman 
Wagiman 
Nolgin 
Dagoman 
Yangman 
Wardaman 
Wardaman 
Mangarayi 
Mangarayi 
Gunwinyguan 
Gunwinygic 
Kunbarlang 
Gunwinygu 
Dalabon 
Jawony 
Jawony 
Waray 
Waray 
Rembarrnga 
Rembarrnga 
Ngandi 
Ngandi 
Ngalakan 
Ngalakan 
Nunggubuyu 
Nunggubuyu 
Maningrida 
Djeebbana 
Nakkara 
Burarra 
Gunardba 
Guragone 
Enindhilyagwa 
Enindhilyagwa 
Maran 
Mara 
Warndarang 


(inc. with Wambaya) 


(1981) 3 
322 
(1981) 12 


(2000) 10 


extinct 
extinct 


(1983) 50 
(1983) 50 
(1983) 50-100 
(2000) 1,510 
202 

100 

(1981) 4 

150 

extinct 
(1981) 10 
300 
(undated) 100 


(1983) 75-100 
(1990) 400-600 


(1990) Burarra, or may be extinct 


Warndarang 


Gariyimar 
Yugul 
Mara 
Alawa 
Alawa 
Garawan 
Garawa 
Garawa 
Waanyi 
Waanyi 
Jarrakan 
Miriwungic 


Gadjerawang 


Miriwung 
Kitjic 
Guluwarin 
Kitja 
Northern Kimberley 
Wunambalic 
Wunambal 
Gambera 
Miwa 
Wilawila 
Ungarinjinic 
Wolyamidi 
Ungarinjin 
Guwidj 
Wororic 
Worora 
Unggumi 
Bunaban 
Bunaba 
Guniyan 
Nyulnyulan 
Djawi 
Bardi 
Nimanbur 
Nyulnyul 
Dyaberdyaber 
Dyuezun 
Yawuru 
Warwa 
Nyigina 
Tangic 
Nyangga 
Yugulda 
Lardil 


(1990) 20 


(1983) 1,000 


extinct 


(1990) 200 


102 


(1981) 3 
(1990) 10-20 


2 
(1983) 100 


(1990) 20 
(1981) 6 
(1981) 4 
(1981) 2 


80? 


(1990) 20 
(inc. with Worora) 


100-— 
100 


(1981) 1 
(1999) 20 
(1981) 2 
(2001) 1 
(1981) 3 
222 
(2001) 30 
(2001) 2 
(1981) 50 


(1981) 1 
52 
(2000) 2 


Gayardilt 
Mingin 
Nguburindi 


Pama-Nyungan 


Yolngu 
Djinang 
Dhaangu 
Dhuwal 
Dhuwala 
Dhuwaya 
Djinba 
Ritharrngu 
Dhay'yi 
Marngu 
Karajarri 
Mangarla 
Nyangumarta 
Ngumpin-Yapa 
Ngumpin 
Juwarliny 
Walmajarri 
Jaru 
Gurindji 
Mudburra 
Ngarinyman 
Yapa 
Warlmanpa 
Warlpiri 


(Kartangaruru) 


Ngarti 
Ngayarta 

Ngarla 

Nyamal 

Panyjima 

Jurruru 


Yinhawangka 
Ngarlawangka 


Kariyarra 
Ngarluma 


Martuthunira


Nhuwala 
Yindjibarndi 
Kurrama 
Balygu 
Nyiyabali 
Kanyara-Mantharda 
Mantharda 
Warriyangka 
Thiin 
Jiwarli 
Tharrgari 
Kanyara 
Pinikura 
Thalanyji 
Payungu 
Burduna 
Kardu 
Yingkarta 
Maya 
Malgana 
Pulinya 
Wajarri 
Wirtimay 
Patimaya 
Nhanda 
Nhanda 
Wati/Western Desert 
Gugadja 
Wangkajunga 
Pintupi 
Yumu 
Kukatja 
Pitjantjatjara 
Yankuntjatjara 
Yulparija 
Nyangatjarra 
Yiltawangka 
Warnman 
Manjiljarra 
Kartutjarra 
Putitjarra 
Kiyatjarra 
Ngaatatjarra 
Nhanyatjarra 
Pitungu 
Ngaanyatjarra 
Tjalkatjarra 
Kuwarra 
Nyanganyatjarra 
Mantjintja 
Nakaku 


(2000) 6 


extinct 
2 


(1982) 250 
350 
(1983) 500 
2 
200? 
90— 
(1983) 300 
(1983) 200 


202 
202 
520 


nearly extinct 
1,000 

250 

250 

(1983) 50 
170 


(1981) 50 
3,000 


extinct 
2 


(1991) 8 (+ 10 partial speakers) 


(1991) 20-30 
(1972) 50 
extinct 

2 

2 

extinct 
(1970) 70 
(1981) 5 
(1981) 10 
(1990) 500-600 
(1981) 50 

8 


extinct 
(1981) 1 
extinct 
(1981) 6 


(1981) 5 
(1981) 20 
(1981) 6 
(1981) 3 


(1981) 5 
2 

extinct 
(2005) extinct? 
(1981) 50 
2 

(1966) 20 


(1995) nearly extinct 


(1983) 300 

20 

(1983) 800 

2 

2 

(1995) 2,500 

(1985) 200-300 

2002 

2 

2 

(1973) 20 

(inc. with Martu Wangka)
(inc. with Martu Wangka)
(inc. with Martu Wangka)
(inc. with Martu Wangka)
(1995) 1,200


Cr Cty Cr CTS] 


(1995) 1,200 
2 
2 
Wirrtjarakantja
Antakarrinya 
Kukata 
Ngaliya 
Pintini 
Murunitja 
Wangkayi 
Ngurlu 
Martu Wangka 
Nyungar 
Nankarti 
Watjanmay 
Tjapanmay 
Karlamay 
Pipelman 
Kwetjman 
Mirnong 
Kaniyang 
Mirniny 
Kalaaku 
Ngadyumaya 
Wudjari 
Mirniny 
Thura-Yura
Kuyani 
Barngarla 
Nawo 
Nukunu 
Kaurna 
Narangga 
Adnyamathanha 
Ngadjuri 
Wirangu 
Warumungu 
Warumungu 
Arandic 
Kaytetye 
Kaytetye 
Aranda 
Anmatyerr 
Alyawerr 
Ayerrereng 
Antekerrepenh 
Western Arrernte 
Eastern Arrernte 
Lower Arrernte 
Warluwarric 
Wakaya 
Warluwarra-Bularnu 
Yanyuwa 
Yinjilanji 
Kalkatungic 
Wakapunga 
Kalkatungu
Yalarnnga 
Galimbamu 
Galimbamu 
Paman 
Northern Paman 
Yumukuntyi 
Yaraytyana 
Atampaya 
Angkamuthi 
Utyangikwathiya 
Mutyanthi 
Uradhi 
Wudhadhi 
Tjungundji 
Mpalityanh 
Yupngayth 
Alngith 
Luthigh 
Linngithigh 
Ngkoth 
Yinwum 
Aritinngithigh 
Ndra'ngith 
Mbiywom 
Middle Paman 
Wik-Ayangenycha 
Wik-Ep 
Wik-Me'anh 
Wik-Ompom 
Wik-Paach 
Wik-Ngathan 
Wik-Ngatharra 
Wik-Mungkan 
Kugu Nganhcara 
Kugu-Ugbanh 
Kugu-Muminh 
Kugu-Uwanh 


2 


(1981) 50 
(1981) 3 

2 

(1985) 200-300 
?

2 

2 

(1991) 720 


extinct 
extinct 
extinct 
extinct 
extinct 
extinct 
extinct 
extinct 


extinct 
(1981) 10 
122 
30+ 


extinct 
extinct 
extinct 
extinct 
extinct 
extinct 
302 


(1981) 2 


200 


(1983) 200 


(1983) 800 

(1991) 1,500 
?

(inc. with E. Arrernte) 
(1981) 1,000 


(2000) 2,1280 
?


(1983) 10 

(1981) 3 
100- 

(1981) 1 


2 
extinct 
extinct 


(1981) 3 
2 


extinct 


RN NN 


extinct? 
(1981) 3 
(1981) 12 
extinct? 
extinct? 
130 

86 

(1990) 400 


(1981) 6 
7 
(1981) 40 


Kugu-Mu'inh 
Kugu-Mangk 
Wik-Iyanh
Middle Paman 
Wik-Keyangan 
Pakanh 
Ayapathu 
Kuuku Ya'u 
Kaantju 
Umpila 
South-western Paman 
Kuuk Yak 
Kuuk Thaayorre 
Yir-Thangedl 
Yir Yoront 
Koko-Bera 
Kwanthar 
Kok-Nar 
Uw Oykangand 
Olkola 
Walangama 
Kurrtjar 
Areba 
Kuthant 
Kukatj 
Rarmul Paman 
Thaypan 
Argu-Tharnggal 
Lamalamic 
Umbindhamu 
Umbuygamu 
Mbariman-Gudhinma 
Lama-Lama 
Gugu Warra 
Southern Paman
Agwamin 
Mbabaram 
Ngaygungu 
Dagalag 
Wamin 
Mbara 
Flinders Island 
Flinders Island 
Barrow Point 
Barrow Point 
Guugu Yimidhirr 
Guugu Yimidhirr 
Yalanjic 
Kuku Yalanji 
Kuku Jangkun 
Muluriji 
Yidinyic 
Dyabugay 
Yidiny 
Kala Lagaw Ya 
Kala Lagaw Ya 
Dyirbal 
Dyirbal 
Warrgamay 
Warrgamay 
Nyawaygi 
Nyawaygi 
Wulgur 
Wulgur 
Bindal 
Bindal 
Yuru 
Yuru 
Maric 
Gugu Badhun 
Warrungu 
Gudjal 
Yilba 
Biri 
Miyan 
Giya 
Yiningayi 
Yangga 
Wiri 
Yuwibara 
Baradha 
Yambina 
Yagalingu 
Wangan 
Barnya 
Gabalbara 
Gayiri 
Gangulu 
Wadjigu 
Wadjabangayi 
Garingbal 


(1981) 7 
(1981) 1 
(1981) 40 


(1981) 3 
(1981) 10 
(1981) 6 
(1981) 100 
50? 

100? 


extinct 


(1981) 2 
(1981) 3 


(1981) 2 


(1981) 6
(1981) 7
(1981) 3
(1981) 1
extinct


(1981) 1 
extinct 
extinct? 
extinct? 

(1981) 1 
extinct 


(1981) 3 


(1981) 1 


(1991) 200+? 


(2000) 700 
2 


(1981) 1 


(1981) 3 
(1981) 12 


(1990) 3,000—-4,000 


(1983) 40-50 
(1981) 3 
(1981) 3 

(1981) extinct? 

(1981) extinct? 

(1981) extinct? 
(1981) 2 
(1981) 2 


extinct? 
2 


(2005) extinct 


CO Io ia Ro oe OO) 


extinct? 
extinct? 
extinct 
(1981) 1 
extinct? 
extinct? 


Bidyara 
Wadjalang 
Nguri 
Gungabula 
Yiman 
Margany 
Gunya 
Mandandanyi 
Guwamu 
Gunggari 


Guwa 


Guwa 


Yanda 


Yanda 


Yirandhali 


Yirandhali 


Kulumali 


Kulumali 


Kungatutji 


Kungatutji 


Birria 


Birria 


Mayi 


Mayi-Kutana 
Mayi-Kulan 
Mayi-Yapi 
Ngawun 
Mayi-Thakurti 
Wunumara 


Karnic 


North-western 


(1981) 20 
extinct? 

2 

(1981) 2 

extinct? 
(1981) 1
(1981) 3
(1981) 1
(1981) 1
(2005) 1022


1¢2 
extinct? 
extinct? 
extinct 
extinct 
(2005) may be extinct 
(undated) 2 
extinct 
(inc. with Mayi-Kulan) 
(1981) 1 


(inc. with Mayi-Kulan) 
(inc. with Mayi-Kulan) 


Wangkamadla (1981) 1 
Rangwa (1981) extinct? 
Pitta-Pitta (1981) 2 
Wangka-Yutjurru extinct 
Ringu-Ringu extinct 
Rakaya (2005) extinct? 
Ngulupulu extinct? 
Karanya (2005) extinct? 
Yurlayurlanya (2005) extinct? 
Kunkalanya (2005) extinct? 
Lhanima (2005) extinct? 
Mayawarli 2 
South-western 
Wangkangurru (1981) 8 
Arabana (1981) 8 
Central 
Mithaka (2005) extinct? 
Karruwali (2005) extinct? 
Marrulha (2005) extinct? 
Yarluyandi (2005) extinct? 
Ngamini (1981) 2 
Yawarrawarrka 122 
Yandruwandha (1981) 2 
Ngurawarla (2005) extinct? 
Diyari 122 
Dhirrari 1 
Pirlatapa extinct 
Eastern 
Karendala (2005) extinct? 
Yarumarra (2005) extinct? 
Wangkumara (2005) 42? 
Punthamara (2005) 122 
Tereila (2005) extinct? 
Garlali (2005) 12? 
Mambangura (2005) extinct? 
Badjiri (2005) 222 
Bitjara (1981) 20 
(Mingabari) (2005) extinct? 
Yarli 
Wadikali extinct? 
Malyangapa extinct? 
Yardliyawara extinct? 
Muruwari 
Muruwari (1981) 1 
Central NSW 
Gamilaraay (1997) 3 
Wangaybuwan-Ngiyambaa (1981) 12 
Wiradhuri (1981) 3 
Koinjmal 
Koinjmal extinct 
Bayali 
Bayali extinct 
Dharambal 
Dharambal extinct? 
Waka-Kabic 
Goreng 
Goreng Goreng 1? 
Daribalang extinct? 
Kabi 
Kabi-Kabi extinct? 
Table 4.10 continued


Wakka 
Wuli-Wuli extinct 
Wakka-Wakka (1981) 3 
Gayabara (2005) extinct? 
Barunggam (2005) extinct? 
Muringam (2005) extinct? 
Yagara 
Yagara extinct 
Durubulic 
Guwar extinct 
Nunungal extinct 
Djendewal extinct 
Bandjalang 
Bandjalang (1983) 10 
Gumbaynggiric 
Gumbaynggir 
Gumbaynggir 122 
Yaygir 
Yaygir extinct 
Yuin-Kuric 
Kuri 
Yugambal extinct 
Nganyaywana extinct 
Ngagu extinct 
Ngambaa extinct 
Dyangadi (1981) 5 
Kathang extinct? 
Awabakal extinct 
Iyora
Darkinyung extinct 
Dharuk extinct 
Yuin 
Ngunawal extinct 
Ngarigu extinct 
Thawa extinct 
Dyirringany extinct? 
Dhurga extinct 
Dharawal extinct 
Gundungura extinct 
Paakantyi 
Bandjigali (1981) 1 
Baarundji (1981) 1 
Gurnu 2 
Nabarlgu extinct 
Wanyubarlgu extinct 
Wilyagali extinct 
Dhanggagarli extinct 
Southern Paakantyi 22? 
Barundji extinct 
Marawara extinct 
Lower Murray 
Ngarinyeri extinct 
Peramangk extinct 
Ngayawung extinct 
Yuyu extinct 
Keramin extinct 
Yitha-Yitha extinct 
Kulin 
Wemba Wemba extinct 
Nari Nari extinct 
Wathawurung extinct 
Kolakngat extinct 
Wuywurrung extinct 
Bungandidjan 
Bungandidj extinct 
Kuurn Kopan Noot extinct 
Yorta-Yorta 
Yorta-Yorta extinct 
Yabula-Yabula extinct 
Pallanganmiddang 
Pallanganmiddang extinct 
Dhuduroa 
Dhuduroa extinct 
Ganay 
Muk-Thang extinct 
Tasmanian Languages 
Tasmanian Languages extinct 


4.5.16 Moluccas (Maluku) 


The Austronesian languages on Map 37 belong in
part to Central Austronesian, which is a group transitional
between Western Austronesian and Eastern
Austronesian. Those of the South Halmahera–North-West
New Guinea group lead directly over to Eastern
Austronesian, showing characteristics in their
possessive systems reminiscent of Eastern Austronesian,
such as the subdivision of nouns into a class
of nouns which are intimately possessed and another
class which are not. The languages of the West
Papuan phylum shown have subject and object prefixes
with a two-gender system, number marking with
some nouns and a comparatively simple verb morphology.
The word order in them is Subject-Verb-Object,
by contrast with the order SOV, which is
the norm in the languages of all other Papuan groups. 


Table 4.11 Map 37: The Moluccas — Languages and 
numbers of speakers 


Austronesian Languages 


Central Maluku 
East Central Maluku 
Seram Subgroup
Nunusaku 
Piru Bay 
Western Piru Bay 
Boano 3,240 
Manipa (1983) 1,500 
Batumerah extinct 
Larike-Wakasihu 
(1987) 12,560 
Asilulu 8,760 
Luhu 6,500 
Eastern Piru Bay 
Hitu 15,970 
Kaibobo (1983) 500 
Kamarian (1987) 10 
Haruku 18,220 
Saparua 10,220 
Nusalaut (1989) 10 
Amahai (1987) 50 
Paulohi (1982) 50 
Sepa-Teluti 19,600 
Three Rivers 
Alune (2000) 17,240 
Naka'ela (1985) 5 
Wemale 8,660 
Atamanu (2004) 1,700 
Lisabata-Noniali 1,830 
Hulung (1991) 10 
Loun (undated) 20 
Saleman (1989) 4,800 
Central Seram 
Nuaulu (1995) 1,500 
Manusela 7,000 
Kayeli (I) 
Kayeli (1995) 3 
East Seram 
Bobot 4,500 
Masiwang (1989) 1,000 
Seti (1989) 3,000 
Banda Subgroup 
Seran Laut 
Geser-Goram (1989) 36,500 
Watubela 4,000 


Teor-Kur (I) 


Teor-Kur Teor (1986) 1,100, Kur (2000) 3,180 
West Central Maluku 
Buru-Sula-Taliabo 
Buru 32,980 
Sula 20,000 
Taliabo (2000) 4,520 
Ambelau (I) 
Ambelau 5,700 
South Halmahera—North-west New Guinea 
South Halmahera 
East Makian 20,000 
Giman 4,000 
Weda 3,000 
Sawai 12,000 
Patani (2000) 10,580 
Maba (2000) 6,620 
Buli (2000) 2,520 
Geelvink Bay 
Biak (2000) 30,000 
Raja Ampat 
Maya (2001) 4,000 
Maden (2001) 600 
Palamul inc. with Maden 
Matbat (2001) 1,250 
Laganyan (2000) 250 
Kawe (2001) 600 
Gebe (2000) 2,650 
Bomberai 
Onin 600 


Malayan 
Malay 


Malay (Malayu Ambon) (1987) 200,000 


Papuan Languages 
Extended West Papuan 
West Papuan 
West Bird's Head 
Seget (I) 
Seget (1988) 1,200 
Moi (I)
Moi 4,000 
Northern Halmahera 
West Makian (I) 


West Makian (1977) 12,000 
Halmahera 

Tidore 26,000 
Ternate 42,000 
Sahu 7,500 
Tobaru 15,000 
Loloda 15,000 
Galela 79,000 
Tobelo (2000) 27,720 
Modole 2,000 
Pagu 3,000 

Other Languages 

Sama Group 

Bajau (1982) 5,000+ 


Immigrant Languages (Sulawesi, Halmahera) 


4.5.17 Lesser Sunda Islands and Timor 


Most of the Austronesian languages shown on Map 
38 are Central Austronesian, except for the Bali-Sasak 
group languages, which are Western Austronesian. In 
the Bali-Sasak languages, a high and a low language 
are distinguished, which are remarkably different in 
vocabulary. The high language is used by persons of 
low social rank when speaking to persons of high 
social rank, and the low language by persons of 
high rank when speaking to persons of lower rank. 
Structurally the Austronesian languages on this map 
are comparatively simple with a moderate
development of prefixes and suffixes. The Central
Austronesian languages are a group transitional between
Western Austronesian and Eastern Austronesian. The
Trans-New Guinea family languages shown have a
noun-classification system through classificatory
verbs, sentence medial verbal forms, no number 
marking with nouns and a fairly developed verb 
morphology. They show West Papuan family 
influence, which in Oirata is so strong that West- 
Papuan-type subject prefixes appear. 


Table 4.12 Map 38: Lesser Sunda Islands and Timor — 
Languages and numbers of speakers 


Austronesian Languages 


Bali-Sasak 
Balinese (2001) 3,900,000 
Sasak (1989) 2,100,000 
Sumbawa (1989) 300,000 
Bima-Sumba 

Bima 500,000 
Kodi 40,000 
Weyewa 75,000 
Lambaya 15,000 
Wanukaka 10,000 
Anakalanga 14,000 
Mamboru 16,000 
East Sumbanese 125,000 
Savu (1981) 90-95,000 
Ndao 3,500 
Manggarai 500,000 
Rongga (2000) 2,120 
Waerana (2000) 4,240 
Manus? ? 
Kepo' (2000) 10,610 
Rajong (2000) 4,240 
So'a (1994) 10,000 
Rembong (2000) 2,120 
Nage-Keo Keo (2001) 40,000, Nage (1993) 50,000
Komodo (2000) 700 
Riung 14,000
Ngada 70,000 
eer 120,000
Palu'e 3,000
Sama
Sama Bajau (1981) several thousand
Malayan
Malay
Bahasa Malayu
Larantuka
basa Kapeme (1997) 200,000
Timor Area
Flores-Lembata
Sikka 180,000
Lamaholot 175,000
Kedang 25,000
Helong (I)
Helong 5,000
Timor and Islands
fou epee
Uab Meto (1997) 590,000
Tetun (2004) 450,000
Kemak (1981) (W. Timor 50,000) 100,000
Tukudede (2000) 63,170
Mambai 80,000
Galoli (1981) 50,000
me : . pees
akalei ;
Wetar 3,160
a 2,360
Kisar (1995) 20,000
Letri Lgona 24,040
come (1981) 50
Dai 810
Dawera-Daweloor 1,860
eas ee oe
Teun 1,000
Nile-Serua 3,500
Ebi (2000) 1,260
Waima'a
Waimia
usieiat (2001) 3,000
Kairui-Midiki (2001) 2,000
Naueti (1981) 1,000
Seuss iets
Ase tana tee
Kei-Fordata (I)
Kei-Fordata 88,100
Vindenadl
alana 28,550
Salwasa (I)
Sdinaes 2,840
roe
eatery 8.270

Papuan Languages
Trans-New Guinea
West Trans-New Guinea Linkage
East Timor
Oirata (I)
Oirata 1,220
Fataluku (I)
Fataluku 30,000
Bunak (I)
Bunak (1977) 50,000
Kolana (I)
Kolana 50,000
Tanglapui (I)
Tanglapui (1997) Kula 5,000, Sawila 3,000
West Timor-Alor-Pantar
Makasai
Makasai (1989) 70,000
Makalero? prob. inc. with Makasai
Alor
Kui (2000) 4,240
Kamang (2000) 16,520
Kiramang inc. with Kui
Abui (1981) 16,000
Kabola (1995) 3,500
Kelon (1997) 6,000
Kafoa (1981) 1,000
Pantar
Blagar (1981) 11,000
Nedebang (2000) 1,380
Teiwa (1981) 5,000
Lamma (undated) 10,000


4.5.18 Philippines: the numerically most
important languages


Map 40 (inset) shows the ten Philippine languages
with the greatest number of speakers. The figures for
these languages in Table 4.13 are the numbers of
speakers according to early to mid-1980s estimates.
Speakers of these ten languages number over
50,000,000, out of a population of 72,944,000 (1998
UN).


4.5.19 Philippines


The Austronesian languages on Map 40 are Western
Austronesian of the so-called Philippine type,
characterized by a proliferation of particles in addition to
prefixes and suffixes and by a very high development
of a focusing system which indicates, through
morphological and syntactic means, the focus on various
parts of an utterance.


Table 4.13 Map 40: The Philippines – Languages and
numbers of speakers


Austronesian Languages


Batanic
Itbayaten (2000) 1,450
Ivatan (1998) 35,000
Babuyan (1996) 1,000
Bia
Cordilleran
Dumagat
Northern Dumagat
East Cagayan Negrito (Dupaninan Agta) (1986) 1,200
Ratahial een ce
Kasiguranin (1975) 10,000
Casiguran Dumagat (2000) 610
Southern Dumagat
Umiray Dumagat 2,690
Northern Cordilleran
Isnag (1994) 30,000
Adasen (undated) 4,000
Melewen ea aion
Ibanag (1990) 500,000
Pamplona Atta (1998) 1,000
Pudtol Atta (2000) 710
Faire Atta (2000) 300
Central Cagayan Agta (2000) 780
Itawis (1990) 119,580
Ga-Dang (2002) 6,000
Gaddang (1984) 30,000
Yogad (1990) 16,040
Arta (I)
Arta (2000) 15
Ilokano
Ilokano c. 8,000,000
Alta
Northern Alta (2000) 200
Southern Alta (1982) 1,000
Central Cordilleran
Kalinga c. 86,000
Itneg 50,400
Balangaw (2000) 21,270
Bontok 65,240
Kankanaey 154,040
Ifugao (1990) 167,500
Isinai 5,660
Southern Cordilleran
Pangasinan c. 2,000,000
Inibaloi 87,750
I-Wak (1987) 2,000-3,000
Kallahan (Kayapa) (1991) 15,000
Keley-I (Kallahan) (2000) 8,000
Karaw (1998) 1,400
Ilongot 6,120
Sambalic
Sambal
Bolinao (1990) 50,000
Tina Sambal (2000) 70,000
Botolan (2000) 32,870
Ayta
Abenlen Ayta (1985) 6,850
Mag-Anchi Ayta (1992) 8,200
Mag-Indi Ayta (1998) 5,000
Bataan Ayta (2000) 500
Ambala Ayta (1986) 1,660
Kapampangan (I)
Kapampangan (1990) 1,900,380
Sinauna (I)
Sinauna (2000) 2,530
Meso-Philippine
North Mangyan
Iraya c. 10,000
Mangan eau
JauyaWan cree
South Mangyan
Batangan c. 11,000
Buhid c. 8,000
Hanunoo c. 11,000
Palawan
North Palawan
Kalamianon c. 5,000
eileen 0/000
South Palawan
Batak 1,400
Aborlan-Tagbanwa c. 8,000
Baleavene c. 20,000
Molbog 5,640
Central Philippine
Tagalog
ee me alee
Standard Bikol c. 2,500,000
Albay Bikol c. 480,000
Rinconada Bikol c. 180,000
Northern Catanduanes Bikol c. 65,000
North Bisayan
West Bisayan
Kuyonon (1990) 123,380
West Binisaya?
Aklanon (1990)
Kinaray-A (1994) 377,530
Central Bisayan
Hiligaynon c. 6,000,000
Samar-Leyte c. 3,000,000
Banton c. 65,000
Romblon c. 200,000
Masbateño 322,830
Sorsogon (Masbate) 85,000
Sorsogon (Waray) 185,000
Capiznon (2000) 640,650
Porohanon (undated) 23,000
Bantayanon (Bantoanon) (2002) 200,000
South Bisayan
Cebuano c. 12,000,000
Butuanon (1990) 34,550
Surigaonon (1990) 344,970
Tausug c. 492,000
East Mindanao
Mamanwa 1,420
Kamayo (2000) 7,570
Davaweno 124,490
Mandaya 28,240
Kalagan c. 60,000
Tagakaulu 37,830
South Philippine
Subanon
Northern Subanon (1985) 10,000
Central Subanon (2000) 140,010
Western Subanon (1997) 75,000
Kolibugan Subanon (1998) 20,000
Lapuyan Subanon (1978) 25,000
Danao
Maranao 602,610
Magindanao c. 1,000,000
Ilanum 7,080
Manobo
North Manobo
Kinamigin c. 60,000
Binukid 75,530
Kagayanen (2000) 25,000
Central Manobo
Higaonon (1996) 30,000
Ata Manobo (2000) 26,650
Agusan Manobo (2002) 60,000
Rajah Kabungsuan Manobo 47,820
Tigwa Manobo 11,510
West Bukidnon Manobo 5,620
Ilianen Manobo (2000) 14,610
Obo Manobo 10,610
Dibabawon (1978) 10,000
South Manobo
Tagabawa (1998) 43,000
Sarangani Manobo (1987) 35,000
Cotabato Manobo (2002) 30,000
Table 4.13 continued


Sama
Sama
Yakan (1990) 105,550
Jama Mapun (2000) 40,590
Sama Sibuguey? ?
Northern Sama (2002) 10,000
Western Sama c. 20,000
Central Sama (2000) 90,030
Southern Sama (2000) 120,000
Abaknon (I)
Abaknon (2000) 21,400
South Mindanao
Blaan-Tboli
Koronadal Blaan (1981) 100,000
Tboli (2000) 95,320
Sarangani Blaan (2000) 90,750
Tiruray (I)
Tiruray 34,340
Bagobo (I)
Bagobo 35,150

Spanish-based Creole Language
Chavacano (1990) 292,630

Immigrant Labour from Indonesia
Sangil (1996) 15,000
Sangir (1981) 55,000
Tausug as lingua franca ?


4.5.20 Sulawesi


The languages of the Sangir-Minahasan supergroup
(Map 39) and of the Mongondow-Gorontalo supergroup
are Western Austronesian languages of the so-called
Philippine type, characterized by large numbers of
particles as well as prefixes and suffixes, and by a very
highly developed focusing system to indicate, by
morphological and syntactic means, the focus on various
parts of an utterance. The languages of the other groups
in Celebes show this feature to a lesser degree and
lead over to a simpler type of Western Austronesian.


Table 4.14 Map 39: Sulawesi – Languages and
numbers of speakers


Sangir-Minahasan
Sangiric
Talaud 60,000
Sangir (inc. north Maluku) 200,000
Bantik 11,000
Ratahan 30,000
Minahasan
Tonsea 90,000
Tondano 92,000
Tombulu 60,000
Tontemboan 150,000
Tonsawang 20,000
Mongondow-Gorontalo
Mongondow
Ponosakan (1981) 3,000
Bola'ang Mongondow (1989) 900,000
Gorontalo

Bintauna (1981) 6,000 
Kaidipang (1981) 22,000 
Bolango (1981) 15,000 
Atinggola (1981) 5,000 
Suwawa (1981) 10,000 
Gorontalo (1989) 900,000 
Buol (1989) 75,000 
Central Sulawesi 
West-central 
Tomini 
Tolitoli (Totoli) 
Tomini (2001) 30,000 
Bolano (Boano) (2001) 2,700 
Dondo (2001) 13,000 
Dampelasa (Dampelas) (2001) 10,300 
Kasimbar (Tajio) (2001) 12,000 
Petapar (Taje) (2001) 350 
Ndau (Pendau) (2001) 3,200 
Balaesan (I) 
Balaesan (Balaesang) (2001) 3,200 
Kaili-Pamona 
Kaili (1979) Ledo 233,500, Unde 20,000 
Pipikoro (Uma) (1990) 20,000 
Lindu (1990) 2,000 
Sedoa (1979) 600 


Napu (1995) 6,000
Bada (1991) 10,000
Rampi (1991) 8,000
Pamona (1979) 106,000
Bungku-Mori
Bungku 16,500
Mori 12,000
Lalaki 231,000
Mekongka 50,000
Eastern
Saluan (1979, 2000) 76,140
Andio (1991) 1,700
Balantak (2000) 30,000
Banggai (I)
Banggai (1995) 100,000
South Sulawesi
Makassarese 1,600,000
Bugis 3,500,000
Mandar 200,000
Sa'dan 500,000
Massenrempulu 202,000
Pitu Ulunna Salo 22,000
Mamuju 60,000
Seko 5,000
Muna-Buton
Muna 227,000
Wolio 35,000
South Buton 15,000
Buton 80,000
Tukang-Besi 70,000
Bonerate 9,500
Kalaotoa 500
Layolo (1997) 800
Sama
Bajo (Indonesian Bajau) (1979, 1983) 33-35,000
Malayan
Malay
Manado Malay (2001) 850,000
Wotu
Wotu 5,000


4.5.21 Borneo


All the languages on Map 41, except for those in the
north-eastern extension of Borneo, belong to a
grammatically comparatively simple type of Western
Austronesian with a moderately developed range of
prefixes and suffixes and simple indication of
possessives. The languages in the north-eastern peninsular
extension of Borneo, as well as the Bajau language,
belong to the so-called Philippine type with
proliferating particles, prefixes and suffixes, and a complex
system of focusing on parts of an utterance by
morphological and syntactic means.


Table 4.15 Map 41: Borneo and Madagascar –
Languages and numbers of speakers


Barito Supergroup (1-11)
Barito Supergroup
East Barito
North-east Barito (I)
North-east Barito 120,000
Dusun Deyah (I)
Dusun Deyah (1981) 20,000
South-east (I)
South-east (Ma'anyan) 125,000
West Barito
North-west Barito
Dohoi (2003) 25,000
Siang (1981) 60,000
South-west Barito
Kapuas (Ngaju) (2003) 800,000
Katingan (1981) 45,000
Kahayan (1981) 45,000
Bakumpai (2003) 100,000
Barito-Mahakam
Tunjung (1981) 50,000
Ampanang (1981) 30,000
Malayan
Malay
Borneo Coastal Malay c. 3,290,000
Western Coastal Malay c. 800,000
Malayic Dayak
Iban (I)
Iban (1995) 415,000
Other Malayic Dayak (I)
Other Malayic Dayak c. 700,000


Land Dayak
Silakau 100,000
Lara’ 12,000
Jagoi 19,000
Singgie 4,800
Biatah 20,100
Bukar Sadong 34,600
Bekati' (1986) 4,000
Benyadu' 45,000
Kembayan (1981) 45,000
Ribun (1981) 45,000
Djongkang (1981) 45,000
Sanggau (1991) 10,000
Semandang (1981) 30,000
Mbaloh (I)
Mbaloh 11,000
Rejang Baram
Lower Rejang
Melanau 25,320
Kanowit 170
Tanjong 100
Rejang-Bintulu
Bintulu 4,200
Lahanan 350
Kajaman-Sekapan 1,250
Bukitan 410
Beram-Tinjar
Lelak (1981) 220
Narom (1981) 2,420
Kiput (1981) 2,460
Tutong (1996) 12,000
Berawan (1981) 870
Rejang-Sajau
Punan Bah-Biau 450
Punan Merap (1981) 200
Sajau Basap (1981) 6,000
Burusu (1981) 6,000
Basap (1981) 17,000
Kayan-Kenyah
Kayan
Western Kayan
Rejang Kayan 3,030
Baram Kayan 4,150
Mendalam Kayan 1,500
Kayan Mahakam 1,300
Busang (1981) 3,000
Bahau 3,200
Kayan River Kayan (1981) 6,000
Wahau Kayan (1981) 1,000
Murik Kayan (I)
Murik Kayan 1,120
Modang
Modang 6,300
Segai 9,000
Muller-Schwaner “Punan”
Aoheng 2,630
Hovongan (1991) 1,000
Kereho-Uheng 200
Bukat 400
Punan Aput 370
Punan Merah 140
Punan-Nibong (I)
Punan-Nibong
Kenyah
Main Kenyah
Western Kenyah
Upper Baram Kenyah
Bahau River Kenyah
Kayan River Kenyah
Mahakam Kenyah
Kelinyau Kenyah
Bakung Kenyah (I)
Bakung Kenyah
Sebob Kenyah (I)
Sebob Kenyah
Tutoh Kenyah (I)
Tutoh Kenyah
Wahau Kenyah (I)
Wahau Kenyah
Apo Duat
Kelabit
Lun Daye
Sa'ban
Lengilu
Putoh
Punan Tubu (I)
Punan Tubu
North-east Borneo
Murutic
Murut
Selungai Murut


1,350 
1981 
2000) 2,860 


) 
) 
1981) 1,500 
) 
) 
) 


( 1,250 
( 
( 
(1981) 6,000 
( 
( 


1981) 7,000 
1,200 


1981 
7,330 
1,730 

600 
1,000 
2,000 
1,650 

15,700 
1,000 


(2000) 3 or 4 
6,000 


(2000) 1,010 


120 




Kolod (2000) 4,970 
Sumambu-Tagal 16,000 
Paluan (2000) 3,680 
Timugon 7,000 
Beaufort Murut 1,700
Keningau Murut (2000) 7,000 
Tidong 
Tidong (2000) 24,880 
Sembakung (2000) 9,850 
Baukan 1,800 
Kalabakan 1,500 
Serudung 1,000 
Dusunic 
Dusun 
Gana 2,000 
Kuijau 6,000 
Kadazan-Dusun 77,000 
Lotud 5,000 
Papar 800 
Rungus 14,000 
Eastern Kadazan 16,000 
Klias River Kadazan 1,000 
Kimaragang (1987) 10,000 
Garo? 5,000? 
Tebilung (1984) 2,000 
Bisayan 
Tatana 5,500 
Northern Bisaya (2000) 15,820 
Southern Bisaya (inc. with N. Bisaya) 
Tutong 25,000 
Paitanic 
Dumpas (2000) 1,080 
Lingkabau? (1991) 3,000 
Abai Sungai (2000) 400 
Tambanua (1991) 20,000 
Upper Kinabatangan (2000) 7,860 
Lobu (1986) 2,800 
Ida'an 
Ida'an 6,000 


4.5.22 Java and Bali 


The languages on Map 42 are all Western Austro- 
nesian of a comparatively simple type, with a moder- 
ately developed range of prefixes and suffixes and 
simple indication of possessives. Most contain a high 
language and a low language with markedly different 
vocabulary. The high language is used to address 
people of higher rank, and the low language is used 
by higher ranks when speaking to people of lower 
rank. The Malay subgroup languages shown are 
strongly influenced by other languages of the area. 


Table 4.16 Map 42: Java and Bali — Languages and 
numbers of speakers 


Javanese (I) 


Javanese (1989) 75,200,000 
Sundanese (I) 
Sundanese 24,000,000 
Malayan 
Madurese (I) 
Madurese (1995) 13,694,000 
Malay 
Jakarta Malay c. 6,000,000 
South-western Bali Malay 
(Bahasa Melayu Loloan) c. 10,000 
Bali-Sasak 
Balinese (2001) 3,900,000 


4.5.23 Sumatra 


The Austronesian languages of Sumatra (Map 43) are 
all Western Austronesian of a grammatically com-
paratively simple type, with a moderately developed
range of prefixes and suffixes and simple indication
of possessives. The languages of the islands near 
Sumatra in the south-west, especially Enggano, are 
highly aberrant in vocabulary. The Acehnese lan- 
guage is also very highly aberrant and shows strong 
influence from Austro-Asiatic languages. 


Table 4.17 Map 43: Sumatra — Languages and 
numbers of speakers 


Austronesian: Malayo-Polynesian 


Northern 
Simeulue (1981) 100,000 
Nias 480,000 
Mentawai (I) 
Mentawai 50,000 
Enggano (I) 
Enggano (2000) 700 
Acehnese (I) 
Acehnese 3,000,000 
Gayo (I) 
Gayo 180,000 
Batak 
Northern 
Alas-Kluet 80,000 
Karo Batak 600,000 
Southern 
Toba Batak 2,000,000 


Angkola-Mandailing (1989, 1991) 1,150,000
Simalungan (I)
Simalungan 200,000
Malay 
Minangkabau 6,500,000 
Rejang (1981) 1,000,000 
Malay 
Sumatra Malay (1981) 10,000,000 
Orang Laut ?
Lampung 
Komering 700,000 
Lampung (1981) 1,500,000 
Lom (I) 
Lom (2000) 2-10 


Java Languages 


4.5.24 West Malaysia and Singapore 


The Austronesian languages on Map 44 are Western 
Austronesian, of the Malay type: comparatively 
simple, with a moderately developed range of prefixes 
and suffixes and simple indication of possessives. The 
Austro-Asiatic languages shown are grammatically 
fairly complex, with subjects of the verb marked by 
prefixes, and inclusive and exclusive forms in the first 
person non-singular distinguished. The languages of 
the Thai-Kadai group shown are monosyllabic tonal 
languages with a grammar based essentially on the 
use of particles and on syntactic features. 


Table 4.18 Map 44: West Malaysia and Singapore — 
Languages and numbers of speakers 


Languages of Malaysia 
Austroasiatic 


Aslian
Northern Aslian
Tonga' 300 
Low Country Semang extinct 
Kensiu 3,300 
Jehai 1,375 
Batek 700 
Che' Wong 200 
Central Aslian
Lanoh 220 
Temiar 11,590 
Semai 18,330 
Jah Het 2,440
Southern Aslian
Semaq Beri 2,080 
Semelai 2,930 
Besisi 1,360 
Thai-Kadai 
Thai 
South-western 
Southern 
Pak Tay 3,050,000 
Thai-Malay 1,500,000 
Unclassified 
Kenaboi extinct 
Austronesian 
Malay 
Malay 
Para-Malay 
Minangkabau 300,000 
Urak Lawoi' 2,500 


Duano' 1,920 
Aboriginal Malay 

Temuan 9,310 

Orang Hulu 9,800 

Orang Kanaq (1981) 34 

Orang Seletar 540 
Local Malay 

Malay 7,550,000 
Introduction


The text in Section 5 covers the following languages 
and language families: 1) Sino-Tibetan (including 
Sinitic (also known as Chinese) and Tibeto-Burman); 
2) Mon-Khmer; 3) Austro-Thai (including Tai-Kadai, 
Austronesian and Miao-Yao); 4) Japanese; 5) Korean; 
6) Ainu; the languages of the putative Altaic group 
including 7) Turkic and 8) Manchu-Tungus languages 
of China (nos. 1-8 all by David Bradley) as well as 9) 
Mongolian languages (by Alan Sanders). Nos. 10-12 
(all by David Bradley) cover the remaining languages 
of the area, which include 10) two Indo-European 
languages and two Portuguese creoles and 11) four 
mixed languages, followed by 12) a brief summary of 
the language policy in the nations within this area, 
and finally references. It should be noted that this 
introduction also gives complete details of the genetic 
position of all Tibeto-Burman, Mon-Khmer and Tai- 
Kadai languages of South Asia and all Mon-Khmer 
languages of Malaysia in the Australasia and the 
Pacific section, and so the texts and maps for those 
areas should be used in conjunction with this text. 

The inclusion of the term Altaic in the text and 
on the map keys is not intended to imply recognition 
of the validity of this grouping, which, in addition 
to Turkic, Mongolic and Manchu-Tungus, has been 
claimed to include Japanese and Korean. 

This section also gives brief world-wide details 
for languages which originated in this area but which 
now have substantial migrant communities else- 
where, such as Chinese, Vietnamese, Japanese, Korean, 
Khmer and so on. 

With the text on Chinese, there are brief sum- 
maries of the linguistic situation in two highly diverse 
cities with a Chinese majority, Hong Kong and Singa- 
pore. A note on the spread of Chinese in Malaysia and 
Indonesia will be found in Section 4 (see 4.4.1.1 and 
Figure 4.5). With the text on Thai, the Tai-Kadai 
national language of Thailand, there is a similar dis- 
cussion of the multi-ethnic and multilingual city of 
Bangkok. 


5.1 Sino-Tibetan 


Sino-Tibetan is a generally accepted genetic linguistic 
family. There is very widespread but not universal 
agreement that the primary split within this family 
was between Sinitic and Tibeto-Burman subgroups. 
For reconstructions of Proto-Sino-Tibetan, see Ben- 
edict (1972) and Shafer (1974). There are various 
proposals about the macro-level connections of Sino- 
Tibetan. The earliest was a proposal by Sapir, sup- 
ported by Shafer who published some cognates and 
named this superfamily Eurasial, that it should be 
connected with Athabaskan or Na-Dene of north-western
North America. This was taken further by
Russian scholars, who also linked North Caucasian
and Yeniseian groups in what they call Sino-Caucasian
or Dene-Sino-Caucasian (Shevoroshkin 1991). A
more recent proposal is that of Sagart that
Sino-Tibetan should be linked with Austronesian, which
for him includes Tai-Kadai as a subgroup (Sagart
2004, 2005).

† An *asterisk before a name in bold type indicates a proto-form,
an earlier form reconstructed by means of the comparative
method, a standard technique in linguistics. Bold is also used for
language names; italics are used for language families and other
groupings, and also of course for words in languages other than
English.

It is difficult to generalize about the typological 
features of Sino-Tibetan, because the languages are so 
widespread and are spoken in several linguistic areas: 
East Asia, South-East Asia and South Asia. Almost all 
Tibeto-Burman languages are verb-final and post- 
posing, but Sinitic and the Tibeto-Burman languages 
Bai and the Karen languages are verb-medial; even 
these languages are mainly postposing. However, the 
reconstructible morphology of Sino-Tibetan includes 
a number of prefixes. 


5.1.1 Sinitic 


The Sinitic languages form one component of the 
Sino-Tibetan family. They are spoken by people who 
are generally called ‘Chinese’ in English. What unifies 
them is millennia of continuous use of a shared logo- 
graphic writing system, Chinese characters, a shared 
history and a common culture. While many of the 
spoken varieties of Chinese are not mutually intelli- 
gible, written Chinese unites them all. In 1958 China 
implemented a character simplification scheme, 
which eliminated all but 6,196 characters; of these, 
2,238 or over 36 per cent were simplified. This system 
of simplified characters or jiantizi is now used in 
China, Singapore and Malaysia. However, in Hong 
Kong, Macao, Taiwan and in many other overseas 
Chinese communities, traditional characters or fantizi 
continue in use. Chinese forms are cited here in 
pinyin ‘phonetic writing’, the official Romanization 
used in China, which represents putonghua ‘common 
speech’, the official standard language of China, based 
on the speech of Beijing. A slightly different variety 
of Beijing speech remains the official language of 
Taiwan, where it is known as Guoyu ‘national lan- 
guage’. For more details on language policy in China, 
see Bradley (1991). For a summary of the origins of 
Sinitic, see Sagart (1999). 
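
The proportion quoted for the simplification scheme can be checked with a few lines of arithmetic. This is only a sketch: the two counts are the ones given in the text, and the variable names are illustrative.

```python
from fractions import Fraction

# Figures quoted in the text for the character simplification scheme.
retained = 6196    # characters kept by the scheme
simplified = 2238  # retained characters that received a simplified form

share = Fraction(simplified, retained)
percent = float(share) * 100

# Consistent with the text's "over 36 per cent".
print(f"{percent:.1f} per cent of the retained characters were simplified")
```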

In China, the Chinese are the majority: over 
90 per cent of the population and an even higher 
percentage in the Hong Kong and Macao Special 
Administrative Regions. They are also the majority 
in Taiwan and Singapore, and a very substantial 
minority in Malaysia, Thailand and other South-East 
Asian countries. Smaller but still substantial numbers 
of Chinese are found throughout the world, particu- 
larly in urban centres such as Vancouver in Canada, 
San Francisco, New York and elsewhere in the United 
States, London, Sydney, Melbourne and so on. In 
Central Asia, there is a small Chinese minority of 
about 50,000 Chinese Muslims who speak Mandarin 
and are called Dungan (dong'an ‘eastern peace’), 
somewhat more in eastern Kyrgyzstan and somewhat 
fewer in eastern Kazakhstan. In China, the Chinese 
are known as Han from the name of a Chinese dyn-
asty (206 BC to AD 220). They are also referred to as
Hua, as in the term Huaqiao ‘overseas Chinese’ (liter-
ally ‘Hua bridge’). The term used to refer to the
standard Mandarin of Singapore is Huayu ‘Hua
language’. China is called Zhongguo ‘middle country’
in Chinese, Chinese writing is called Zhongwen 
‘middle writing’, and the citizens of China — includ- 
ing speakers of Chinese and other languages — are 
called Zhongguoren ‘people of the middle country’. 

Chinese is written using characters with a logo- 
graphic (‘pictographic’) origin that each represent 
one syllable. The majority of characters are composed 
of a semantic element (known as the radical) plus a 
phonetic element; there are over 200 radicals. Chinese 
characters have been in use for at least 3,500 years; 
the earliest surviving inscriptions are oracle inscrip- 
tions on animal bones. Classical Chinese texts, such as 
Confucius, date from over 2,500 years ago. Classical 
Chinese texts were the basis of the literary diglossic 
High known as wenyan, used in administration and 
in the examination system for recruitment and 
advancement within the Chinese government service 
until 1911. It was also widely used and imitated in 
Japan, Korea and Vietnam. The stability of Chinese 
characters and wenyan masked a major phonological 
and lexical diversification among the Sinitic lan- 
guages, with some syntactic changes as well; most 
scholars date the beginning of this diversification to
the Tang dynasty (AD 618-907), though some suggest
an earlier start. Fortunately, the Qieyun rhyme book
gives a phonological overview of traditional Chinese
phonology in AD 601.

The Chinese tone system derives from an initial 
system of four tones, ping ‘even’, shang ‘rising’, qu 
‘departing’ and ru ‘entering’; the latter occurs only in 
stop-final syllables, where no other tones are possible. 
The qu tone is secondary, and developed in syllables 
with an earlier final *s. Subsequent developments in 
the tonal systems of all varieties of Chinese are con- 
ditioned by the type of syllable-initial, yang (voiced) 
or yin (voiceless), giving eight tonal categories. In 
addition, there are various types of tone sandhi 
processes within compounds. Beijing Mandarin is 
unusual as it splits the yin ru tone among the four 
tones derived from the other seven categories. ‘Tone
sandhi’ is the assimilation process between tones of
adjacent syllables. The Sanskrit word sandhi is the 
standard term in linguistics for such processes, 
though in Sanskrit it refers mainly to assimilation of 
consonants across morpheme boundaries, rather than 
tones. 
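
The way four original tones and two types of syllable-initial yield eight tonal categories can be sketched as a simple cross-product. The labels follow the text; the list names and the "register tone" label format are illustrative assumptions, not established notation.

```python
from itertools import product

# The four original tones named in the text.
TONES = ["ping", "shang", "qu", "ru"]
# The conditioning factor: yin (voiceless initial) or yang (voiced initial).
REGISTERS = ["yin", "yang"]

# Each original tone splits according to the type of syllable-initial.
categories = [f"{register} {tone}" for tone, register in product(TONES, REGISTERS)]

for category in categories:
    print(category)
print(len(categories), "tonal categories")
```

Individual varieties then merge or redistribute these categories; the Beijing treatment of yin ru described above is one such redistribution.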

Chinese can be divided into seven major clusters: 
Mandarin, Wu, Gan, Xiang, Min, Yue and Kejia. 
The subclassification of these clusters is a matter of 
controversy; this atlas follows the standard classifica- 
tion by grouping Wu, Gan and Xiang together as 
Central, and grouping Min, Yue and Kejia together as 
Southern. In addition to these major clusters, there 
are three smaller residual groups in the south-west, 
Waxiang, Pinghua and Caijia, who speak distinct 
varieties of Chinese that are strongly influenced by 
other surrounding types of Chinese, but they prob- 
ably represent residual Chinese populations that 
preceded the spread of Mandarin and other varieties 
into these areas. The following discussion of Chinese 
varieties is drawn mainly from Wurm, T'sou and 
Bradley (1987). 

Mandarin, known as guanhua ‘speech of officials’
or beifanghua ‘northern speech’ in Chinese, is the 
speech of northern, north-western and south-western 
China. It can be divided into four groups, Northern
including Beijing, North-western, South-western and 
Eastern. North-western Mandarin is classified by 
some scholars as an eighth separate major cluster, 
known as Jin, but the differences from the rest of 
Mandarin do not justify this; they are mainly residual, 
such as the retention of a separate ru or stop-final 
category, which is also characteristic of the Mandarin 
varieties surrounding Jin. The spread of Mandarin 
varieties in south-western China has been a gradual 
process over the last couple of millennia, while its 
arrival in far north-western China beyond Gansu 
Province is somewhat more recent. 

The standard Mandarin of Beijing and all other 
types of Chinese — apart from the mixed language 
Hezhou — are verb-medial, but many other structural 
characteristics point to an earlier verb-final state, like 
nearly all the rest of Sino-Tibetan. Relative clauses 
precede the head, possessors precede the possessed, 
and most modifiers are postpositional. However, 
there are some structures, such as the so-called co- 
verb constructions, which are preposition-like. For 
example, the verb zai ‘be at’ is also grammaticalized 
into a coverb used like a locative preposition as in 
zai jia ‘at home’. Mandarin and all other modern 
Chinese varieties require a numeral classifier between 
a numeral and a noun, as in san ge ren (three human- 
classifier person) ‘three people’; however this classifier 
construction appears to be secondary, as it is absent 
from Classical Chinese. Because the writing system 
writes one syllable at a time, Chinese is usually said to 
be an isolating language without morphology; how- 
ever, there are progressively more and more nominal 
compounds, final syllables with neutral tone, and 
suffixes such as the human plural men and diminutive 
er. For more details of Chinese syntax, see Li and 
Thompson (1981). 

Standard Mandarin phonology is now usually
represented in the pinyin phonetic system as used
here. This has 23 initial consonants, 35 rhymes and
two final consonants. The initials b d z zh j g represent
voiceless unaspirated [p t ts tʂ tɕ k] respectively; the
initials p t c ch q k represent voiceless aspirated
[pʰ tʰ tsʰ tʂʰ tɕʰ kʰ]. The initials f s sh x h represent
the voiceless fricatives [f s ʂ ɕ x]. Initial nasals are
m n; the only finals (apart from glides) are the nasals
n ng [n ŋ]; the latter does not occur initially. There
are also liquids l and r and glides w and y
(representing [j]). Chinese has 35 distinct rhymes:
seven monophthongs i u ü ([y]) a e (schwa) er
(rhotacized schwa) and o; five rhymes with onglides,
four with offglides and four with both onglides and
offglides; and also fifteen rhymes with final nasals,
eight with final n including four with onglides, and
seven with final ng including three with onglides.
The inventory of rhymes has considerably decreased
through the history of Chinese phonology, and is
better-preserved in some
non-Mandarin varieties such as Cantonese.
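
The inventory counts in this paragraph can be cross-checked with a short sketch. The pinyin-to-IPA pairs below are the standard values for these letters; the grouping names and dictionary layout are illustrative assumptions rather than anything prescribed by the atlas.

```python
# Pinyin initials and their usual IPA values, grouped as in the text.
UNASPIRATED = dict(zip("b d z zh j g".split(), "p t ts tʂ tɕ k".split()))
ASPIRATED = dict(zip("p t c ch q k".split(), "pʰ tʰ tsʰ tʂʰ tɕʰ kʰ".split()))
FRICATIVES = dict(zip("f s sh x h".split(), "f s ʂ ɕ x".split()))
# Nasals, liquids and glides, listed without IPA values.
OTHER_INITIALS = ["m", "n", "l", "r", "w", "y"]

initial_count = (
    len(UNASPIRATED) + len(ASPIRATED) + len(FRICATIVES) + len(OTHER_INITIALS)
)

# Rhyme groups as counted in the text.
RHYME_GROUPS = {
    "monophthongs": 7,
    "with onglide": 5,
    "with offglide": 4,
    "with onglide and offglide": 4,
    "with final n": 8,
    "with final ng": 7,
}
rhyme_count = sum(RHYME_GROUPS.values())

print(initial_count, "initials;", rhyme_count, "rhymes")
```

Both totals agree with the figures of 23 initials and 35 rhymes stated at the start of the paragraph.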

Standard Mandarin is normally said to have four
tones, Tone 1 [⁵⁵] high level, Tone 2 [³⁵] high rising,
Tone 3 [²¹⁴] low falling-rising and Tone 4 [⁵¹] falling.
The pitch values of tones are normally transcribed
using superscript numbers after the syllable (with 5
representing the top of the pitch range and 1 repre-
senting the bottom), so raised [⁵⁵] represents high
level Tone 1. The superscript diacritics used to indi-
cate the tones in pinyin are omitted here. There is a
fifth tone, the so-called neutral tone, which occurs on 
various grammatical elements and other syllables that 
occur phrase or word finally. There are also some tone 
sandhi processes; for example, when two underlying 
Tone 3 words occur together in a compound, the first 
changes to Tone 2. 
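
The Tone 3 sandhi rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: tones are written as the integers 1 to 4, the input is a two-syllable compound, and the function name third_tone_sandhi is hypothetical. The neutral tone and longer sequences are not handled, since the text does not specify how the rule applies to them.

```python
def third_tone_sandhi(tones):
    """Apply the Tone 3 + Tone 3 rule to a two-syllable compound.

    `tones` is a list of two underlying tone numbers (1-4).
    Returns the surface tones as a new list.
    """
    if len(tones) != 2:
        # Assumption of this sketch: only two-syllable compounds are covered.
        raise ValueError("this sketch only handles two-syllable compounds")
    first, second = tones
    if first == 3 and second == 3:
        # Two underlying Tone 3 syllables: the first surfaces as Tone 2.
        return [2, 3]
    return [first, second]


print(third_tone_sandhi([3, 3]))  # [2, 3]
print(third_tone_sandhi([3, 4]))  # [3, 4]
```

Only the first syllable changes, and only when both syllables carry underlying Tone 3; every other combination is returned unchanged.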

In China there are about 900 million mother- 
tongue Mandarin speakers including nearly 500 
million for Beijing and related Northern varieties, 
over 50 million Jin or North-western, over 270 
million South-western and about 80 million Eastern. 
The main isogloss within Mandarin is along the 
Huaihe, which separates Northern and Eastern 
Mandarin across Hubei, Anhui and Jiangsu prov- 
inces; Eastern Mandarin also extends further north, 
covering much of Jiangsu. The main division within 
Mandarin is between Northern and North-western 
on one hand, versus Eastern and South-western on 
the other. For example, Eastern and South-western 
share a merger of n and l, usually to n. The main city 
of the Eastern Mandarin area is Nanjing; another 
term for Eastern Mandarin is Jianghuai. Northern 
Mandarin is further divided into various named local 
types, including North-eastern (in areas of the north- 
east of China more or less recently settled by Han 
Chinese), Northern (in Hebei and western Shan- 
dong), Jiaoliao (in eastern Shandong and southern 
Liaoning), Zhongyuan (in Shaanxi, eastern Gansu and 
south-western Shandong) and Lanyin (in Ningxia 
and north-western Gansu). 

In addition, nearly everyone in Taiwan and most 
ethnic Chinese in Singapore also speak Beijing-like 
varieties of Mandarin, usually in addition to one or 
more other varieties of Chinese. Since the mid-1970s, 
substantial numbers of Mandarin speakers have 
moved from China to nearly every urban centre in 
the Western world. In many areas, they joined existing 
overseas communities of non-Mandarin speakers, 
and those communities are also starting to use 
Mandarin as well as or instead of their traditional 
Chinese variety. The approximate total of Mandarin 
speakers outside mainland China is at least thirty 
million. There are, however, many ethnic Chinese 
overseas who do not speak Mandarin, and 
indeed some who speak no variety of Chinese. 

There is also a small population of South- 
western Mandarin speakers in northern South-East 
Asia, mainly Muslims involved in various late- 
nineteenth-century rebellions in Yunnan, the so- 
called ‘Black Flag’ movement. When these rebellions 
were put down, some rebels moved into northern 
Laos, north-eastern Burma (where they are known as 
Panthay) and northern Thailand. This is largely an 
urban group, but is also a particularly large com- 
ponent of the population of Kokang district in the 
north-east Shan State of Burma. Another wave of 
non-Muslim Yunnanese Chinese speakers came in 
the early 1950s, when several KMT armies retreated 
into Burma. These armies and their followers moved 
around the eastern Shan State of Burma, north- 
western Laos and northern Thailand from then until 
the early 1970s; some were ‘repatriated’ to Taiwan 
in the early 1960s, but others remain, mainly in 
northern Thailand. Both the nineteenth-century 
Muslim arrivals and the ex-KMT (Kuomintang) are 
now known as Haw in Thailand and Laos. 

Gan is the traditional one-syllable name for 
Jiangxi Province, and Gan is also the name for a 
variety of Chinese with nearly 36 million speakers 
that covers most of Jiangxi Province and some areas 
of southern Hubei, eastern Hunan and north-western 
Fujian. The Hui dialect is spoken by about 3.6 
million people in the former Huizhou Prefecture area 
of southern Anhui Province, centred on the city 
of Tunxi. Some scholars classify this as a Mandarin 
dialect, but it is transitional to Gan and spoken to the 
north-east of the main Gan area. Gan has voiceless 
aspirated reflexes of Proto-Chinese *voiced stops. 
Hui differs from Gan in that it has lost all Proto- 
Chinese final *nasals. 

Xiang is spoken by over 36 million people in 
central and southern Hunan and part of north- 
eastern Guangxi; Xiang is also the traditional one- 
syllable name for Hunan. It is normally divided into 
two main subgroups, the innovative New Xiang 
centred on the capital Changsha and other eastern 
areas, as opposed to conservative Old Xiang in south- 
western Hunan and into Guangxi. New Xiang is 
characterized by voiceless unaspirated reflexes of Proto- 
Chinese *voiced stops, while Old Xiang retains them 
as voiced. 

The Wu cluster is centred on Shanghai, China’s 
major commercial centre, and covers most of 
Zhejiang, southern Jiangsu and parts of eastern 
Anhui, eastern Jiangxi and northern Fujian provinces. 
There is a total of over 85 million speakers, including 
over 100,000 in Hong Kong, Taiwan and Singapore, 
and among the overseas Chinese. Wu has substantial 
internal differences, and is divided into six subvarieties: 
Taihu (including Shanghai), Taizhou, Oujiang, Chuqu, 
Wuzhou and Xuanzhou. Wu preserves the Proto- 
Chinese three-way stop manner contrast, with *voiced 
stops remaining voiced, contrasting with voiceless 
unaspirated and voiceless aspirated stops. In tone 
sandhi, some types of Wu reduce tonal contrasts in all 
but the first syllable of a compound, while other types 
of Wu reduce tonal contrasts in all but the last 
syllable. 

Min has a great deal of internal diversity among 
its 64 million speakers on the mainland, mainly in 
Fujian Province but also in south-eastern Zhejiang, 
the north-east coast of Guangdong around Chaozhou 
and Shantou, the peninsula of Guangdong south of 
Zhanjiang, and eastern Hainan. It is also the main 
local variety of Taiwan and many overseas Chinese 
communities in South-East Asia, with at least a fur- 
ther 15 million speakers. The Min form of Fujian is 
Hokkien, and this is another name often used to refer 
to Min Chinese overseas, as in Singapore. Min can be 
divided into 1) Minbei ‘Northern Min’ in north- 
western Fujian; 2) Mindong ‘Eastern Min’ in north- 
eastern Fujian, including Fuzhou city, and in Puxian 
in a small area around Putian on the central coast 
(the Mindong of this area is also known as Henghua); 
3) Minzhong ‘Central Min’ in a small area around 
Sanming City in west central Fujian, and 4) Minnan 
‘Southern Min’ in southern Fujian around Xiamen or 
Amoy, as well as on Taiwan, with a subvariety along 
the east coast of Guangdong known from the Man- 
darin names of the cities Chaozhou and Shantou or 
their Min equivalents Teochiu and Swatow. Various 
subtypes of 5) Qiongwen Min are spoken in eastern 
Hainan and around its south coast. Min is the 
predominant overseas Chinese variety in most of 
Thailand, the Philippines, most towns in peninsular 
Malaysia, Sabah in East Malaysia, central Sumatra, 
Bali, south-eastern Kalimantan and parts of Sulawesi 
in Indonesia. Min is phonologically the most diver- 
gent of the varieties of Chinese; note, for example, the 
Min forms of place names: Hokkien, Amoy, Teochiu, 
Hokchiu and Quemoy versus the equivalent Man- 
darin forms Fujian, Xiamen, Chaozhou, Fuzhou and 
Jinmen. Min is also notable for complex tone sandhi 
processes. 

Yue has about 44 million speakers in western 
Guangdong and eastern Guangxi, with some in 
Danzhou, north-west Hainan. There are more than 
six million speakers in Hong Kong and Macao, and at 
least twelve million overseas Chinese Yue speakers in 
many areas of South-East Asia as well as in many 
western countries. It has substantial internal diversity, 
but less than Min. The conventional division is into 
seven subgroups: 1) Guangfu, including Guangzhou 
in central Guangdong and in Hong Kong; 2) Siyi 
(or Seiyap in Yue) on the coast around Macao and 
Taishan or Toisan; 3) Gaoyang further west along the 
Guangdong coast; 4) Goulou in north-west Guang- 
dong and eastern Guangxi; 5) Wuhua in a small area 
of far south-western Guangdong; 6) Qinlian in the 
Qin delta on the coast of Guangxi; and 7) Yongxun 
scattered in south-western Guangxi. Yue is very con- 
servative in rhyme developments, and also has large 
inventories of tones. It has internal differences in its 
initials. For example, Proto-Chinese *voiced initials 
become voiceless unaspirated in Goulou, voiceless 
aspirated in Wuhua, and elsewhere split between 
aspirated in syllables with ping and shang tones and 
unaspirated in syllables with qu and ru tones. The 
three westernmost varieties, Gaoyang, Yongxun and 
Qinlian, share a voiceless lateral reflex of Proto- 
Chinese *s. Overseas Yue communities are concen- 
trated in Vietnam, Cambodia (and among post-1975 
refugees from these two countries) and some towns 
in peninsular Malaysia, including Kuala Lumpur and 
Ipoh, as well as further afield in Papua New Guinea, 
Australia, New Zealand, the United States and Can- 
ada, Central and South America, parts of Southern 
Africa, Madagascar and Western Europe. Most of the 
overseas Yue speak Guangfu varieties like that of 
Hong Kong, but most in Papua New Guinea and 
Mexico speak Siyi Yue, and there are small numbers 
of Siyi speakers in other Yue communities overseas. 

Kejia (the Mandarin term, which means ‘guest 
family’) or Hakka (the Cantonese equivalent) is a 
group of over 40 million speakers centred on eastern 
Guangdong, much of southern Jiangxi, south-eastern 
Fujian and part of south-eastern Hunan provinces. 
They are also very widely scattered across Guangxi 
and western Guangdong, with a substantial concen- 
tration in central Sichuan Province. Kejia is also 
spoken by about 10 per cent of the population of 
Taiwan, about 75,000 people in Hong Kong and 
nearly a million in overseas Chinese communities, for 
a worldwide total of about 43.5 million speakers. 
Kejia is receding in the face of Min, Yue, Mandarin 
and other varieties in many areas. Proto-Chinese 
*voiced initials become voiceless aspirated in Kejia, as 
in Gan; rhyme developments are diverse but mainly 
more conservative than Mandarin. Kejia tradition 
suggests that they come from the north of China, but 
their speech forms part of the Southern subgroup, 
not Mandarin. The overseas Kejia are concentrated in 
Java, north and south Sumatra, north-west Kaliman- 
tan, Ambon and West Irian in Indonesia, Kuching in 
east Malaysia, and in Timor Leste, with smaller con- 
centrations at Darwin in Australia, Samoa, Tahiti, 
Mauritius, the Seychelles and various locations in the 
Caribbean and southern Africa. 

Of the three residual south-western Chinese 
varieties, Waxiang (‘tile township’), also sometimes 
called Xianghua (‘township speech’), is spoken in a 
concentrated area along the border between Huaihua 
and Xiangxi prefectures in western Hunan Province 
by about 100,000 people. Pinghua is spoken in various 
areas along rivers in central and north-eastern 
Guangxi by about two million people. Caijia is 
spoken in north-western Guizhou by about 100,000 
people. All three are receding due to the spread of 
other varieties of Chinese. They may represent a 
residual Chinese population which was present prior 
to the spread of south-western Mandarin and the west- 
ward spread of Yue and Kejia. Their exact genetic 
position within Sinitic is not certain, but it appears that 
at least Waxiang and Pinghua may form a subgroup. 

The migration of Chinese into South-East Asia 
and beyond over the last few hundred years came 
mainly from the south-eastern coast. Thus, most of 
the longer-established overseas Chinese communities 
come from areas where Min, Yue or Kejia varieties 
were spoken. Due to a very strong tendency to chain 
migration, each settlement area tends to have a large 
preponderance of speakers of one local variety of a 
Min, Yue or Kejia group, and to use that variety as 
a lingua franca among all overseas Chinese. Larger 
centres may be more diverse in their ethnic Chinese 
population; see for example the discussion of Singa- 
pore, Section 5.1.1.3 below. The figures for speakers 
in Table 5.1 include double counting of bilinguals 
and trilinguals; for example, the bilingual speakers 
of Southern Min and Taiwan Mandarin, and the 
trilingual speakers of Kejia, Southern Min and Taiwan 
Mandarin; thus the overall total does not add up. 
However, bilinguals in mainland China are counted 
only as speakers of their home variety; nearly every- 
one can also speak standard Mandarin. 
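
The remark that the overall total does not add up can be checked with simple arithmetic on the figures printed in Table 5.1. The sketch below is an illustration only: the figures are copied from the table in millions, the grouping of rows into varieties and subvarieties is an assumption based on which rows sum to which totals, and the variable names are hypothetical.

```python
# Speaker figures in millions, as printed in Table 5.1.
# Each entry: variety -> (printed total, printed subvariety figures).
groups = {
    "Mandarin": (930.0, [530.0, 80.0, 50.0, 270.0]),
    "Xiang": (36.0, [22.0, 14.0]),
    "Gan (with Hui)": (39.6, [36.0, 3.6]),
    "Wu": (85.0, [59.1, 4.8, 5.5, 7.5, 4.6, 3.5]),
    "Yue": (62.0, []),
    "Min": (100.0, [0.8, 2.4, 11.4, 80.0, 5.4]),
    "Kejia": (43.5, []),
    "Old South-western": (2.2, [0.1, 2.0, 0.1]),
}
stated_total = 1275.0  # "Sinitic total" row

# Each printed variety total should equal the sum of its subvarieties.
consistent = all(
    abs(sum(parts) - total) < 0.05
    for total, parts in groups.values()
    if parts
)

# Sum of the variety totals versus the printed overall total.
row_sum = round(sum(total for total, _ in groups.values()), 1)
difference = round(row_sum - stated_total, 1)

print(consistent, row_sum, difference)
```

On these figures the subvariety rows are internally consistent, while the variety totals sum to more than the printed Sinitic total; that gap is what one would expect if the overall total counts bilinguals and trilinguals only once.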

In addition to the members of the Han majority, 
there are many members of several large minority 
nationalities included in the above totals who speak 
only Chinese. This includes all the Hui (Chinese Mus- 
lim; not to be confused with the Hui variety of Gan 
Chinese) apart from the Hezhou, Kangjia and Tsat 
speakers (see below), almost all the Manchu, Gelao, 
She and Hezhe, most of the Tujia and Xibo, and most 
of the people formerly classified as Han who have 
reclaimed minority ethnicity in the last twenty years. 
Within most other nationalities, there are also sub- 
stantial numbers who do not speak their traditional 
languages. Since 1950, most other members of all 
nationalities in China have learned standard Chinese 
as a second language. 


Table 5.1 Sinitic languages 

Languages                             Number of speakers
Mandarin Group                               930,000,000
  Northern                                   530,000,000
  Eastern (Xiangjiang, Jianghuai)             80,000,000
  North-western (Jin)                         50,000,000
  South-western                              270,000,000
Central Group
  Xiang (Hunan)                               36,000,000
    Old Xiang                                 22,000,000
    New Xiang (Changsha)                      14,000,000
  Gan (Anhui)                                 39,600,000
    Gan                                       36,000,000
    Hui                                        3,600,000
  Wu                                          85,000,000
    Taihu                                     59,100,000
    Oujiang                                    4,800,000
    Taizhou                                    5,500,000
    Chuqu                                      7,500,000
    Wuzhou                                     4,600,000
    Xuanzhou                                   3,500,000
Southern Group
  Yue (Cantonese)                             62,000,000
  Min                                        100,000,000
    Central (Minzhong)                           800,000
    Northern (Minbei)                          2,400,000
    Eastern (Mindong, Fuzhou, Puxian)         11,400,000
    Southern (Minnan)                         80,000,000
    Qiongwen (Hainan)                          5,400,000
  Kejia (Hakka)                               43,500,000
  Old South-western                            2,200,000
    Waxiang                                      100,000
    Pinghua                                    2,000,000
    Caijia                                       100,000
Sinitic total                              1,275,000,000

(allowing for double counting of bilinguals and trilinguals) 


5.1.1.1 Chinese in Taiwan 


The indigenous population of Taiwan traditionally 
spoke an extremely diverse range of Austronesian 
languages. Over more than 600 years, Taiwan has been 
extensively settled from the coast of Fujian Province, 
and most of its Chinese population speaks a variety 
of Southern Min, also known as Taiyu in Chinese or 
Taiwanese in English. There is also a substantial 
Kejia or Hakka population, nearly 10 per cent of 
the total, concentrated in north-western Taiwan and 
also widely scattered elsewhere. Taiyu or Taiwanese 
Southern Min is the language of most homes and 
informal domains, and is increasingly also used in 
public domains; Kejia also continues in use, but much 
more locally. 

Japanese occupation up to 1945 left a profound 
cultural influence, as well as extensive knowledge of 
Japanese language among the older generation of 
Taiwanese. The return to China in 1945 led to the use 
of Mandarin as the official language, under the name 
Guoyu ‘national language’. In the late 1940s, a sub- 
stantial number of speakers of a wide range of other 
types of Chinese, mainly Mandarin but also Wu and 
others, came to Taiwan. 

Mandarin has become an additional mother 
tongue since 1945, with distinctive Taiwanese Man- 
darin phonological and syntactic characteristics, as 
well as very extensive lexical and semantic differences 
from mainland Mandarin. For example, the term 
airen (literally ‘love-person’) means ‘mistress’ in 
Taiwan Mandarin and ‘wife’ in mainland Mandarin. 
Other non-Mandarin varieties brought from the 
mainland after 1945 and most of the indigenous 
Austronesian languages are in rapid decline. Most 
people in Taiwan now speak Taiwanese Mandarin in 
formal contexts and Taiwanese Southern Min in 
informal contexts. Taiwan continues to use traditional 
Chinese characters. 


5.1.1.2 Hong Kong and Macao 


Hong Kong (Xianggang in Mandarin; Hong Kong 
represents the Cantonese pronunciation of this word, 
which means ‘fragrant harbour’) is a major urban 
centre on Hong Kong Island, various surrounding 
islands and a large part of the adjacent mainland. 


Hong Kong Island and a small part of the Kowloon 
(in Mandarin Jiulong ‘nine dragons’) mainland were 
seized by the British in 1841 during the opium wars, 
to set up a safe base for their exports of opium into 
China. It replaced earlier trading posts in Guangzhou 
(Canton), where the Chinese government had repeat- 
edly tried to suppress this trade. The New Territories 
(various nearby islands and the remainder of the 
mainland area) were leased from China under duress 
for 99 years from 1898 after the suppression of 
the Boxer Rebellion in Beijing by British and other 
foreign troops. Hong Kong was under Japanese 
occupation from 21 December 1941 until 30 August 
1945, then again a British colony (with limited self- 
government from the late 1980s) until it was returned 
to China on 1 July 1997. It is now a Special Adminis- 
trative Region of China. 

Prior to 1841, the small local population was just 
over half indigenous Cantonese farmers, just under 
half Hakka fishermen and farmers who arrived about 
300 years ago, and a small group of Hoklo, mainly 
fishermen and under 2 per cent of the population. 
During the British period, very large numbers of 
Cantonese speakers moved in, especially from the 
Guangfu dialect region. Smaller numbers of people 
also moved from various parts of British India. 

From the late 1940s up to the late 1960s, a major 
population influx came from all areas of China, 
including much of the Shanghai business elite in the 
late 1940s and speakers of a number of varieties of 
Chinese, but mainly speakers of Cantonese, with 
some Hakka and Chaozhou speakers and a few 
others. This refugee intake waned and waxed accord- 
ing to political changes within China and the attitude 
of the British to refugees. Since their arrival from the 
late 1940s onward, most of the non-Cantonese- 
speaking refugees have learned Cantonese, and their 
descendants mostly speak only Cantonese. Before the 
handover to China in 1997, many Hong Kong resi- 
dents who were not ethnic Chinese (and some who 
were) left; this substantially decreased the ethnic 
Indian population, built up over more than 150 years 
of British rule. All of the Gurkha troops from Nepal 
and elsewhere in South Asia were withdrawn by mid- 
1997 and replaced by PLA troops from China. There 
is a very large Hong Kong Cantonese-speaking popu- 
lation in many overseas Chinese communities, espe- 
cially in Vancouver, Canada but also in almost every 
major English-speaking city in the world. Until about 
twenty years ago when more speakers of Mandarin 
started to go overseas, Hong Kong Cantonese was also 
the lingua franca in many overseas Chinese com- 
munities in western countries. 

In the 2001 census, the population of Hong Kong 
was returned as 6,708,389, an average annual increase of 
about 0.9 per cent from the 1996 total of 6,412,937. There are also very 
large numbers of guest workers from China and 
many areas of the Philippines, as well as some from 
Thailand and elsewhere in the region; most do not 
have Hong Kong resident status and are not enumer- 
ated in the census. Hong Kong is a major financial 
centre and a gateway into China, and has many other 
foreigners, especially British but also American, 
Australian, Japanese and so on. 
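
The growth figure can be checked against the two census totals quoted above. The sketch below is a rough check only, assuming exactly five years between the 1996 and 2001 counts and compound annual growth; the variable names are hypothetical.

```python
pop_1996 = 6_412_937  # 1996 total quoted in the text
pop_2001 = 6_708_389  # 2001 census total quoted in the text
years = 5             # assumption: five years between the two counts

# Growth over the whole interval.
total_growth = pop_2001 / pop_1996 - 1

# Equivalent compound growth per year over the interval.
annual_growth = (pop_2001 / pop_1996) ** (1 / years) - 1

print(f"{total_growth:.1%}")   # 4.6%
print(f"{annual_growth:.1%}")  # 0.9%
```

On these assumptions the population grew by about 4.6 per cent over the whole interval, which corresponds to roughly 0.9 per cent a year.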

The official languages of Hong Kong are Chinese 
(spoken Cantonese, written with traditional Chinese 
characters), English, and Mandarin Chinese (spoken 
putonghua, written with simplified characters, the 
national language of China). The spoken Cantonese 
of Hong Kong is similar to that of Guangzhou 
city, but with some tone mergers and other changes. 
Public signs are mainly in traditional characters and 
sometimes English; spoken public announcements 
are in these two languages and sometimes also in 
putonghua. The lingua franca of business among 
the Hong Kong elite and the language of much of 
the education system remains largely English, but the 
mother tongue of nearly all those of Chinese ethnicity 
is Cantonese. Mandarin is spreading very slowly; the 
main speakers are the PLA troops and representatives 
of the Chinese government in Hong Kong. The large 
and growing influx of tourists from other parts of 
China also requires receptive and limited spoken 
putonghua skills among workers in the service sector. 

Census statistics can be found on <info.gov.hk/censtatd/>, 
the website of the Hong Kong Census and 
Statistics Department. In a detailed analysis of the 
1971 and 1996 census results on language, T'sou 
(1997) reports the breakdown of population shown 
in Table 5.2. Kejia and Siyi are the Mandarin forms of 
Hakka and Seiyap. 

The local speech of Guangzhou city is the 
standard for spoken Cantonese; the main local speech 
variety used in Hong Kong is very similar. In Seiyap/ 
Siyi, an area on the coast near Macao, another variety 
of Cantonese is spoken. In the Chaozhou area of the 
north-eastern coast of Guangdong, a southern Min 
language known as Chaozhou in Mandarin and as 
Teochiu in local speech, is used. In Hakka areas at the 
intersection of Guangdong, Fujian and Jiangxi, Hakka 
(Mandarin Kejia) is spoken. In the rest of Guangdong 
and eastern Guangxi, other local varieties of Canton- 
ese are spoken. It can be seen that 83.8 per cent 
of Seiyap/Siyi, 43.4 per cent of Chaozhou, and an 
estimated 55 per cent of Hakka born outside Hong 
Kong now report that they speak Hong Kong 
Cantonese; these proportions are almost certainly 
much underestimated, with family background lan- 
guage use over-reported. 

Comparing these figures to those of the 1971 
census, there has been a substantial increase in the 
proportion of Hakka from 12.8 per cent to 14.8 per 
cent among the Hong Kong-born population; this 
may reflect increased reporting of Hakka identity 
rather than an actual increase in the proportion of 
Hakka. According to Lau (2005: 28), the proportion 
of Hakka speakers in Hong Kong has been decreasing 
for a long time, from 15.1 per cent of the total 1911 
population to 1.2 per cent of the total 1996 popula- 
tion; these proportions represent a more or less con- 
stant Hong Kong-born Hakka speech community of 
about 75,000, but that is also now decreasing, with 
most Hakka speakers now more fluent in Cantonese. 
In a 1995 study of the Hakka in the Northern District 
of the New Territories, which was Hakka-majority in 
1911, Chow and Lau found that only 12.9 per cent of 
those then under 25 were fluent in Hakka, and every- 
one except for a few old ladies (who would now be 
over 75, and can understand Cantonese) was fluent in 
Cantonese (Lau 2005: 29). T'sou (1997: 294, 296) 
estimates that the proportion of the Hakka from out- 
side Hong Kong who speak Cantonese at home has 
increased from 20 per cent to 55 per cent between 
1971 and 1996, and this increase is continuing. The 
long-term trend is for Hong Kong Cantonese to 
replace Hakka and other varieties of Chinese inside 
Hong Kong homes, as it has outside them. 

Macao was a Portuguese colony from 1555 until 
1999, and Portuguese was thus the official language. A 
Portuguese creole known as Patua also developed, 
partly derived from the Portuguese creoles of India 
and Malacca but with an additional Chinese stratum 
borrowed from the local variety of Yue. A very small 
number of creole speakers also lived in Hong Kong; 
for more details, see the Indo-European section 5.10 
below. By the end of 1999, most of the Portuguese 
and creole community had left for Portugal, and 
standard Portuguese is also a fading memory. The 
vast majority of the population of Macao have always 
been speakers of the local variety of Yue, known as 
Siyi in Mandarin and Seiyap in Yue. Since the return 
to China in December 1999, traditional characters 
continue in use, but the learning of standard Man- 
darin and simplified characters is spreading through 
education. Like Hong Kong, Macao remains a Special 
Administrative Region of China. 


5.1.1.3 Singapore 


Singapore is a major urban centre on an island at the 
southern tip of the Malay peninsula, in the Straits 
of Malacca. Until its establishment by Raffles as a 
British trading port in 1819, it was inhabited by only 
a small number of local Malays. Developing rapidly, 
in 1832 it became the administrative centre for the 
British East India Company in the region, replacing 
Penang, which had been under British rule since 
1786. With Penang, Malacca and later Labuan, it 
became part of the Crown Colony of the Straits 
Settlements on 1 September 1867. Occupied by Japan 
from 15 February 1942 until August 1945, it became a 
separate Crown Colony on 1 April 1946, a self- 
governing state on 1 August 1958, joined the Malay- 
sian Federation on 16 September 1963 and left it to 
become an independent republic on 9 August 1965. 
During the British period, Singapore was exten- 
sively settled from British India, especially from the 
south, but primarily from various parts of the south- 
eastern coast of China. By the 2000 census, 76.8 per 
cent of the population was of Chinese ethnicity, 13.9 
per cent of Malay ethnicity, 7.9 per cent of Indian 
ethnicity, and 1.4 per cent other ethnicities. In addition, 
there are over 750,000 non-Singaporeans living 
and working in Singapore, mainly from Malaysia 
(over 300,000) and China, Hong Kong or Taiwan 
(over 160,000) but also from Indonesia (over 60,000), 
with many from Thailand and the Philippines. As a 
major financial and technological centre, it has also 
attracted numerous British, American, Australian, 
Japanese and other foreigners. 


Table 5.2 Hong Kong: ethnicity and language, 1996 census, percentages 

               Place of birth
Language       Hong Kong  Guangzhou  Seiyap/Siyi  Chaozhou  Hakka/Kejia  Other Chinese  Other
English              1.0          0            0         0            0           0.05   56.4
Cantonese           81.7       94.4         83.8      43.4         (55)           94.3   15.9
Hakka/Kejia         14.8        3.8         0.04       2.0         (43)            4.4    2.0
Hoklo                2.2        1.4          0.1      54.5            0            0.5    1.8
Seiyap/Siyi         0.04        0.2         15.5      0.03            0            0.1      0
other Chinese        0.2        0.1          0.9      0.03            0            0.5    2.9
other                0.1          0         0.06       0.1            0            0.2   21.2

The official languages of Singapore are English, 
(Mandarin) Chinese, Malay and Tamil, a Dravidian 
language of southern India. The home language of 
most people in single-ethnicity Chinese families was 
mainly the ethnic language until 1979, with Hokkien 
as the lingua franca among those of Chinese ethnicity 
and English as the general lingua franca. Malay was 
used by Malays and by mixed Malay-Chinese or Baba 
Malay families. Many families of the various Indian 
sub-ethnicities have already shifted to English at 
home, and most Singaporeans speak and are literate 
in English. In fact, a distinctive Singaporean variety 
of English, with South Asian English roots, some 
Chinese substratum, and some Malay and Hokkien 
lexicon, has developed. 

On 7 September 1979, a Speak Mandarin cam- 
paign started, to replace the use of non-Mandarin 
varieties of Chinese in all public domains in Singa- 
pore. The Chinese characters in use were also changed 
from the traditional characters (still used in Hong 
Kong, Macao and Taiwan) to the simplified characters 
which came into use in China in 1958, and the pinyin 
Romanization was also put into use. This campaign 
has resulted in a very substantial decline in the use of 
non-Mandarin varieties (called ‘dialect’ in Singapore 
English) since then. However, Cantonese films and 
songs remain very popular, and Hokkien remains a 
widespread but declining informal lingua franca. 

Census statistics tracing these linguistic shifts can 
be found on <singstat.gov.sg>, the website of the 
Singapore Department of Statistics. At the time of 
the 2000 census, the breakdown of ethnicity, home 
language use and literacy in the mother tongue was as 
shown in Table 5.3. In this table, ‘home language’ 
refers to the specific variety (‘dialect’) of ethnic 
Chinese subgroups. 

The ‘other’ category in Table 5.3 is only Singa- 
pore citizens; it does not include the more than 
750,000 foreign residents who speak English, Malay, 
Thai, various Philippine languages and so on. Separ- 
ate figures on home language for smaller Chinese 
varieties are not available; the overall rate for all these 
varieties is 38.4 per cent, though the largest subgroup, 
the Hakka, have a substantially lower retention rate 
than most other subgroups. Population breakdown 
by sub-ethnicity, home language and mother-tongue 
literacy within the Indian ethnicity is also not avail- 
able in official statistics. The other Dravidian lan- 
guages include a substantial number of Malayalam 
background and smaller numbers of Kannada and 
Telugu background; other Indic groups include 
Gujarati, Hindi/Urdu and so on. The Singapore 
education system would in principle provide Tamil- 
medium education or a choice of Tamil as a second 
language, but almost no children are now enrolled in 
such programmes, and literacy in Tamil and other 
Indian languages is thus certain to decline in time. 

Literacy in Chinese is dialect-neutral as all 
dialects are written with the same writing system. 
Older Chinese people (those educated before 1979) 
are literate in traditional characters; all but the oldest 
literate Chinese are — instead or also — literate in 
simplified characters. In addition, 2.8 per cent of the 
Malay population is literate in Chinese, and a tiny 
part of the Indian and Chinese population is literate 
in Malay. 

Table 5.3 Singapore: ethnicity, home language and literacy, 2000 (numbers in 1000s) 

Ethnicity     Population        Home language    Mother-tongue literacy (%)
                number      %    number      %
All Chinese    2,236.1   76.8                      82.2
  all Min      1,808.2   55.4
    Hokkien    1,028.5   31.5     298.3   29
    Teochiu      526.2   16.1     135.2   25.7
    Hainan       167.6    5.1      64.4
    Foochow       46.9    1.4      18.8
    Henghua       23.5    0.7       9.0
    Hokchiu       15.5    0.5       6.0
  Cantonese      385.6   11.8     139.6   36.2
  Hakka          198.4    6.0     139.6
  Shanghai        21.6    0.7       8.3
  other           91.6    2.8      35.2
Malay            453.9   13.9     408.7   90.1      O73
All Indian       257.8    7.9     113     43.5     51.3
  Dravidian      220      6.8
    Tamil        155      4.8
    other         65      2.0
  Indic           38      1.2
    Punjabi       25      0.8
    other         13      0.4
other             91.6    2.8


The preponderant subgroup within the Chinese 
ethnicity is the Hokkien (46 per cent of all Chinese), 
with related Min varieties spoken by five other groups 
totalling a further 34.9 per cent of the Chinese 
ethnicity. Most of the speakers of other Min Chinese 
varieties and many of the Cantonese, Hakka and 
other Chinese can also speak some Hokkien; in mixed 
marriages across subgroups within the Chinese 
community, Hokkien used to dominate. Even some of 
those who are not of Chinese ethnicity in Singapore 
can also speak some Hokkien. Since the Speak 
Mandarin campaign started in late 1979, a very large 
number of Chinese families have switched to 
Mandarin as the home language, despite the fact that 
very few Chinese Singaporeans had Mandarin as their 
original mother-tongue variety before then. This has 
already led to a decline in Chinese ‘dialect’ use in 
Singapore, which looks certain to continue. 


5.1.2 Tibeto-Burman 


There are two early classifications of Tibeto-Burman 
(TB) languages: one is that of Shafer (1974), which 
splits it into four main parts, Bodic, Baric, Burmic and 
Karenic; the other is that of Benedict (1972), with 
eight subgroups plus an ‘other’ category. These nine 
subgroups have been reclassified by Benedict (1976) 
into three groups. For a comparison of these and 
other early classifications such as that of Grierson 
(1903-9) for the TB languages of India, see Hale 
(1982). 

With additional data on languages of China and 
north-eastern India, it has become clear that some 
revisions are needed; specifically, some of Benedict’s 
‘other’ languages, classified tentatively as Burmic by 
Shafer, form an additional group called the Qiang 
group by Sun (1983b), part of the proposed Rung 
subgroup of Thurgood (1985), and here the North- 
eastern TB languages. Shafer’s classification of a 
number of specific languages and language groups 
has proved doubtful; for example most languages of 
his Burmic group are here included in the Burmese-Ngwi 
subgroup of South-eastern TB or the Kuki-Chin 
subgroup, but some others such as Jinghpaw are in 
the North-eastern India TB group and others such as 
Qiang are in the North-eastern TB group. For the 
most recent views, see Bradley (1994, 2002) and 
Matisoff (2003). The classification followed here is 
from Bradley (2002). 

The classification of some languages is uncertain, 
in most cases due to very extensive contact: Bai and 
the two Tujia languages with Chinese, the Nungish 
languages with Ngwi languages, Lepcha with Tibetan 
and so on. The North-eastern India TB group 
includes Shafer’s Baric or Benedict’s Bodo-Garo-Konyak 
groups, now usually known as the Sal group 
from a name suggested by Burling (1983), plus 
Jinghpaw (Kachin) and the Sak or Luish group. It has 
some lexical peculiarities not shared with other TB 
languages. Most of its languages have some morphology, 
including parts of that reconstructed for Proto-TB 
by Wolfenden (1929), but relatively simple tonal 
systems. 

The Western TB group here corresponds mainly 
to Shafer’s Bodic group and Benedict’s Tibetan/ 
Kanauri plus Himalayan, with a few additions. Here it 
is divided into Bodish and Himalayan. Bodish has 
three subgroups: 1) Tibetan proper, 2) the West Bod- 
ish, Gurung or Tamang subgroup in west central 
Nepal, and 3) the East Bodish subgroup in eastern 
Bhutan and further east in Tibet and India. The other 
branch of Western TB is Himalayan, which includes 
West Himalayish in north-western India, Central 
Himalayan in central Nepal, and Kiranti in eastern 
Nepal and nearby in western Darjeeling and Sikkim 
in India. Some scholars like Benedict prefer to link 
West Himalayish and Bodish, and some have recently 
proposed Mahakiranti linking Central Himalayan and 
Kiranti. The Kiranti subgroup is internally quite close; 
Central Himalayan is more diverse and includes vari- 
ous languages not classified as Bodic by Shafer includ- 
ing Newari and Chepang. Most of these languages 
have extensive secondary morphology, especially on 
verbs; many have word rather than syllable tone sys- 
tems, often involving phonation as well, while some 
are nontonal. 

The Central TB subgroup includes the languages 
of the border between north-eastern India and Tibet, 
across northern Nepal and northern Bhutan, and also 
extending into northern Burma and adjacent areas of 
China. The Kuki-Chin subgroup, Shafer’s Kukish and 
Benedict’s Kuki-Chin-Naga, is typologically similar to 
the adjacent Central TB languages in having some 
preserved and some innovative verb morphology and 
relatively simple tonal systems, as do languages of the 
North-eastern TB subgroup. Some of the southern 
North-eastern TB group languages are lexically tran- 
sitional to South-eastern TB, but are phonologically 
and morphologically more typical of North-eastern 
TB. These include the Rawang-Nungish and Na 
languages. 

The South-eastern TB group includes the 
Burmese-Ngwi subgroup and the Karenic subgroup. 
These languages tend to have very little morphology 
and complex tonal systems typical of the northern 
Southeast and southern East Asian linguistic area. Of 
all the TB languages, only Bai and the Karenic sub- 
group have SVO syntax; but even Karenic retains 
various verb-final syntactic characteristics. Bai has 
been heavily influenced for millennia by SVO 
(Subject-Verb-Object) Chinese. All other TB languages 
are SOV. 


5.1.2.1 Western Tibeto-Burman 


5.1.2.1.1 Bodish 
5.1.2.1.1.1 Tibetan (Central Bodish) 


Some sources suggest as many as 6.5 million speakers 
of Tibetan, but this is an overestimate; the actual total 
is about 4.9 million first-language speakers and over 
100,000 second-language speakers in Tibet and other 
Tibetan areas in China, and well over a million else- 
where including Bhutan, Nepal and India. Within 
Tibetan there is a vast range of varieties, nearly all of 
which are linked by sharing Tibetan Buddhism and 
thus modern literary Tibetan. Scholars tend to divide 
this range into Western, Central, Southern, North- 
eastern or Amdo and Eastern or Khams subgroups. In 
India and Nepal, most Tibetans are pejoratively called 
Bhotia or Bhote, and in China they are called Zang 
(tsang). Apart from its role as the language of Tibet, 
varieties of Tibetan are or were the official language 
of various kingdoms, from Ladakh in the west to 
Mustang in north central Nepal, Sikkim, Bhutan and 
so on. The Dzongkha variety of Southern Tibetan is 
the national language of Bhutan, widely known as a 
second TB language by speakers of the various other 
TB languages of that country. The Danjong variety 
of Southern Tibetan is one of four official languages 
of Sikkim in India; it is the language of the north of 
Sikkim and its former ruling family. 

Tibetan is written with an Indic-based script, 
which represents various prefixed consonants including 
b, d, g, r, l, m and a-chung; the exact phonetic 
value of the latter prefix is a matter of dispute. These 
prefixes are not pronounced in most kinds of Tibetan, 
but they affect the development of tones. Many 
Tibetan words, including proper names, have such 
prefixes. For example, ‘monk’ is written with ‘b-’ 
prefixed on an initial ‘l’. Where words with prefixes are 
proper names or titles, the prefix is represented in 
lower case and the syllable-initial is in upper case: so 
‘monk’ would be written bLama. In Tibetan, and in 
the index to this atlas, such names are alphabetized 
immediately after the same initials without a prefix, 
so ‘Lama’ would come just before ‘bLama’. 
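
The ordering rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used to compile the index: the function names `split_prefix`, `index_key` and `sort_index` are hypothetical, the prefix is taken to be whatever lower-case letters precede the first capital, and names are compared on the whole unprefixed form, with the unprefixed name placed first when two forms tie.

```python
def split_prefix(name):
    """Split a name such as 'bLama' into (prefix, base) = ('b', 'Lama').

    Assumption: the prefix is every character before the first
    upper-case letter, following the lower-case-prefix convention.
    """
    for i, ch in enumerate(name):
        if ch.isupper():
            return name[:i], name[i:]
    return "", name  # no capital found: treat the whole name as the base


def index_key(name):
    """Sort key: base form first, then unprefixed before prefixed."""
    prefix, base = split_prefix(name)
    return (base.lower(), prefix != "", prefix)


def sort_index(names):
    """Return the names in the assumed index order."""
    return sorted(names, key=index_key)


if __name__ == "__main__":
    print(sort_index(["bLama", "Lama", "sGar", "dBus", "Gar"]))
```

Under these assumptions ‘Lama’ sorts immediately before ‘bLama’, as in the example above; a different reading of ‘the same initials’ (for instance, placing all prefixed names after every unprefixed name with the same initial letter) would need a different key.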

The Tibetan-speaking area has been divided 
among various provinces in China, including the 
Tibetan Autonomous Region, north-western Yunnan, 
western Sichuan, much of Qinghai and parts of 
southern Gansu. Tibetan Buddhist influence spread 
much further, into the Mongol areas far to the north- 
east. From 1959 many Central Tibetan speakers from 
Lhasa and elsewhere moved to India and Nepal as 
refugees. Closure of the Indian-Tibetan border in 
1965 further disrupted traditional trading relation- 
ships and left small Tibetan populations that spoke 
divergent Central and Southern Tibetan varieties cut 
off. In Nepal, the northern quarter of the country 
is inhabited mainly by Central and Southern Tibetan 
speakers, with large post-1959 refugee groups around 
Kathmandu and elsewhere. Apart from the speakers 
of Nepali, most of the population of Bhutan and Sik- 
kim speak some variety of Tibetan or other Bodish 
language. There are also a few Tibetan speakers in 
northernmost Burma. A conservative total is about 
five million speakers. Several other TB groups follow 
Tibetan Buddhism and use literary Tibetan as a 
liturgical language, but have distinct more or less 
closely related TB languages of their own, including 
most West Bodish, East Bodish and North-eastern TB 
languages. Most North-eastern TB groups in western 
Sichuan are included in the Tibetan nationality in 
China. 

It should be noted that the names of many 
Tibetan varieties include the suffix -pa/-po ‘people/men’ 
or the suffix -sKad/-kad/-kat or -ka/-kha 
‘speech’. 


5.1.2.1.1.1.1 Western Tibetan 
These languages are spoken in Kashmir, and are non- 
tonal. They are in various other ways phonologically 
conservative, some even reflecting the written 
Tibetan b-, d-, g-, r- and l- prefixes directly. 

Balti is spoken by a large Muslim population 
in the Pakistan-administered portion of northern 
Kashmir around Skardu and Khappalu known as 
Baltistan, and written in an Arabic script. Purik is the 
Tibetan variety spoken by another Muslim popula- 
tion in the Kargil area of Indian-administered eastern 
central Kashmir to the south of Baltistan. These two 
languages are very similar, but contact has been 
limited for nearly sixty years. 

Ladakhi is the local Tibetan variety of the 
Ladakh area around the town of Leh in Indian- 
administered north-eastern Kashmir; it is also known 
as Ladwags from its local name. The distinctive 
Zangskari speech variety of the Zangskar valley to the 
south-west of Leh is not very different from Ladakhi; 
both populations follow Tibetan Buddhism and use 
literary Tibetan for religious purposes. 


5.1.2.1.1.1.2 Central Tibetan 

In higher regions near the northern borders of north- 
western India and northern Nepal, there are many 
small Tibetan communities. Most of these are based 
on a herding economy, and live in concentrated valley 
villages in winter but are widely scattered in the 
mountains with their livestock in summer. 

There are some Western varieties of Central 
Tibetan spoken in Lahul (further north than the West 
Himalayish Lahuli languages) that are included in the 
Western Group by other scholars. Nishi (1986) calls 
these ‘Western innovative’ Tibetan and hence suggests 
a closer grouping with Ladakhi, Zangskari, Purik and 
Balti to the north-west. These are Patnam, Tod and 
Khoksar, the three closely-related ‘Bhotia’ varieties 
spoken in Lahul, and sometimes called ‘Lahuli’. 
Patnam is known to its speakers as Miyad Nala; Tod 
is spoken in the Tod valley; and Khoksar is in four 
villages to the north of the Rohtang Pass. 

Spiti is known to its speakers as Piti and 
spoken in the Spiti valley. To its south in Kinnaur 
is mNyamskad or Nyamkat. Further east, Jad or 
Garhwal Bhotia is also known as Jangali (‘jungle 
talk’); this group is under-enumerated in census 
figures. 

The mNgahris subgroup of Central Tibetan is 
mostly found in Ngari (written Tibetan mNgahris, 
formerly Chinese Ali and now Linzhi) Prefecture in 
western Tibet. It also extends into north-western 
Nepal, where it includes the so-called Humli Tamang 
(who of course are not Tamang), Khan, and 
Karmarong varieties. There are six varieties in Ali 
Prefecture, named from the principal towns of each: 
Ruthog, sGar (Gartok), rTsamdah, sPuhreng, 
mTshochen and dGergjas. Another is Lunggar in the 
western part of adjacent Shigatse Prefecture. How- 
ever, the north-eastern part of Ali Prefecture is 
inhabited by speakers of a Western Khams variety, 
sGerrtse. For more details on all these varieties, see 
Qu and Tan (1983); the total number of speakers is 
about 100,000. 

The gTsang subgroup of Central Tibetan includes 
most of the Tibetan varieties in the central and east- 
ern part of northern Nepal as well as those of the 
large Tibetan towns of Shigatse and Gyantse. There is 
more detail available on the subvarieties of Nepal. 
About 800,000 speakers are in Tibet, representing 
over 80 per cent of the total; the rest are in Nepal. 
Dolpo, Rengpungmo and Tichurong of north central 
Nepal are sometimes collectively known as Dolpo. 
bKag and gLo are the local speech of the Mustang 
area; gLo or Lo in Mustang itself, and bKag or Kag to 
the south. Nar, Gyasumdo, Nubri and Tsum are 
spoken to the north of the Gurung area. Kachad and 
Langtang are spoken to the north of Kathmandu. 

The gTsang variety Sherpa is spoken both in 
Tibet and in north-eastern Nepal, north and south of 
Mt Everest; the name means ‘eastern people’. The 
speakers are well-known for their mountaineering 
exploits and live mostly in north-eastern Nepal, with 
a substantial population in Darjeeling in India, about 
800 in Tibet, and some overseas. Not all ethnic Sherpa 
in India or overseas can speak the language. Closely 
related is Kagate, spoken to the south of the eastern 
Sherpa area by a small group. Jirel, Lhomi and 
Halung are spoken in north-eastern Nepal. Jirel lies 
between the two main Sherpa areas, Lhomi (hLomi, 
Kath Bhotia) is to the east of Sherpa, and Halung is in 
far north-eastern Nepal, north of Limbu; for a 
description, see Vesalainen and Vesalainen (1980). 
Jirel and Lhomi are quite similar. The local gTsang 
varieties spoken in Tibet include sKyidgrong, Dingri, 
gZhiskartse (Shigatse), rGyalrtse (Gyantse) and 
sNadkarrtse. 

dBus is central Central Tibetan, with about 1.4 
million speakers in Tibet and 200,000 elsewhere. 
It includes Lhasa (hLasa), Chushur, hPhanpo, 
rTsethang and hLunrtse varieties, among others, 
and extends into north-western and north-eastern 
Bhutan. Lhasa is the spoken variety of Tibetan of the 
capital, Lhasa; it is more generally known as ‘central’ 
dBus (transliterated from the written name) or Ü 
([y], from the spoken version of the same word). 
There is diglossia between modern literary Tibetan, 
used mostly for religious purposes but also for 
other reading-related activities, and the spoken Lhasa 
variety which is a widespread spoken lingua franca. 
The Lhasa variety has about 500,000 first-language 
speakers, including most of the nearly 150,000 
Tibetan refugees in Nepal, India and most Western 
countries. The Tibetan script is Indic-based and very 
widely studied by Tibetans and other Tibetan 
Buddhists. 


5.1.2.1.1.1.3 Southern Tibetan 

Southern Tibetan is slightly divergent from Central 
Tibetan; most speakers live in Sikkim and Bhutan, 
but Groma or Tromowa is the speech of the Chumbi 
valley in Yadong County, Tibet between Sikkim and 
Bhutan; it has about 10,000 speakers. 

Danjong was the language of the Sikkimese 
court, widely spoken in northern Sikkim and still 
used in education there. It has an orthography that is 
slightly different from literary Tibetan. 

Dzongkha or ‘fort language’, also sometimes 
known as Drukpa ‘dragon person’, is the local 
Tibetan variety of the western third of Bhutan. It is 
learned in schools throughout the country, with a 
slightly different orthography from literary Tibetan. 


5.1.2.1.1.1.4 North-eastern or Amdo Tibetan 

Amdo varieties, with about a million speakers, are 
archaic in some ways, for example in the retention of 
nasal prefixes including h- (known in the Tibetan 
orthography as a-chung). Included is the frequently 
cited Golok (mGolog) variety among many others. 
Chinese sources divide Amdo into four subgroups: 
hBrogpa (660,000), Rongba (125,000), Rongba- 
hBrogpa (140,000) and the isolated southern rTahu 
(75,000). Within rTahu, Nishi (1986) identifies 
Braghgo and rTahu varieties. Various sources give 
fifteen or more named varieties within hBrogpa; 
Rongba includes two or more varieties. Rongba- 
hBrogpa is transitional between the two, and itself 
has two or more varieties. There are many subgroups 
within Amdo, not all of which have been fully 
described; for a description of one, see Sun (1986). 


5.1.2.1.1.1.5 Eastern or Khams Tibetan 

Khams speakers are often nomadic herdsmen, 
moving far across the arid areas of central and 
western Tibet, but they are concentrated in the south- 
east, especially in south-western Sichuan and north-western 
Yunnan. Not all scholars agree with the 
Chinese in attributing rGerrtse (spoken in north- 
eastern Ali Prefecture in western Tibet) to the Khams 
group, but if this is accepted then Khams is the most 
widespread subtype of Tibetan, extending from fairly 
far west to the furthest east and south of all Tibetan 
traditional territory. There are nearly 1.6 million 
speakers, including eight villages in northernmost 
Burma. Khams is also the lingua franca of Tibetan 
herdsmen over a very wide area. Khams is divided 
into four geographical subtypes, Western, Northern, 
Eastern and Southern. 

Of the Western Khams varieties, rGerrtse is 
spoken in Ali Prefecture, with very sparse population 
in a band to the north-east and extending to the 
north of almost the entire Central Tibetan area, 
with Nagchu among other varieties included. These 
varieties account for about 175,000 speakers. 


Eastern Khams is the largest group of Khams 
varieties, with over a million speakers. It has at least 
ten varieties, including sDege, dKarmdzes, Chabmdo, 
Braggyab, Nyingkhri, hBahthang, Nyagchu and 
others. 

Northern Khams, with about 100,000 speakers in 
a sparsely settled region, includes hBristod, Khrihdu, 
sKyergumdo and Nangchen varieties. Two other iso- 
lated Northern Khams varieties are Cone, with about 
80,000 speakers, and the 30,000 hBrugchu to their 
east; they are separated from the rest of Khams by the 
Amdo-speaking area. 

Southern Khams is spoken by all of the Tibetans 
in Yunnan Province and some of those in south- 
westernmost Sichuan Province; speakers number 
about 200,000, with varieties sDerong, bDechen, 
rGyalthang and Phyaphreng. 


5.1.2.1.1.2 West Bodish 


These groups are all in central Nepal. Members of 
some of these groups have served in the Gurkha 
regiments for nearly 200 years, and quite a few have 
moved away from traditional areas and now speak 
only Nepali. Within most of the groups, there is a 
cultural and religious continuum from north (with 
very strong Tibetan Buddhist influence) to south 
(with Hinduization). Some scholars prefer to call 
this the Gurung group, the Tamang group, the 
Gurung-Tamang-Thakali group or the TGTK group, 
an acronym of Tamang, Gurung and Thakali, after 
one or more of the main languages included. 

Ghale, Kutang Ghale and Kaike show more 
Tibetan influence than the rest of this group. Some 
people classified as Gurung actually speak Ghale, and 
some classified as Ghale speak Gurung; the map shows 
the linguistic situation. Kutang Ghale is spoken to the 
north of Ghale by a small population who are classi- 
fied as Bhotia (Tibetan). Kaike is spoken by people 
classified as Magar in three villages; it is also some- 
times known as Tarali Kham, though it is quite dif- 
ferent from Kham, a Himalayan language of western 
Nepal whose speakers are also classified as Magar. 

Gurung, also now known by its autonym of 
Tamu, has two major dialects, west and east, which 
are so different as to be mutually unintelligible; 
there are also subdialects within both. 

In the eastern part of the Gurung range, there are 
small numbers of Dura speakers; the language is 
being replaced by Gurung and Nepali, but is still 
spoken by some older people. Little linguistic infor- 
mation is available, but it appears to be distinct from 
Gurung. 

The relatively small Thakali-speaking group is 
also known as Thaksya, or by the autonym Tapang or 
Tapaang. Usually included in the Thakali are the 
closely related Panchgaon (‘five village’) people to the 
north, including Punel (Marpha village), Syangtani 
(Syang village) and Chhimtani. The Thakali are 
vigorous and successful traders throughout Nepal. A 
recent grammar of Marpha Thakali is Georg (1996). 

Chantel, also known as Chantyal, is inundated 
with Nepali loanwords and endangered. It is spoken 
to the west of Thakali mainly in one village, with 
some other villages where ethnic Chantel speak only 
Nepali. One hypothesis is that it is a West Bodish 
lingua franca which developed in an area of mines, as 
it is isolated from the other West Bodish languages. 
For a brief description, see Noonan (2003). 

Manang is spoken by a strongly Buddhist group 
in the Manang valley. This language, also known as 
Nyishang or Nyeshang, shows Tibetan influence and 
is in close contact with nearby varieties of Tibetan. 

Formerly known as Murmi, Ishang or Sain, 
Tamang is widely spoken in the hills around the 
Kathmandu valley; there are substantial dialect 
differences. Ethnic Tamangs are scattered elsewhere 
throughout Nepal and into India, but most of those 
away from the original area do not speak Tamang. For 
a brief description of Tamang, see Mazaudon (2003). 


5.1.2.1.1.3 East Bodish 


East Bodish includes a variety of languages of Bhutan: 
the three closely related Bumthang languages, 
Bumthang, Khengkha and Kurtopkha of central 
Bhutan, as well as Olikha Monpa, Nyenkha and 
Chalikha, Dakpakha and Dzalakha. 

Tshangla is concentrated in south-eastern Bhutan 
and the western border of Arunachal Pradesh. In 
Bhutan the Tshangla are also known as Sharchop, 
‘eastern people’; in India they are called Monpa; and 
in Tibet they are the Menba. Tshangla is a lingua 
franca of eastern Bhutan. For a description, see And- 
vik (2003); for data on the varieties in China, see Sun 
et al. (1980) and Lu (1986). 

In Tibet and India, two distinct groups are rec- 
ognized within the Menba nationality and Monpa 
tribe. The two are Cona Menba or Central Monpa, 
and Motuo Menba or Eastern Monpa. Most of the 
Cona Menba speakers live in eastern Bhutan, where 
they are called Dzalakha, and around Tawang in 
Kameng District of Arunachal Pradesh, where they 
are called Central Monpa, but about 700 speakers live 
nearby in Cona County and elsewhere in Tibet. The 
8,000 Motuo Menba speakers in Tibet live mainly in 
Motuo County, but also in Linzhi and Milin counties 
in an area also known as Médog from its Tibetan 
name; there are also speakers nearby in north central 
Arunachal Pradesh in India, where they are known as 
Eastern Monpa. They are said to have migrated from 
Tawang several hundred years ago. 

The Lhokpu of south-western Bhutan and 
Gongduk of south central Bhutan have languages 
which are more distinct from the other East Bodish 
languages, which has led van Driem (2001) to classify 
them as further branches of Western TB under the 
term East Bodic. Both groups are extremely small. 
Another group which may belong here or in the West 
Arunachal Bugun-Khowa group is Sherdukpen of 
north-western Arunachal Pradesh in India and 
related groups there. 


5.1.2.1.2 Himalayan 


These languages form a group within TB. They com- 
prise the non-Bodish TB languages of Nepal and 
north-western India. In general Bodish groups inhabit 
the northernmost parts of north-western India and 
about the northern quarter of most of Nepal; Indic 
groups inhabit the lower areas of north-western India 
and the southern quarter of Nepal, the plains of the 
Terai. The Himalayan TB groups are between these 
Bodish and Indic groups. In addition, there has been 
extensive migration which has spread Nepali, the 
Indic national language, northward and eastward 
throughout Nepal and into north-eastern India; and, 
to a lesser but still substantial extent, there has been 
eastward migration of Himalayan TB groups who 
retain their ethnic identity outside their traditional 
areas but mostly speak only Nepali. 

The division in Grierson (1903-9) into pro- 
nominalised and non-pronominalised Himalayan 
languages (as originally proposed by Hodgson and 
then Konow) has been shown by Bauman (1975) 
and Caughley (1982) to be based on secondary and 
independent morphological developments. Shafer 
(1974) divides these languages into West Himalayish, 
West Central Himalayish and East Himalayish 
(Kiranti) sections of Bodic, with Newari less closely 
and clearly linked. Benedict (1972) prefers to connect 
the West Himalayish languages more directly to 
Tibetan in his Tibetan-Kanauri, with Magar inter- 
mediate between this and his Bahing-Vayu, which 
comprises the rest of the languages here. Glover 
(1974) does not consider the West Himalayish or 
Kanauri languages, but the rest he divides on lexical 
grounds into East Himalayish (Limbu plus the Rai 
languages) and Central Himalayish, which includes 
the rest as well as Tibetan and its outliers. This last 
classification accords with the traditional classifica- 
tion in Nepal, which groups the Rai or Kiranti TB 
languages of eastern Nepal as opposed to the others. 

The classification used here links (Shafer’s) West 
Himalayish group and (Benedict’s) Central Hima- 
layan or Mahakiranti group. The latter includes vari- 
ous TB languages of central Nepal and the Kiranti or 
Rai languages of eastern Nepal. Contra Glover, the 
West Bodish languages of western central Nepal 
(Gurung, Tamang, Thakali and so on) are not 
included here, nor are the numerous Tibetan 
varieties along the northern borders; 
the best known of these is Sherpa, in the north-east, 
but virtually every northern valley in Nepal has one. 
Also not included are the numerous post-1959 
Tibetan refugees. 


5.1.2.1.2.1 West Himalayish 


This subgroup falls into three parts. One is the 
languages of Lahul in western Himachal Pradesh, 
another is the languages of Kinnaur in north central 
Himachal Pradesh, and the third is the Rangpa lan- 
guages of north-eastern Almora in Uttaranchal, also 
extending slightly into north-western Nepal. Some 
scholars have suggested that the extinct Zhangzhung 
language, formerly spoken in western Tibet to the 
north of these languages, was also a member of this 
subgroup; it survives in a curious half-life as the litur- 
gical language of the Tibetan Bon religion, but with 
very heavy Tibetan influence. 

The Pattani language was formerly called Man- 
chati. It is spoken along the Chandra (upper Chinab) 
River by a Hindu group, and shows extensive lexical 
influence from Indic. The Chamba Lahuli variety of 
Pattani is spoken to the west, along the Chinab River 
at the eastern edge of Chamba District. 

Tinan and its Ranglo variety are spoken by small 
groups south of the Chandra (upper Chinab) River 
just to the south-east of its junction with the Bhaga 
River. Tinan is spoken just east of Pattani, with 
the Ranglo (‘near the pass’) variety further east. 
Gond(h)la is an alternative name for Tinan. 

The Bunan language (also known as Gari, Gahar 
or Gahri) is spoken further along the Bhaga River in 
western Lahul by a small group; it shows more 
Tibetan influence than Pattani. The term Lahuli is 
sometimes used to refer generally to the non-Tibetan 
West Himalayish languages of Lahul, including 
Pattani, Tinan and Ranglo as well as Bunan, and 
sometimes also for Tibetan varieties spoken there, but 
is not used in this way among speakers. 

There are two Central West Himalayish or 
Kanauri languages. Kanauri, also known as Kinnauri 
(and formerly as Thebor), is the non-Tibetan lan- 
guage of the Kinnaur area. The divergent Chitkhuli 
variety of Kanauri is spoken in two villages. Kanashi 
is spoken only by the Hindu inhabitants of Malana 
village, isolated from the rest of West Himalayish. 

There are four Rangpa languages in northern 
Almora, much further east in Uttaranchal. The 
westernmost is Rangpa itself, described in Zoller 
(1983). The Rangpa of north central Garhwal are also 
known as Marchha. The Darmiya are a small group 
in far north-eastern Almora; to their east are the 
Chaudangsi, and further to the north-east are the 
Byangsi. The latter two languages, especially Byangsi, 
are also spoken in north-western Nepal; some sources 
suggest that Chaudangsi and Byangsi are dialects of 
one language. 


5.1.2.1.2.2 Central Himalayish 


It now appears that most of the TB languages of 
Nepal — other than the Tibetan and West Bodish 
languages, and Dhimal — form a group known as 
Mahakiranti. This includes the Rai or Kiranti group, 
various TB languages of central Nepal whose classifi- 
cation was formerly unclear (Newari, Magar, Kham, 
Raute/Raji, Chepang) and two formerly included in 
West Himalayish (Baram, Thangmi). 

Newari is the traditional language of the 
Kathmandu valley, where it was the vehicle of a 
high civilization. The earliest dated inscription is 
from AD 1112; the traditional script is distinct from 
devanagari, but more recently devanagari has also 
been used to write Newari. Newari has lost its high 
status since the Gurkha (Nepali-speaking) conquest 
of Nepal, but is still very widely used in the 
Kathmandu valley and in the low hills to the east; it 
is also called Nepala bhasa (‘language of the Nepal 
(i.e. Kathmandu) valley’). Nearly all of its speakers 
are bilingual in Nepali, and many ethnic Newars, 
especially those further east in Nepal and India, do 
not speak Newari at all. There are various divergent 
varieties of Newari spoken in various towns further 
east in Nepal along the old trading route to Tibet, 
such as Dolakha, nearly 150 km (95 miles) north- 
eastward. For a description of Kathmandu Newari, 
see Hargreaves (2003); for a description of Dolakha 
Newari, see Genetti (2003). 

The Magar are a numerous and widely scattered 
group, found throughout the lower hills in western 
central Nepal; Magar or Mangar was formerly the 
name of a major kingdom in this area. Many Magar 
no longer speak the language, and two groups classi- 
fied as Magar speak instead other TB languages, 
Central Himalayan Kham and West Bodish Kaike. 
Magar has various dialects, which can be divided 
overall into western Syangja and eastern Tanahu. 

Raute and Raji are two names for one very small 
nomadic group of western central Nepal. Linguistic 
data are limited but suggest that the language is close 
to Magar. 

The Kham group to the north of the main Magar 
area are classified as ethnic Magar, but speak the quite 
distinct language Kham and call themselves Buhda. 
There is an excellent description in Watters (2002). 

The Baram or Bhramu language is still spoken 
by a few people to the north-west of Kathmandu. 
The Thami or Thangmi language is spoken to the 
north-east of Kathmandu. Baram and Thangmi were 
formerly classified as West Himalayish languages, and 
are in the process of being described. 

The Chepang group are to the south of the 
Kathmandu valley, and use the autonym Cho'bang, 
whence the Nepali name Chepang. There is also a 
small number of Bujheli (autonym Gharti) to the 
north-west of Chepang, and a few hundred nomadic 
Bankariya to the east. Like many Mahakiranti lan- 
guages near Kathmandu, Chepang is endangered, and 
Bujheli is more so. For a description, see Caughley 
(1982). 

The Kiranti or Rai subgroup of Mahakiranti 
includes all the TB languages of eastern Nepal other 
than Tibetan groups in the north and Dhimalish in 
the south-east. Apart from Limbu and Yakkha in the 
east and sometimes Sunwar in the west, all these 
groups are also included in the ethnic category Rai, 
also known as Kiranti (or Kirat) from the former 
kingdom of this area. Many Rai do not speak their 
traditional languages, especially those outside eastern 
Nepal, and quite a few Kiranti languages are nearly 
extinct, being replaced by Nepali or by other Kiranti 
languages. For example, Bantawa is replacing some of 
the smaller adjacent languages to its east. A detailed 
survey was undertaken some years ago by the Lin- 
guistic Survey of Nepal, and results (Hansson 1991) 
are available in part. Kiranti can be classified into 
eight subgroups: 1) Hayu; 2) Sunwar, Surel, Bahing, 
Chaurasia and Jerung; 3) Thulung; 4) a group with 
three subgroups: a) Khaling; b) Dumi, Kohi, Kulung, 
Sotang, Nachering, Parali and Chukwa; and c) Sangpang, 
Bantawa, Dungmali, Khesang, Chamling and 
Puma; 5) Athpare, Belhare, Chintang, Chulung, 
Yakkha and Lumba; 6) Mewahang, Lohorong, 
Yamphu and Yamphe; 7) Tilung; and 8) all varieties of 
Limbu. 

The Hayu or Vayu language is virtually extinct, 
with only a small number of older speakers; it is well 
described by Michailovsky (1988). 

Sunwar (Nepali names Sunuwar or Bahrathar, 
‘twelve clans’), preferred new autonym Koints, 
is often not regarded as a Kiranti language, but 
linguistically it belongs here; Surel is a variety of 
Sunwar. Bahing is also known as Rumdali or Rumdali 
Rai; the language is closely related to Sunwar and 
Chaurasia. Chaurasia is also known as Umbale, 
Ombule or Wambule; the language is extremely 
closely related to Jerung; for a description, see 
Opgenort (2002). The Jerung or Jero language is 
known to its speakers as Jeromala. 

Thulung forms a separate subgroup within the 
Kiranti languages. 

Khaling also constitutes an independent sub- 
group within Kiranti. Some scholars suggest that it is 
mutually intelligible with Dumi, but this may be due 
to contact. 

The Dumi language is spoken fluently mainly by 
older people. There are far fewer speakers than ethnic 
Dumi. The Kohi or Koi group speaks a language very 
similar to Dumi; the two are closest to Kulung and 
Nachering. Kulung is also known as Kulunge. Sotang, 
also known as Sotange and centred around the village 
of Sotang, is sometimes said to be a dialect of Kulung. 

Nachering is the language of a rather small 
group; another group, Dimali, is sometimes separated 
from it. Parali is almost undescribed and has very few 
speakers; it is similar to Nachering. 

Chukwa is another independent subgroup of 
Kiranti, similar to Kulung and Nachering but not part 
of either. 

One dialect of Sangpang (or Sangpahang) spoken 
at Kudak is actually closer to Kulung and Nachering, 
but other dialects are close to but distinct from 
Bantawa. 

The large and dialectally diverse Bantawa group 
is widely scattered outside its traditional area, and 
was also used as a lingua franca at an earlier period. 
Various subvarieties exist, including Pangduwali, 
Amchoke, Arthare, Dilpali, Wahitpang and probably 
others. Lambichong, also known as Mugali from its 
location in Muga village, is a very small group whose 
language is similar to Yakkha. They are classified as 
Bantawa, and their language is being replaced by 
Bantawa and Nepali. 

The Dungmali and Khesang languages form a 
subgroup of Kiranti fairly close to Bantawa; on the 
map they are shown together. 

Chamling, also known as Rodong, has numerous 
speakers and is widely scattered. The Puma are a 
small group whose language is similar to Chamling. 
Of the other Kiranti subgroups, Chamling and Puma 
are closest to Bantawa. Chamling has been described 
by Ebert (2003). 

The Athpare language is in the Athpare-Yakkha 
subgroup of Kiranti along with Lumba, Lambichong 
and Chulung. Not surprisingly, given its location, it 
shows some affinities with Chattare Limbu as well. 
Ebert (1997) provides a description. 

Belhare is the language of Belhariya village in the 
Athpare area; some scholars treat it as a variety of 
Athpare to which it is very similar. Bickel has 
provided excellent documentation of Belhare, for 
example Bickel (2003). 

Chintang is a small language in the process of 
being replaced by Bantawa; it is closest to Chulung. 
Both form part of the Athpare-Yakkha subgroup of 
Kiranti. The small Chulung group speaks a language 
close to Chintang. 

The Yakkha are a fairly large group not usually 
included within the ‘Rai’ category, and the language is 
closest to Kiranti languages such as Athpare. Lumba 
is another of the Athpare-Yakkha subgroup of 
Kiranti. 

Mewahang, also referred to as Newahang, is 
a large language with eastern and western dialects. 
Several other extinct Kiranti languages are very 
closely related to it; these include Saam, Sambya, 
Bungla and Pongyong. 

With two dialects, northern and southern, the 
Lohorong or Lorung language forms part of the 
Lohorong-Yamphe group. Yamphe is a small Kiranti 
group whose language is also known as Yakkhaba or 
sometimes included in Mewahang. 

The Yamphu language occupies an intermediate 
position in its subgroup between Lohorong, Yamphe 
and southern Lohorong. 

Tilung is a quite distinct language; it probably 
includes the so-called Dorungkecha Rai and 
Choskule Rai. 

The Limbu group is divided into four subgroups, 
of which the south-western Chathare (‘six clans’) 
Limbu speak a distinct language. The south-eastern 
Panchthare (‘five clans’) dialect extends into India 
(Darjeeling and Sikkim); central Phedappe and 
northern Tamarkhole are spoken only in eastern 
Nepal. The traditional ‘Kiranti’ (Limbu) Indic script 
is being brought back into use, both by Limbu literati 
in Nepal and through the education system of Sikkim 
in India. Limbu is usually not included in the 
category Rai in Nepal, but it is a Kiranti language. Van 
Driem (1987) provides a description. 


5.1.2.2 Sal 


This subgroup was named by Burling (1983) from 
the distinctive etymon for ‘sun’ found in most such 
languages; it is characterized by numerous other 
innovative etyma, such as *wal ‘fire’, *s-rang ‘sky’ and 
*nu ‘mother’. It was first proposed as a subgroup in 
Benedict (1976), and comprises: 


1 Shafer’s Baric group (Benedict’s Bodo-Garo- 
Northern Naga), which covers the plains of 
north-eastern India as well as a large area of the 
hills to the east; 

2 Jinghpaw (Kachin), the central nucleus of TB 
according to Benedict (1972), a part of Burmic 
according to Shafer (1974); 

3 Luish (for Shafer, another part of Burmic) or 
the Sak group; and 

4 Kuki-Chin, which is the most distinct of the 
four subgroups. 


Burling (2003) suggests a particularly close connec- 
tion between groups 2 and 3. The inclusion of Kuki- 
Chin in this subgroup is not universally accepted; it 
may be a further major subgroup of TB, while Shafer 
classified it as part of his Burmic. All these languages 
are SOV, with a certain amount of prefix and suffix 
morphology. The Sak group is scattered and mori- 
bund but formerly covered a much wider area; for 
one view see Luce (1985). Jinghpaw is the core group 
in the Kachin cultural system, which also includes 
several Burmish groups and a few other groups which 
fit elsewhere linguistically. Baric includes Boro (Bodo, 
Bara, Bara or ‘plains Kachari’), formerly the main 
language of the upper Brahmaputra valley in north- 
eastern India, with very closely related languages such 
as Dimasa (‘hills Kachari’), Kokborok (Tripuri), 
Lalung and so on covering the plains and low hill 
areas to the south, and fairly closely related Garo in 
the hills to the south-west. The ‘Koch’ (Western Bodo- 
Garo) languages Atong, Rabha, Wanang and so on are 
genetically close to Garo and in the plains to their 
west, with the still rather closely related Northern 
Naga languages of northern Nagaland, south-eastern 
Arunachal Pradesh, and adjacent areas of Burma to 
the east. Jinghpaw is spoken in a large area immedi- 
ately to the east of the latter, with the Sak group scat- 
tered (in an area now mainly inhabited by speakers 
of other TB languages) to the south. Löffler (1966a) 
has suggested that Chakma (spoken in Bangladesh 
and India) and its dialect Tongcengnya (Daingna, 
Daingnet, Doingnak, Dengnak, Tunzunnya, spoken in 
Bangladesh and Burma) should be included with the 
Sak group, but, whatever the historical facts, lin- 
guistically this is now a divergent dialect of Bengali 
spoken by Buddhists. 


5.1.2.2.1 Bodo-Garo languages 


Burling (2003) prefers to call this group Bodo-Koch; 
Koch is a collective term for various western Bodo- 
Garo languages beyond Garo. He suggests that the 
Northern Naga or Konyak languages are close to but 
distinct from Bodo-Koch within Sal. 

It has been claimed (All Bodo Students’ Union 
1987) that the large and widely distributed Bodo 
group has over four million members, though many 
of these now speak mainly or exclusively Assamese. 
Most live north of the Brahmaputra in Assam, with 
smaller numbers south of the river, in adjacent states 
and into Bhutan, Nepal and Bangladesh. In some 
areas, such as Nepal, they are known as Mech or 
Meche; this is now viewed as a pejorative name in 
India. Another general term is Kachari; the Plains 
Kachari are Bodo. The Bodo language has a Roman 
orthography and a Devanagari orthography and is 
used as a medium in some primary and secondary 
schools. 

Lalung or Tiwa is the language of a substantial 
group, but not all group members can speak the language; 
the group now prefers its autonym, Tiwa, to 
the exonym Lalung. Intermingled with Bodo to its 
north and Garo to its south-west, the language is very 
close to Bodo, though again much of the group now 
speaks mainly or exclusively Assamese. 

Centred in the North Cachar Hills, the language 
of the large Dimasa group is also known as Hills 
Kachari; it is also scattered in adjacent areas. There is 
a new Assamese-based orthography and an older but 
little-used Roman orthography, and some dialect 
diversification. 

The Hojai language of the Nowgong area in 
central Assam is spoken among a small group; only 
very limited linguistic data are available. 

Better known as Tripuri, the language of the state 
of Tripura in north-eastern India, the Kokborok lan- 
guage has a Roman orthography as well as an earlier 
Indic orthography. The current name Kokborok 
means ‘people’s language’; the word for ‘people’ is of 
course related to the name of the Bodo. The main 
dialects include the standard Debbarma (western), 
also Riang (Reang, south-eastern), Halam, Noatia or 
Tipra (eastern), plus various smaller dialects: Jamatia 
and Darlong (north-eastern), Aiang, Dahula, Kar- 
pong, Koloi, Laitong, Muslung and Rupini. In the 
Chittagong Hill Tracts of Bangladesh, the Kokborok 
are known as Mrung from the Arakanese name for 
this group. 

Garo, the language of the Garo Hills in western 
Meghalaya, has a very large number of speakers; 
about 10 per cent live in Bangladesh. The standard 
dialect, Achik, covers two-thirds of the area in the 
east, and the other main dialect, Abeng, is in the west. 
Between the two in the south is the Matabeng or 
Matjanchi dialect. Within Achik there are Gara and 
Ganching; Matchi and Dual; Chisak; and Awe or 
Akawe subdialects from south to north; the standard 
is based on the Awe subdialect of the Achik dialect. 
Some Koch languages (Atong, Wanang, Hajong) are 
also officially (but incorrectly) regarded as dialects of 
Garo. There is a well-established Roman orthography 
which has replaced the former Indic orthography. For 
a recent grammar, see Burling (2004). 

Spoken to the south-east of Garo and included 
in Garo census figures, Atong is a Koch language 
spoken along the Simsang River in India and Bangla- 
desh. Another such language is Wanang, spoken by a 
small group at the western edge of the Garo area and 
an even smaller group in Jalpaiguri District of West 
Bengal; Burling (1961) prefers to classify Wanang as a 
dialect of Rabha. The major Koch group is the Rabha, 
immediately to the north-west of the Garo in the 
Brahmaputra valley and with a number of dialects, 
including Tintikia and possibly Wanang. Most ethnic 
Rabha now speak only Assamese, but about a quarter 
of them still use the language, especially those in 
Rangdani and Maitori. A Roman orthography exists. 

Hajong is another small Koch language spoken 
to the north-west of Garo and scattered elsewhere; it 
is severely endangered by Assamese, with little 
remaining Bodo-Garo lexicon. The final Koch lan- 
guage, Ruga, is spoken in Rugapura and Dimapura 
villages of the Dalu area in India, to the south of 
Garo; it is moribund. Most Koch languages are 
poorly known and have small and declining speaker 
numbers. 

Also known as Chutiya, Deori Chutiya or Deuri, 
the Deori group is scattered along the Brahmaputra; 
for a full description see Jacquesson (2005), who indi- 
cates that there are 10,000 to 15,000 speakers. Accord- 
ing to Benedict and Burling, this is the most divergent 
of the core Bodo-Garo languages. 


5.1.2.2.1.1 Northern Naga 


This subgroup is Shafer’s Nagish portion of Baric, 
where Benedict and more recently Burling also place 
it. Burling (2003) prefers the term Konyak Group 
from the name of one large language which is 
included. Its subgrouping has been the subject of 
French (1983); Marrison (1967) independently 
separates it from the other Naga languages as his 
Naga A. The names of ‘Naga’ groups are notoriously 
confusing and confused: Assamese or other names of 
villages, rivers or towns where contact occurred, clan 
names for the very numerous subgroups of each 
group, names used by other ‘Naga’ groups to refer to 
a group, autonyms and descriptive names are all used. 

Khyamnyungan is spoken by the people known 
in the anthropological literature as Kalyokengnyu 
from their stone-roofed houses; this large group is the 
southernmost Sal Northern Naga group. About a fifth 
of the group is in India but some 40,000 are in 
Burma, where some are known as Nok-aw or Nauk-o 
(a clan name), Ponyo (a village name), Para or Paya (a 
Burmese name of uncertain origin) and so on. In 
India the Sema call them Tukhemmi and the Chang 
call them Aoshedd. The language is being described 
by Alec Coupe. 

The Chang, a small group of northern Nagaland, 
has a Roman orthography. Linguistic data are being 
collected by Alec Coupe. The Ao and Konyak name 
for the group is Majung, Mojung or Manjung, and 
the Sangtam name is Machongrr. Phom is an adjacent 
small group with a Roman orthography but little 
linguistic data. Their former autonym was Ching- 
mengnu, and they appear as Assiringia (a village 
name) and Tamlu in the literature. 

Konyak is the largest Naga group in India, with a 
Roman orthography. It is spoken at the northern tip 
of Nagaland. The standard dialect is spoken at 
Wakching; in older sources Konyak is also referred to 
as Tableng, Mulung, Kongon or Angwangku. Various 
sources list over thirty current clan or village dialects; 
the Htangan of Burma may speak a dialect of Kon- 
yak. The Wancho are another substantial group, 
mainly at the southern tip of Arunachal Pradesh and 
extending into Burma. A Roman orthography exists. 
This group was formerly known as Banpara, Mutonia, 
Joboka or Jokoba, with subgroups Khulung-Muthun, 
Bor-Muthun and Horu-Muthun. It has two main 
subgroups: Changjan and Tangjan. French (1983) 
links it closely with Chang. 

Nocte is a group of about 40,000 speakers, about 
half in India and half in Burma. There is a Romanized 
orthography in India. Formerly known as Namsangia 
(a village), Borduria (another village) or Jaipuria 
(a town which many Nocte visit), it has six main 
dialect groups: Hawajap, Japejap, Kapajap, Lazujap, 
Photungjap and Tangjap. Another group very closely 
related to Nocte has two names: Tangsa in India and 
Rangpan in Burma. They are somewhat more 
numerous in Burma than in India. Again, a multitude 
of clan and other names appear in the literature; 
Moshang (Mawshang) and Shangge are two in 
Burma. There is a Romanized orthography for Tangsa 
used in India. 

Haimi and Htangan are groups of about 20,000 
each in Burma, with many named clan dialects. 
Marrison (1967) suggests that Htangan is very similar 
to Konyak, which is spoken just to its east, and may 
be a Konyak dialect. The speakers of Nocte, Rangpan, 
Haimi, Htangan and Wancho in Burma have agreed 
to a shared Roman orthography based on the 
Moshang clan dialect of Rangpan. This orthography 
is a considerable improvement on all those used in 
India for these languages, as it indicates the tones. 


5.1.2.2.2 Jinghpaw 


Known in India as Singpho, in China as part of the 
Jingpo nationality, and in Burmese as Kachin, the 
Jinghpaw group forms the core of the Kachin culture 
complex in northern Burma, with minor extensions 
into China and India. The official 1983 population of 
‘Kachin’ in Burma was 465,484; this includes a large 
number of Burmish-language speakers — Maru, Atsi, 
Lashi and Ngochang — but is certainly an underenu- 
meration for the wider ‘Kachin’ group. In China there 
were 132,143 members of the Jingpo nationality in 
2000, but only a bit over 20,000 speak Jinghpaw as 
their first language. In India the Singpho total is 
about 2,000, plus some Turung who now speak a 
quite divergent variety of Jinghpaw. A conservative 
current estimate of the ‘Kachin’ population is 0.9 
million, of whom over 600,000 speak Jinghpaw as a 
first language, with the rest speaking it very fluently 
or bilingually as a second language and using it as the 
medium of literacy; less fluent second-language 
speakers include many Rawang, Shan, Lisu and 
Khamti speakers in Burma. 

The Jinghpaw Roman orthography is very widely 
used, but unfortunately does not indicate tones. In a 
very minor revision, the Jinghpaw orthography used 
in China has replaced the digraph aw with the single 
vowel o. Separate Roman orthographies are now being 
developed for some Burmish ‘Kachin’ languages, but 
at present they all use Jinghpaw as a lingua franca and 
a medium of literacy. Within Jinghpaw there are 
some divergent dialects, most notably the Gauri 
(Kauri, Hkauri) dialect of the area just north of 
Bhamo in Burma. Often clan names are incorrectly 
cited as if they were dialect names. Jinghpaw is rela- 
tively well-described; for a brief summary, see Dai 
and Diehl (2003). 


5.1.2.2.3 Sak or Luish group 


The term Luish comes from Meithei lui ‘slave’ and 
refers to the status of the Sak groups in Manipur. The 
best-maintained language of the Sak group is Sak 
itself, often known from the Burmese form of its 
name, Thet, or Arakanese That. Their autonym is 
[atsa']; they should not be confused with the Bengali- 
speaking, but Buddhist, Chakma. The best source is 
Bernot (1967), but see also Luce (1985). They total 
about 7,000 speakers, with several thousand each in 
Burma and Bangladesh. 

The Kadu appear in the history of Burma as the 
former dominant group of the Tagaung kingdom in 
Upper Burma, under the name Kantu; they should of 
course not be confused with the near-homonymous MK 
group in Vietnam. Their language is spoken by under 
a thousand older people in 16 villages, with about 
20,000 ethnic Kadu in Pinlebu and Banmauk areas of 
Katha District, Sagaing Division; the group’s auto- 
nym is [asa']. Closely related to Kadu and located to 
its immediate west, Ganan has some 7,000 ethnic 
group members but at most 500 speakers in about 
twenty villages. Both Kadu and Ganan are moribund 
and poorly described. 

Reported in one village north of Homalin in 
Burma, Taman is west of Kadu and Ganan. It has 
fewer than a thousand group members and is most 
probably extinct. Malin was very close to Taman, but 
is reported to have been extinct for about eighty 
years. There are three very small groups in adjacent 
areas of Manipur in India who formerly spoke other 
closely related languages long ago replaced by 
Meithei, and known in the literature as Andro or 
Undro; Sengmai; and Chairel (the last is a village 
name), and more recently reported under the names 
Phayeng, Sekmai and Chakpa. 

A much more ancient extinct Sak language may 
have been the language of the Pyu, who ruled the 
plains of central Burma up to AD 960 and left inscriptions 
in a distinctive Indic orthography. There is one 
much later Pyu inscription, one of the four faces of 
the Myazedi inscription, dated AD 1112. This shows 
that knowledge of the Pyu language and its script 
persisted well into the Pagan period of Burmese 
dominance. 


5.1.2.2.4 Kuki-Chin 


This subgroup is relatively cohesive, both geographic- 
ally and linguistically, and has been extensively investi- 
gated by Shafer, who classifies it as part of Burmic. 
Benedict uses the earlier term Kuki-Chin for the 
subgroup, and likewise links it to Burmese-Ngwi. 
Kuki-Chin includes Marrison’s Naga B and Naga C 
groups, but not Naga A, which covers the Northern 
Naga or Konyak Group languages of the Sal group. 
Burling (2003) suggests a fairly major restructuring of 
Kuki-Chin into seven subgroups, which he calls 1) 
Ao Group (Marrison’s Naga B), 2) Angami-Pochuri 
Group (mostly Marrison’s Naga C1), 3) Zeme Group 
(Marrison’s Naga C2), 4) Tangkhul Group, 5) Karbi, 
6) Meithei and 7) Kuki-Chin proper. All scholars 
agree that Mru, which is in the southern Chin area 
and has sometimes been classified as Kuki-Chin, does 
not fit here; it is closer to Burmese-Ngwi (see Löffler 
1966b, Luce 1985). 

Names for these groups are much more 
numerous than distinct languages. Firstly, there are 
general names: in India those who live in Nagaland 
and northern Manipur are often called Naga, those 
who live in southern Manipur and points west are 
often called Kuki, while the largest group in Mizoram 
has renamed itself Mizo; it was formerly Lushai. In 
Burma all are within the overall category of Chin, 
which includes 54 recognized ethnic groups. For 
example, the Thado (or Thadou) are usually con- 
sidered as Kuki or more specifically Thadou Kuki in 
India, but form part of the Northern Chin group and 
are thus considered as Chin — or more specifically 
Thado Chin — in Burma. One group of eastern 
Manipur, the Anal, not wanting to be Anal Kuki, 
decided some years ago to classify themselves as Naga 
rather than Kuki. Secondly, there are more specific 
names for subgroups — in many cases, former names 
and names used by outsiders as opposed to autonyms. 
For example, Arleng was formerly known by the 
exonym Mikir, and is now officially known by an 
alternative autonym, Karbi. Thirdly, in India various 
southern Naga groups have recently been amalgamat- 
ing, and new combined names have been coined to 
refer to these larger groups. Initially, Zeme, Mzieme 
and Liangmai combined into Zeliang. Later, the 
Nruanghmei joined in, and the new group is called 
Zeliangrong. However, some people prefer to remain 
under a previous category, so there are still some 
Zeme, some who prefer to be Zeliang and so on. 
For obvious reasons, the older cover term for the 
Zeliangrong groups, Kachha Naga (‘bad Naga’), and 
other such pejorative terms are no longer used. 
Fourthly, town or village names are sometimes used 
instead of other names: for example, Tiddim Chin 
instead of Kamhau Chin. 

Some of the languages included are more 
divergent; the foremost example is Arleng (Mikir, 
Karbi), which has long been in contact with Sal TB 
languages as well as non-TB languages. It has even 
been suggested that there may be a connection 
between Kuki-Chin and Lepcha, via Arleng (Bauman 
1976). Also somewhat divergent is Meithei 
(Manipuri), which has long been the language of a 
Hindu civilization in the Manipur valley and thus 
shows more Indic influence. Most of the other lan- 
guages are spoken by hill groups, some extremely 
small. 

Linguistically these languages are characterized 
by tones (mostly not indicated in orthographies), 
extensive verb morphology involving tonal alterna- 
tions and extensive suffixing, and some prefixes as 
well. There is a widespread *ni ergative suffix on NPs 
(noun phrases); the basic word order is SOV. The 
following summary presents the Kuki-Chin languages 
from north to south. 

The Ao Group or Naga B includes Ao, Sangtam, 
Lhota and Yimchungrü. The Ao language, with 
Roman orthography based on the Chungli dialect, also 
has the very distinct Mongsen dialect to the east, 
which shows some contact effects with Northern Naga 
Phom and Chang to its east. There are also tran- 
sitional dialects even closer to Northern Naga, such as 
the Ao speech of Yacham and Tangsa villages. The 
Sangtam language, formerly also known as Thukumi, 
is found in three main locations, with some dialect 
differences; the ‘standard’ basis for the Roman ortho- 
graphy is the northern Lophomi dialect. For Lhota, 
Wokha is the ‘standard’ dialect and basis for the 
Roman orthography. The Yimchungrü language, 
formerly also known as Yachumi, is spoken in two 
separate areas in India, and extends well into Burma; 
the Roman orthography represents the Yachumi 
dialect. Coupe (2003) is a detailed phonological study 
of Ao, and Coupe (forthcoming) is a grammar. 

The Angami-Pochuri Group or Naga C1 includes 
North Rengma, Meluri or Pochuri, Rengma, Sema or 
Simi, and a more closely linked subgroup of Angami, 
Chokri, Kheza and Mao (Burling 2003). Rengma, 
perhaps formerly more widespread, is spoken in 
various locations. Though regarded as one group, it 
probably includes three languages: Rengma in the 
west, for which the Tseminyu dialect is the basis for 
the orthography; Ntenyi in the north; and Meluri 
(with the autonym Anyo and also known as Pochuri) 
in the east. Ntenyi and Meluri are genetically closer to 
Sema than to (western) Rengma. The Sema, also 
known as Simi, are a large group, with a Roman 
orthography. Angami was an ethnonym that formerly 
included three languages: western Angami, eastern 
Chokri and southern Kezha. The latter two are now 
separated from Angami, and have amalgamated with 
part of the Sangtam living near the Chokri under the 
new name Chakhesang. The Angami orthography is 
based on the Tengima dialect. The Mao language 
is quite close to Angami, Chokri and Kezha, and its 
speakers were also formerly classified as part of the 
Angami ethnic group. The town of Mao (also known 
as Sopvoma) is the largest village of this group and 
the source of the name. 

The Zeme Group or Naga C2 includes Nruangh- 
mei (formerly Kabui), Puiron (sometimes viewed as a 
dialect of Nruanghmei), Khoirao, Zeme (formerly 
Empeo), Mzieme, Liangmai (formerly Kwoireng or 
Liang) and Maram (Burling 2003). Initially, the 
Zeme, Mzieme and Liangmai amalgamated into 
the new Zeliang group; subsequently, Nruanghmei 
was added, and the expanded new group called 
Zeliangrong. A former cover term for these three 
groups, Kachha Naga ‘bad Naga’, is no longer used. 
Separate Romanized orthographies exist for Zeme, 
Mzieme, Liangmai and Nruanghmei. Maram is 
named from its main village; linguistically, it is 
similar to the languages included in Zeliangrong. 

The Tangkhul Group includes Tangkhul and 
Maring. Tangkhul is also known as Luhupa (‘sav- 
ages’) in Meithei; speakers are equally divided 
between Burma and India, with about 50,000 in each. 
The standard dialect, written in a Romanization, is 
based on the speech of Ukhrul town. The small 
Maring group lives to the south of the Tangkhul; until 
the Kuki Anal declared themselves Naga, Maring was 
the southernmost language called Naga in India. 

Arleng is a very widespread language of the 
Karbi Hills and nearby areas of north-eastern 
India with various dialects: Amri (western), Bhoi 
(south-western, with Khasi influence), Rengkhang 
(southern) and so on. Many ethnic Arleng are only 
Assamese speakers. Formerly known as Mikir and 
now known officially as Karbi, which is another 
autonym, this group’s main autonym is Arleng; it is 
described in Grüssner (1978). 

Also known as Manipuri from the place name 
Manipur, the Meithei language has an old traditional 
Indic orthography known as Mayek, but has been 
written with Bengali script since AD 1729. There are 
claims of extreme antiquity for the original script, 
which is now being revived, but the oldest dated 
inscriptions are only about 900 years old (Chelliah 
and Ray 2002). In August 1992, Meithei under 
the name Manipuri was officially recognized as a 
Schedule VIII language of the Constitution of India; 
it is the only TB language with this status. It is spoken 
mainly in the Manipur valley but also by small 
numbers in Burma and other parts of north-eastern 
India; many of the astrologers in Burma were 
formerly Meithei. For a brief description, see Chelliah 
(2003). So-called Bishnupriya Manipuri is a mixed 
language spoken by former Bengali immigrants, with 
substantial Meithei lexicon but basically Bengali 
structure and reduced morphology; the speakers of 
Bishnupriya Manipuri now live west of Manipur. 
Many non-Meithei Kuki-Chin and other people in the 
valley also speak Meithei; some of their languages 
(especially Sak group and some Kuki languages) are 
being replaced by it. 

Core Kuki-Chin includes Kuki, North Chin, 
Central Chin including Mizo, South Chin and Khami/ 
Khumi subgroups. Thang (2001) compares 21 Kuki- 
Chin languages of Burma and suggests that they fall 
into two groups on lexical and comparative phonological 
grounds: 1) North Chin plus Mara and 2) 
Central Chin plus a subgroup comprising South Chin 
and Khumi; his reconstruction does not consider the 
Kuki languages of India. 

The term Old Kuki was used in the Linguistic 
Survey of India to refer to various small Kuki groups 
around the Manipur valley and to the west. It 
includes some of the Kuki groups of Tripura and 
nearby parts of Bangladesh and Cachar, for example 
speakers of Rangkhol, Hallam and Langrong. It also 
includes the strongly Mizo-influenced group Hmar; 
the Anal group mostly in south-eastern Manipur but 
also in Burma; the Chawte (Chote, Chaw, Kyaw) 
group of eastern Manipur and into Burma; the 
nearby Mayol (Moyon, ‘Mon’) and Lamgang; the 
Kom of south central Manipur; and various other 
groups whose languages are nearly extinct, having 
been replaced by Meithei: Aimol, Kolhreng, Purum, 
Tarao and so on. Roman orthographies have been 
created for some Kuki languages, but are not widely 
used. Some groups classified as Kuki in Indian 
sources are actually North Chin, and so are discussed 
below: Chiru, Gangte, Pawi, Simte, Thadou and so 
on. Mara was formerly known as Lakher from the 
Mizo name for the group; it is also known as Miram 
from their Central Chin name. Most Mara speakers 
are in India, but there are a few in Burma. There is a 
Roman orthography and a grammar (Lorrain 1951), 
but the language is being replaced by Mizo as are 
many other Kuki languages in India. Thang (2001) 
also groups Lawtu or Lautu with Mara, though other 
sources suggest it is a North Chin language. 

The North Chin languages of Burma are spoken 
by about 150,000 people concentrated in the Tiddim 
area of northern Chin State and along the western 
edge of Sagaing Division; including those in India, 
there are perhaps 250,000 speakers in all. In Burma 
some of the named subgroups are Thado; Siyin, Paite 
and Vuite; Sokte; Kamhau (as spoken around Tiddim 
or Tedim town) and so on. There are Roman ortho- 
graphies in use for most of these. For a description of 
Kamhau Chin, see Henderson (1965). The North 
Chin group includes the largest ‘Kuki’ group in India, 
Thadou with over 50,000 speakers; some also live in 
Burma. Speakers of North Chin languages of other 
smaller groups classified as ‘Kuki’ in India, such as 
Chiru, Gangte, Pawi and Simte, are concentrated in 
southern Manipur and more sparsely scattered 
elsewhere. 

The Central Chin are sometimes collectively 
known as Laizo (‘central people’); this includes a very 
large number of dialects, some of which are so differ- 
ent as to be mutually unintelligible. For Burma, the 
speaker population is about 400,000, with more in 
India and a few in Bangladesh; in Mizoram all 
other Central Chin languages and even some Kuki 
languages are classified as dialects of Mizo. Central 
Chin includes the western varieties Bawm (formerly 
known as Banjogi; see Reichle 1981) and Paangkhua 
(formerly known as Pankho) in Bangladesh; north- 
ernmost Zahao or Laizo in Burma and extending into 
India; also Bualkhua, Tashon or Taisun, Ngawn, 
Zanniat, Zophei, Lailen, Senthang or Sentang, Tawr, 
Khualsim and some other named groups elsewhere in 
Burma. They inhabit the central area of the Chin 
State, and some varieties are also known by place 
names including Falam, Haka or Hakha and so on. 
Several Roman orthographies exist and are in wide- 
spread use. 

Mizo, formerly known as Lushai, is also a Central 
Chin language. It has more speakers than any other 
Chin language (mostly in India, some in Burma and a 
few in Bangladesh) and various dialects. It is also 
known as Hualngo or sometimes still as Lushai where 
it is spoken in Burma; see Chhangte (1988) for a brief 
description. Various smaller Kuki and Central Chin 
groups are assimilating into the Mizo, culturally 
and linguistically. In India, Mizo is widely used in 
its Roman orthography and has special status in 
Mizoram State where it is the dominant local lan- 
guage. Some Chin nationalists have taken to using the 
new term Zomi to refer to the North and Central Chin 
together with the Mizo. 

Again, the South Chin group has many different 
names. In Bangladesh there are some speakers under 
the name Khyang (Bernot and Bernot 1958), which is 
an archaic pronunciation of the Burmese word for 
Chin. In Burma there are roughly 300,000 group 
members, but some of those living in the plains now 
speak only Burmese. Northern South Chin subgroup 
names include Zolamnai (‘on the trail’) as well as 
Welaung and Matu (place names; the latter call them- 
selves Ngala). In the central part are M'kang or 
Kaang, Nitu (with a new autonym Daai, about 30,000 
speakers), Hngizung, Utpu (a large subgroup, called 
Chinbon in Burmese); and to the south the Kcho 
(Chinbok or Saingbaung Chin in Burmese, about 
30,000 speakers; also known by alternative autonyms 
Müün and Ng'men). South Chin in general is also
known to its speakers as Ashö Chin (with ashö meaning
‘person’, cognate with zo in Central Chin).

The Khami/Khumi group includes several 
varieties, which fall into two subgroups: Khami versus 
Khumi. It is also suggested that both Khami and 
Khumi varieties spoken in Bangladesh and Burma are 
different, giving a maximum total of four languages. 
In Burma they total about 50,000, with fewer in Bang- 
ladesh; a Roman orthography exists. 


5.1.2.3 Central Tibeto-Burman 


The classification of these languages has not been 
finally determined, mainly because good data have 
until fairly recently been lacking. This is really a 
residual subgroup that includes most of the languages 
spoken along India’s north-eastern border, the north- 
ern tip of Burma, and the adjacent border area of 
north-western Yunnan Province, China. More or less 
recent intrusions by Tibetan from the north and 
Assamese from the south-west have influenced some 
of these languages more than others. 

There are seven groups here: 1) Lepcha, 2) 
Dhimalish, 3) Bugun-Khowa, 4) Adi-Mising-Nishi, 
5) Digarish ‘Mishmi’, 6) Mijuish ‘Mishmi’, and 
7) Rawang-Nungish. Benedict (1972) agrees with 
most of those, but calls 4 Mirish and classifies 7 as 
a link between Burmese-Ngwi and the rest of TB. 
Shafer (1974) agrees with 1 and 2, calls 3 Hrusish, 4 
Mishingish, has 5 and 6, and includes 7 in his Burmic. 
He is uncertain of the position of the other groups 
here within TB, tending to group them with Bodic or 
Burmic rather than Baric. It is clear from new data 
that Shafer was right: they do not form part of the Sal 
group, Burling’s (1983) name for Benedict’s (1976) 
extension of Shafer’s Baric. Sun (1993) has made 
a detailed reconstruction of the Adi-Mising-Nishi 
group, which he prefers to call Tani from part of the 
name of Apa Tani. He also demonstrates that a close 
relationship within this Central TB group cannot be 
proved on lexical grounds; the only subgroups he can 
tentatively link are 4 and 5, which are geographically 
adjacent. 

All these languages are verb-final; those spoken 
further east are tonal. The official Chinese nationality 
classification includes languages from groups 3, 4 and 
5 in the Luoba nationality, whose name is derived 
from Tibetan Hlopa, which refers to non-Buddhist 
groups of south-eastern Tibet. In China, languages of 
group 6 are unclassified for nationality, while lan- 
guages of group 7 are in the Dulong nationality and 
part of the Nu nationality; languages of groups 1 and 
2 are not spoken outside South Asia. 


5.1.2.3.1 Lepcha 


Lepcha or Rong was the traditional language of much 
of Sikkim and parts of Darjeeling District of West 
Bengal in India, and a small area of south-western 
Bhutan, but not many speakers remain; certainly 
fewer than 30,000, and perhaps as few as 4,000. It has 
a traditional Indic script; see Sprigg (1983) for details. 
In 1977, it was made one of the official languages of 
Sikkim; since then a substantial effort has been made 
to teach the script in schools. Plaisier (2003) provides 
documentation of the various Lepcha manuscripts 
in European libraries, and Plaisier (2006) is a solid 
description of the language in its current state. 




The genetic position of Lepcha is widely debated; 
while clearly TB, it has been classed with the TB lan- 
guages of Nepal by the Linguistic Survey of India, 
with Ao and thus in Kuki-Chin by Shafer, and recog- 
nized as aberrant for its geographical location by 
Benedict. Bauman (1976) notes some similarities 
with the outlying (and geographically nearest) 
Kuki-Chin language, Arleng. Most recently, Bodman 
(1987) suggested a fairly close connection with Adi 
and Rawang/Nungish, and a more indirect one with 
Jinghpaw; he also casts doubt on the AA (Austro- 
Asiatic) connection suggested in Forrest (1962); how- 
ever, as noted above, Sun (1993) rejects Bodman’s 
suggestion. Lepcha is inundated with Tibetan loans, 
and is clearly not Western TB, Sal or Burmese-Ngwi. 


5.1.2.3.2 Dhimalish 


Dhimal and Toto form another Central TB subgroup. 
The two names refer to what may be the same lan- 
guage in different locations. These are spoken in the 
plains south of Lepcha and west of most other 
Central TB languages, but near the easternmost 
Kiranti languages. Dhimal is spoken by a large group 
in the Terai plains of south-eastern Nepal, and Toto is 
spoken by a much smaller group in Jalpaiguri District 
of West Bengal. Toba and Dhimal (1999) provide 
data. 


5.1.2.3.3 Bugun-Khowa


There are four languages of north-west Arunachal 
Pradesh that form a group, being quite different from 
the other TB languages that surround them. They 
include Sulung, exonym for a small group who were 
formerly slaves of the dominant western Adi-Mising- 
Nishi groups around them. Their autonym is [poh 
yut], sometimes represented as Puroik, and most are 
now resettled in separate villages. Most speakers are 
in India, but there are about 2,000 speakers in Tibet, 
classified in the Luoba nationality. The group whose 
autonym is Bugun are better known as Khowa, and 
are under the cultural influence of the Monpa, but 
they claim a close relationship with the Sulung. 
Dhammai, better known as Miji and not to be 
confused with the Keman or Miju ‘Mishmi’ far to the 
east, is a small group whose language is similar to 
Hrusso, but only limited linguistic data are available. 
Hrusso (autonym) is better known by the exonym 
Aka or Angka; the language has two dialects. Apart 
from Sulung, all these languages are very poorly
described.


5.1.2.3.4 Adi-Mising-Nishi 


The Nishi group — including the Nishi, Bangni and 
Apa Tani languages — was formerly known collectively 
as Dafla, a name now out of use because it is viewed 
as pejorative. They are now more commonly referred 
to as Bangni (Bengni) or Bangru in the west, Apa 
Tani (Apa Tanang) around Ziro, Tagen or Tagin in 
the north-east, and elsewhere as Nishi, Nyishang or 
Nishang. Of the Nishi languages, only about 200 
speakers of Bangni live in Tibet. There, they are 
included in the Luoba nationality. 

Hill Miri is a small and scattered group in central 
Arunachal Pradesh; their language is closely related to 
Nishi and less closely to Miri or Mising. Speaker 
numbers are small, and census information includes 
them with the Miri or Mising. 

Adi, formerly known as Abor, is now divided into 
a large number of named subgroups, all speaking very 
similar dialects. It includes Gallong (autonym Galo) 
in the south-west, Bokar (Bogar) in the north-west, 
Pailibo, Bori and Ashing in the north central area, 
Tangam and Shimong (Simong) in the north-east, 
Minyong at the centre (with the Karko subgroup), 
Milang to their east, Pasi and Panggi at the south 
centre, and Padam in the south-east. Most of these 
groups have the status of separate tribes in India. 
Bokar extends into Tibet, where there are about 500 
speakers; they are classified in the Luoba nationality. 

Formerly known as Miri, the very large Mising
group (sometimes also called Mishing) is scattered
over the upper Brahmaputra valley and into the hills 
to the north. Many ethnic Mising in the plains speak 
only Assamese. A Romanization exists. The language, 
with many dialects, is particularly closely related to 
the Adi languages. 


5.1.2.3.5 Digarish Mishmi 


Idu is one of three languages whose speakers are 
included in the category ‘Mishmi’ in India; the Idu 
were formerly known as Chulikata (‘crop-haired’) 
Mishmi. For data, see Sun et al (1980) and Sun 
(1983a). Most speakers live in India, but there is one 
village of 60 people in south-eastern Tibet, where 
they are included in the Luoba nationality along with 
Sulung, Bangni and Bokar. 

The Taraon ‘Mishmi’ group, autonym Taruang, 
is also known as Tain, Taying or (in China) Darang 
Deng; they were formerly called Digaru Mishmi, 
which is the source of the subgroup name. There are 
700 in China and one village in far northern Burma 
where the language is moribund, being replaced by 
Rawang, but most are in India. In China they are 
unclassified for nationality, and in Burma they are 
simply ignored. For some data, see Sun et al (1980). 


5.1.2.3.6 Mijuish Mishmi 


The language of the Miju ‘Mishmi’, autonym Geman, 
is the third of the ‘Mishmi’ group. Other names based
on their autonym include Kaman in India and 
Geman Deng in China. In China, they are unclassified 
for nationality. There are 200 Geman speakers in 
China, but the vast majority are in India. 

The recently located Zha or Zaiwa language is 
spoken by about 700 people in the villages of Songgu, 
Lading and Tama in south-eastern Tibet and by about 
300 people in and around Walong in north-eastern 
Arunachal Pradesh in India. In Tibet they are 
included in the Tibetan nationality, while in India 
they are classified in the Meyor or Zakhring tribes 
according to their clan names. Zha is the Tibetan 
term for them; the term Zaiwa should not be con- 
fused with the Burmish group Atsi or Tsaiwa, repre- 
sented as Zaiwa in pinyin. 


5.1.2.3.7 Rawang/Nungish


The Jinghpaw name for this group is Nung, which is 
the source for the terms used by Benedict and Shafer, 
and should not be confused with the Central Thai 
Nung group of north-eastern Vietnam and adjacent 
areas of Guangxi, China. Former Chinese names 
for the TB Nung include Nuzi, Luzi and Jiuzi. The 
Rawang/Nungish group includes various subgroups 
with rather different languages and many subdialects. 
In Burma, where there are nearly 150,000 speakers, 
the term Rawang (which formerly referred only to the 
largest clan supergroup of four in Burma) is now 
used for these groups as a whole. In China, speakers 
are included in two nationalities, Dulong and Nu; the 
Nu nationality includes speakers of four different lan- 
guages: Dulong, Anung, Nusu and Zaozou. The Nusu 
group of southern Fugong County speak a distinct 
Ngwi language, and the Zaozou of south-western
Lanping County speak another; Dulong and Anung 
are Rawang/Nungish languages. 

In Burma, the Mvtwang clan dialect of the 
Mvtwang clan cluster of the Rawang supergroup has 
been chosen as the standard, and a Romanization 
implemented among Christians, with widespread 
literacy achieved. In China, Romanizations have been 
devised for Dulong and Anung but are not widely 
used. Among these groups there are various levels of 
self-classification: by clan, of which there are probably 
nearly two hundred; by clan cluster (as shown on 
the map); by supergroups of several clan clusters 
speaking a language; or more recently by the overall 
collective terms Rawang in Burma and Dulong or 
Nu in China. For some data on the history of this 
group, see Morse and Morse (1966); for a dialect 
study, see Morse (1989). 


The Dulong nationality in China (7,426 people), 
plus over 7,000 of the northernmost members of 
the Nu nationality, speak the language of the 
Dulong (Dulung, Trung) clan cluster, the northern- 
most Rawang/Nungish group. The total number of 
speakers is over 14,000; for descriptions, see Sun 
(1982) and LaPolla (2003). 

In Burma, the Zorwang clan cluster is also 
known as Jerwang or Tvluq; it forms part of the 
Gvnong supergroup, which is quite distinct from the 
Rvwang (Rawang) supergroup. A rough estimate is 
15,000 speakers. The clan cluster known as Dvru or 
Daru is also part of the Gvnong supergroup, and has 
approximately 35,000 speakers in Burma. 

The Anung clan cluster is divided, with 7,300 in 
Fugong County in China and about 10,000 across the 
mountains in Burma where they are also known as 
Kwinpang. In China, this language has been nearly 
completely replaced by Lisu, surviving only in one 
village among speakers over 40; but in Burma the
language survives much better, and a Romanized 
orthography has been prepared for the 4,000 or so 
remaining speakers. In China they form part of the 
Nu nationality, along with some Dulong speakers and 
all Nusu and Zaozou speakers. 

The Rvwang (Rawang) supergroup of clan 
clusters includes the Dvngsar clan cluster, also known 
as Tangsarr, with about 15,000 speakers. It also 
includes the Mvtwang clan cluster, which is the larg- 
est, with about 50,000 speakers. The Mvtwang clan 
dialect forms the basis for standard Rawang ortho- 
graphy. The Dvmang clan cluster is a subgroup of 
Mvtwang. Many Rawang in Burma who have a differ- 
ent first dialect can also speak Mvtwang, as this has 
been used as the literary dialect. 

Lungmi, also known as Longmi, is the southern- 
most clan cluster, and has undergone considerable 
Jinghpaw influence. With nearly 30,000 speakers, it is 
rather different from both Rvwang and Gvnong 
supergroups. 


5.1.2.4 North-eastern Tibeto-Burman 


While virtually all scholars agree that all the languages 
here are TB, their exact position within TB is not 
generally agreed. One proposal by Sun (1983b) is that 
most of them (except Bai, Na, Baima and Tujia) form 
a subgroup, which he calls Qiangic. Thurgood sug- 
gests that at least some of these languages (Jiarong, 
Qiang) are part of his wider Rong group. The former 
classification in China was to link Bai, Naxi and Tujia 
with the Ngwi languages, with Pumi and Qiang more
distantly related, and not to recognize various others 
as separate languages from Tibetan, since the speakers 
are classified as members of the Tibetan nationality; 
this is clearly incorrect. Like nearly all TB languages, 
the North-eastern TB languages are SOV. Unlike most 
other subgroups (and all adjacent subgroups), they 
have extensive verb morphology, which can be 
reconstructed for this subgroup. Nearly all are tonal. 
It is likely that the extinct Xixia or Tangut language 
also fits here; for further details of Xixia, see Gong 
(2003). 


5.1.2.4.1 Core North-eastern Tibeto-Burman 


An older Chinese name for this group was Xifan 
‘western barbarian’; a Tibetan name, which is also 
sometimes seen in the older literature, is Hér. All 
these languages fall into the Tibetan cultural orbit, 
and most groups are Tibetan Buddhists, speak some 
Tibetan and use literary Tibetan as their medium of 
literacy. Apart from the Qiang and the Pumi, who are 
recognized as separate nationalities in China, all are 
simply classified as Tibetan, despite their languages. 
Qiang (written with a different character) is also an 
ancient Chinese name for an ethnic group of the 
Gansu area, but it is unlikely that it has always had 
only its current referent. The languages of this group 
are quite diverse, but probably more closely related to 
each other than to anything else in TB. None has an 
orthography in widespread use, though Romanizations
are being developed for Qiang and for Pumi,
and a Tibetan-based script is also being developed for
Pumi. 

The ethnic groups officially classified as Tibetan 
are the 14,000 Baima, 170,000 Jiarong, 35,000 Ergong, 
7,700 Zaba, 7,000 Guichong, 15,000 Ersu, 7,000 Choyo, 
15,000 Muya, 1,800 Shixing and 1,700 Namuyi. The 
40,000 Pumi in Sichuan are also classified as Tibetan, 
while those in Yunnan are classified as a separate 
nationality. There are over 300,000 ethnic Qiang, but 
only about half can speak the language. 

Baima is spoken by about 10,000 of the 14,000 
ethnic Baima, who live mainly in Jiuzhaigou and 
Pingwu counties of northern Sichuan and Wen 
County in southern Gansu. Some scholars regard 
this as a variety of Tibetan, but there is a small 
non-Tibetan TB element, most probably of the 
North-eastern TB group. A grammar by Katia 
Chirkova should appear soon. 

Qiang has a 2000 census population of 306,072, 
which has increased radically in the last twenty years
through non-speakers of full or partial Qiang ethnicity
reclassifying their nationality; the 1982 census
figure was 102,678. There are probably about 150,000 
speakers of a very diverse range of varieties; the main 
division is into northern and southern. Northern 
Qiang, with about 70,000 speakers, is nontonal and 
has an amazing array of initial consonant clusters, 
while southern Qiang, with about 80,000 speakers, is 
tonal and has lost many of the initial clusters. There 
are various excellent studies of Qiang, including those 
of Sun (1981), Evans (2001) and LaPolla and Huang 
(2003). The former Maowen Qiang Autonomous 
County in north central Sichuan, now known as Mao 
County, has over 80 per cent of speakers among its 
Qiang population; Heishui and Songpan counties to 
the north also have a similar proportion of speakers 
among their smaller Qiang populations. Wenchuan 
and Li counties to the south have about 60 per cent 
of speakers among their Qiang populations, while 
almost none of the many ethnic Qiang in Beichuan 
County to the east can speak Qiang. 

Jiarong, written as rGyarung in Tibetan, is a 
complex chain of many dialects with a total of about 
170,000 speakers over a wide area of Aba Prefecture, 
including Ma'erkang, Jinchuan, Zamtong, Xiaojin, 
Hongyuan, Heishui, Li and Wenchuan counties, and 
also Danba and Dawu counties in north-east Ganzi 
Prefecture. These are all in north-western Sichuan. In 
the area where they overlap with Qiang in the north- 
east of this area, the Qiang speakers live at lower alti- 
tude, between 1,200 and 2,800 m (4,000 to 9,000 ft), 
while the Jiarong live between 2,500 and 3,400 m 
(8,000 to 11,000 ft). In the west of the Jiarong area, 
Tibetan speakers live above 3,200 m (10,500 ft). 
Jiarong is divided into several rather distinct dialects 
that could also be regarded as languages, including 
north-western or Sidaba, eastern or Sita and north- 
eastern or Chabao, as well as Lavrung in the southern 
part of this area. A number of scholars including 
Jackson Sun Tianxin and Guillaume Jacques are work- 
ing on Jiarong, and there are various excellent earlier 
studies including Nagano (1983) among others. 

About 35,000 Ergong speakers, also known as 
sTau in Tibetan or as Daofu in some Chinese sources, 
and not to be confused with the Tibetan-speaking 
rTahu, are in Dawu, Danba, Luhuo and Xinlong 
counties of Ganzi Prefecture and in Jinchuan, 
Ma'erkang and Zamtong counties of Aba Prefecture, 
to the south-west of the Jiarong area. Ergong is lin- 
guistically quite close to Jiarong among the languages 
of this group of TB. 

Some 7,700 Zaba speakers live in Zaba Town- 
ship of Dawu County and in Zamai Township of 
Yajiang County, in Ganzi Prefecture of western 
Sichuan. About 7,000 Guichong speakers are found in
Yutong District of Kangding County, Sichuan. The 
15,000 Ersu in Jiulong, Mianning, Puge, Yuexi, 
Ganluo and Muli counties of south-western Sichuan 
speak a very diverse range of varieties, some perhaps 
not mutually intelligible; their name means ‘white 
people’. They are also known as Tosu; there is an 
old vocabulary under this name, using Tibetan 
and Chinese to represent Ersu, collected in the early
fifteenth century AD, published and analysed by
Nishida (1973). Other Chinese names for Ersu are
Lüzi or Lüsu. Over much of its range, Ersu is in contact
with the Ngwi language Nosu, which moved into
the area several hundred years ago, and many Ersu
speakers also speak Nosu.

The 7,000 Choyo, referred to as Queyu in 
Chinese, speak their language in Yajiang, Xinlong and 
Litang counties of Ganzi Prefecture, Sichuan. The
15,000 Muya speakers are also known as Manyag in 
Tibetan, whence Menia in some western citations. 
They live in Kangding and northern Jiulong counties, 
Ganzi Prefecture, Sichuan. Shixing has about 1,800 
speakers, who are all of Tibetan nationality, in 
Shuiluo township, Wachang District, Muli County, 
Sichuan; descriptive work by Katia Chirkova is under 
way. 

Namuyi is spoken by about 1,700 people of 
Tibetan nationality in a compact area, mainly in 
north-western Xichang, but also in southern 
Mianning and north-eastern Yanyuan counties in 
southern Sichuan. Some scholars classify Namuyi as 
transitional to the Ngwi languages; Namuyi is in 
close contact with Nosu and has borrowed some 
Ngwi lexicon from this source. Descriptive work by
Fuminobu Nishida is under way. 

Pumi is the southernmost core North-eastern TB 
group, with 41,028 people (2000 census) in north- 
western Yunnan and about 40,000 more in southern 
Sichuan classified in the Tibetan nationality; their 
autonym means ‘white people’. Their likely original 
territory was in Ninglang County in north-western 
Yunnan and adjacent areas of Yanyuan, Muli and 
southern Jiulong counties in Sichuan. Many Pumi 
served as soldiers during the Yuan Dynasty, and a 
substantial number moved to what are now Lanping, 
Weixi and Lijiang counties, further west in Yunnan, 
initially as soldiers. Some Pumi now speak only 
Chinese, others in parts of Ninglang speak only 
Moso, and many are bilingual in these or other sur- 
rounding languages such as Nosu, Lisu and so on. 
There are now over 70,000 speakers; some sources 
divide these into northern and southern dialects, but 
more work is needed to clarify the situation. Ding 
Sizhi has been documenting Pumi for a number of 
years; for descriptions, see Lu (1983) and Ding 
(2003). Many Nosu speakers have moved into the 
Pumi-speaking area over the last couple of centuries, 
and there is extensive bilingualism among Pumi, 
Nosu and Moso in Yanyuan and northern Ninglang 
counties. 


5.1.2.4.2 Other North-eastern TB languages 


The Na group comprises the two Na languages, Naxi
and Moso. Transitional between North-eastern TB
and Burmic, they share lexical material with both
subgroups. Like Bai and the Tujia languages, Na 
languages lack the extensive morphology of core 
North-eastern TB. Naxi (‘Na people’) to the west has 
two mutually intelligible dialects, the more conserva- 
tive one with about 15,000 speakers furthest west in 
Weixi County in the Mekhong valley, and the less 
conservative one mainly in Lijiang but also extending 
into southern Zhongdian (Shangrila) and north- 
western Yongsheng counties; there are about 270,000 
speakers. Naxi has a traditional logographic writing 
system used by traditional tomba shamans and exten- 
sively studied by Joseph Rock. This is a mnemonic for 
religious texts known partly by heart; not all words 
are written. There is also a related syllabic system 
that represents every word and can be read without 
knowing the text. A Naxi Romanization has also been 
developed, but is hardly used. For a description of 
Naxi, see He and Jiang (1985); work by Alexis 
Michaud and others is in progress. 

The Moso (Chinese exonym) or Na (autonym) 
live around Lugu Lake, mainly in Ninglang County in 
Yunnan, where they are classified as Naxi nationality, 
and in western Yanyuan and south-western Muli 
counties, with a total of about 50,000 speakers includ- 
ing some Pumi. Most Moso in Sichuan are classified 
as Mongol nationality, but about a quarter are
classified as Naxi; this is said to be because a Mongol
general and his army came into the area and inter- 
married with the Moso. The Moso are widely known 
for their matrilineal society; their culture and the 
scenery have made Lugu Lake a major tourist destin- 
ation. Apart from Tibetan Buddhism, they also have 
their indigenous religion with daba shamans, like the 
Naxi tomba but without the orthography. The exact 
boundary between Naxi and Moso remains to be 
determined; there are some transitional varieties in 
north-western Ninglang and north-eastern Lijiang 
counties, and some Naxi speakers scattered across 
northern Ninglang and south-western Muli counties. 

Tujia is a very large nationality, with over eight 
million recognized members; but only about 61,000 
of them speak either of the two Tujia languages, Biji 
and Mozi. This nationality and language were ‘discovered’
after 1950, before which they were regarded
as Han Chinese. Biji or Northern Tujia is spoken by 
about 60,000 people, mostly in southern Longshan 
County along the Youshui River, and Mozi or 
Southern Tujia is spoken by about 1,000 people west 
of Tanxi Town in Luxi County, both in western 
Hunan Province. Speaker numbers have rapidly 
declined, from about 170,000 in 1982. The decline of 
Mozi has been particularly rapid, losing 90 per cent 
of its speaker base in this time. There are still some 
villages where children learn the Biji language, and 
one with some child Mozi speakers, but this is the 
exception. On the other hand, Biji is still widely used 
by adults in its remaining core area. The Tujia lan- 
guages have very numerous Chinese loanwords; there 
are no monolingual Tujia speakers. Brassett and 
Brassett (2005) provide a detailed sociolinguistic 
survey of the language situation, and Tian et al (1986) 
is the first among various descriptions. 

By contrast, Bai has a long and distinguished 
history, as the main language of the Erhai Lake area, 
centre of the Nanzhao kingdom of western Yunnan. 
Due to nearly two millennia of contact and extensive 
borrowings from Chinese dialects, it is extremely dif- 
ficult to determine the exact position of Bai within 
TB. Some linguists have even suggested that Bai has 
by now become a Chinese dialect. Its syntax is SVO 
like Chinese, but there are some TB syntactic features 
as well. The widespread idea that Bai is a Ngwi lan- 
guage seems to be incorrect, but its exact classifica- 
tion within TB requires much further work, including 
the identification of Chinese loanwords, which make 
up over 80 per cent of its core lexicon. A Roman 
orthography exists, based on the northern or Jian- 
chuan County dialect of Bai, but the speakers have 
long been accustomed to use Chinese characters for 
writing. The dialects of Bai have substantial differ- 
ences, as documented by Allen (2004), but not such as 
to lead to complete lack of intelligibility. In 2000, 
there were 1,858,063 members of the Bai nationality, 
but only about 1.1 million speak Bai, mainly those 
who still live in the traditional area of north-western 
Yunnan; there are some small groups of speakers 
elsewhere, such as in Tuanjie Township of Xishan 
District just west of Kunming and in Yinyuan Town- 
ship of Yuanjiang County in south central Yunnan. 
For a brief description of Bai, see Wiersma (2003); Xu 
and Zhao (1984) provide a fuller description. 

There is a separate language known as Laemae in 
Luobenzhuo Township of Lushui County and as 
Lama in western Lanping County, both in far north- 
western Yunnan, which is closely related to Bai but 
not mutually intelligible. Speakers are classified in 
the Bai nationality; there are about 5,000 speakers. 
In Fugong County, some descendants of Laemae 
speakers are still classified as Bai nationality, but 
speak only Lisu. However, most are now dispersed 
across the Lisu range in China and Burma, are classi- 
fied as Lisu and function as a Lisu clan; these people 
are unaware of their non-Lisu heritage. 


5.1.2.5 Burmese-Ngwi 


Burmese-Ngwi (BN; also formerly known as Burmese-
Lolo or Lolo-Burmese) is a large and diversified part of
TB; it can be subdivided into two main subgroups,
Burmish (including Burmese) and Ngwi (formerly
Loloish, also known as the Yi Branch in China and 
sometimes recently as Yipho following an unpub- 
lished suggestion of Paul K. Benedict). Two other 
subgroups should be added. One is Mru from the 
border of Burma and Bangladesh, and the other is 
Gong or Ugong in western central Thailand. All BN 
languages are verb-final, with complex tonal and con- 
sonant systems but little or no morphology. One key 
criterion for membership in BN is the innovation of a 
third reconstructible tonal category, in addition to the 
two found in many TB groups. There are also many 
other BN lexical and other innovations. 


5.1.2.5.1 Mru 


Mru is spoken by about 40,000 people, about two- 
thirds of them in north-western Arakan State of 
Burma, and one-third in the Chittagong Hill Tracts 
of Bangladesh. In addition to its BN core, it shows 
various layers of contact vocabulary from Kuki-Chin 
languages and from Burmese. According to Löffler
(1966b) and Shafer, Mru is not Kuki-Chin as some- 
times suggested, but is rather a separate and divergent 
major subgroup of BN. 


5.1.2.5.2 Gong 


The Gong or Ugong ‘Gong people’ language is 
spoken by about 80 of a small and diminishing 
population of 500 in two villages of western central 
Thailand. All speakers are fluent in one or more 
varieties of Thai; those under 40 who speak Gong at 
all, about three-quarters of the speaker total, are semi- 
speakers, some with very limited ability. In many 
former locations, the language is no longer spoken. 
This is a key language for the understanding of the 
development of BN, and has some spectacular sound 
changes: *mr to g, as in the group name, and so on.
There are also complex tone sandhi processes and a 
complex vowel system, both of which are breaking 
down in the speech of the semispeakers. For some 
details, see Bradley (1989). 


5.1.2.5.3 Burmese


Burmese is the national language of Burma, with 
about 35 million speakers as a first language, usually 
known as Burmans and officially renamed Bamars in 
1990, and over ten million speakers as a second lan- 
guage. Some members of other ethnic groups in 
Burma, especially many Mons and some Shans, are 
also monolingual in Burmese. The Burmans probably 
arrived in Burma as the armies of the Nanzhao king- 
dom, which conquered the Pyu in what is now central 
Burma in AD 960. The first Burman kingdom was
based at Pagan, with an era backdated to AD 638;
however, the first historical king was Anawrahta 
(1044-77), who conquered the Mons in Lower Burma
and unified the plains of Burma for the first time. 
His general and successor Kyanzittha (1077-1113) 
brought Mon culture to Pagan. 

Burmese has a long literary history, with the 
earliest dated inscription of 1112 in its Indic orthog- 
raphy adapted from Mon. This is a quadrilingual 
inscription in Pali, Mon, Pyu and Burmese known 
as the Myazedi from the name of a temple at Pagan, 
or as the Rajkumar (Pali for ‘crown prince’) from 
the person who dedicated it. It was probably prepared 
for Alaungsithu (1113-65), the son and successor of 
Kyanzittha. Burmese is still written in a spelling 
reflecting late Pagan pronunciation. There are dig- 
lossic high and low varieties, with the former used 
mainly in written and formal spoken contexts and 
often reflecting a more archaic form, but not quite as 
archaic as the spelling. The standard spoken variety 
is the speech of Mandalay, the last capital of the 
Burmese kingdom before its conquest by the British 
in 1885. The most divergent and phonologically 
conservative local variety is spoken to the north of 
Mandalay. For further details, see Bradley (1988). 

Burmese is a typical TB language with verb-final 
syntax and modifier-head order. Most grammatical
markers have diglossic high and low paired forms; for
example, the realis verb marker, high thi and low teh, 
and the locative marker, high hnai' and low hma. 
Some frequent closed-class lexical items also have 
different forms, such as ‘this’, high i and low di. As in 
other BN languages, when combined with a noun the 
numeral follows the noun and must be followed by a 
classifier. 

The standard phonology is the same for both 
high and low Burmese. There are stops and fricatives 
including unaspirated p t s c k, aspirated hp ht hs hc hk 
and voiced b d z j g. The fricatives s hs z derive from 
former affricates [ts] [tsʰ] [dz]; there is an increasing 
tendency to merge hs into s. The initials c ch j derive 
from former consonant clusters with initial velars and 
medial liquids or glides, and are still written this way. 
Other consonants are voiced nasals and lateral m n ny 
ng l; voiceless hm hn hny hng hl (with the digraphs ny 
and ng representing palatals and velars respectively); 
fricatives th (an interdental [θ]), sh (like English 
sh) and h; and the glides w and y. There is also an 
archaic and highly marginal hw. The only clusters are 
initial py hpy by my hmy. The vowels fall into three 
categories: oral, nasalized and with final glottal stop. 
The oral non-stopped vowels are i e eh a aw o u; the 
digraphs eh and aw represent [ɛ] and [ɔ] respectively. 
Nasalized vowels and diphthongs are in ein ain an 
aun oun un; stopped-syllable vowels and diphthongs 
are i' ei' eh' ai' a' au' ou' u'; here, eh' again represents 
[ɛʔ]. There are also a few diphthongs with a w onglide: 
we weh wa win weh'. The vowels un and u' are in the 
process of merging with in and i'. There are three 
tones in non-stopped syllables: falling, low and high 
creaky. The nasalized syllables and those ending with 
glottal stop are written with various final nasals or 
stops, reflecting their former pronunciation. There is 
extensive juncture voicing, marking close juncture 
within the noun phrase and between the verb and 
associated markers. The standard transcription of 
Burmese used here is that of Okell (1969), also an 
excellent grammar of low or spoken Burmese. For a 
grammatical summary of high or literary Burmese, 
see Okell and Allott (2001). 

One distinct variety closely related to Burmese is 
Arakanese (known as Rakhine in Burma since 1989) 
in the west, with about 1.7 million speakers; this is 
also spoken in south-eastern Bangladesh and southern 
Tripura in India, where it is usually known as Mogh 
or Magh. Arakan was reconquered by the Burmans in 
the 1780s, and at that time much of the Arakanese 
court fled to what is now Bangladesh, where their 
descendants now call themselves [marama] but are 
better known as Marma. Other such varieties are 
south-eastern Tavoyan (400,000) and Beik or Merguese 
(250,000); east central Intha (90,000, around Inle 
Lake), Danu (100,000) and Taungyo (40,000); and 
west central Yaw (20,000). All the regional dialects are 
in some ways more conservative phonologically than 
standard Burmese; for example, Arakanese retains 
the distinction between r and j, Tavoyan keeps medial 
l, and so on. There is ongoing convergence back 
towards standard Burmese in most of 
these languages. Other than the Arakanese, who have 
their own state, most are considered to be members 
of the Bamar majority ethnic group. 


5.1.2.5.3.1 Other Burmish languages 


To the north-east of the Burman area is a hill area 
with some inhabitants speaking closely related lan- 
guages. There are four main groups, all to some 
degree integrated into the Kachin or Shan cultures of 
the surrounding majorities. These groups go by dif- 
ferent names as shown in Table 5.4. 


In Burma, they are usually referred to by their 
Burmese names, mostly derived from the Jinghpaw 
names, and in China by their Chinese names, the 
latter mostly derived from local autonyms. The 
Achang, with about 45,000 speakers (including 
33,936 in China), formed a separate Shan-like valley 
state and are recognized as a nationality concentrated 
in Longchuan and Lianghe counties, with smaller 
numbers nearby in Tengchong, Luxi and Longling 
counties along the river valleys of Dehong and 
Baoshan prefectures, and further afield in Yunlong 
County. In Burma they are mingled with the Lashi 
and as individual blacksmith families in many Shan 
villages in northern Burma. Lashi speakers number 
some 30,000, mostly in the Bhamo and Myitkyina 
areas of Burma with a few nearby in Yingjiang 
County in China. The 100,000 Maru speakers are 
widely dispersed across the northern Shan State of 
Burma and the Bhamo area, as are the 150,000 Atsi. 
In China, Atsi are the vast majority among the Jingpo 
nationality; they and the Maru are concentrated in 
Longchuan, Yingjiang and Luxi counties and Ruili 
City. 

For most purposes the Atsi, Maru, Lashi and 
some Achang operate as clans within the Kachin 
culture complex, bilingually speaking Jinghpaw as 
their literary language. With clan exogamy, every 
speaker of a father’s Burmish language also speaks a 
mother’s other Burmish or Jinghpaw language 
natively, and possibly other languages of the culture 
complex. Two very small clan-based groups, Bola and 
Chintau, are found only in China. The 400 Bola 
speakers are classified as Jingpo nationality and are 
mingled with the Atsi in Luxi and Yingjiang counties; 
all also speak Atsi and Jinghpaw. The 70 Chintau 
speakers are classified as Achang nationality, and live 
in two villages, Meng’erzhai and Xiandaozhai, in 
Yingjiang County. They are also known in Chinese as 
Xiandao; their autonym is [k"an”' taw’']. All Chintau 
speakers can also speak Jinghpaw and Chinese and 
can understand Achang; the language is severely 
endangered, with no child speakers. Romanized 
orthographies for Atsi, Maru and Achang exist, but 
are hardly used. Within each group, the regional 
differences are substantial, sometimes even as great as 
those between different Burmish groups who live 
together and intermarry. Maru and Lashi are charac- 
terized by the addition of final stops to some syllables; 
these are absent in Atsi and Achang. In China, most 
members of the Jingpo nationality are Atsi, with 
smaller numbers of Maru, Lashi, and Bola, as well as 
some speakers of Jinghpaw; in Burma the proportion 
of the Kachin who speak Burmish home languages is 
much smaller, but still substantial, especially in the 
Northern Shan State and the south-east of the Kachin 
State. There are also small communities in Thailand. 
Achang is described in Dai (1985); Atsi is described 
in Xu et al (1984). 

The final Burmish language is Hpun (Phun, 
Hpon, Hpön), formerly spoken in the gorges of the 
upper Irrawaddy north of Bhamo; it has two dialects, 
north and south. Both are virtually extinct; at the 
most there are a few old semispeakers left. It was 
phonologically more conservative than the Burmish 
languages of the Kachin group. 


5.1.2.5.4 Ngwi languages 


For details of the phonological and lexical subgroup- 
ing of these languages, see Bradley (1979a). Basically, 
all Ngwi languages share an innovative two-way tonal 
contrast in original stop-final syllables; the Northern 
Ngwi languages have mostly reversed phonetic values 
for these two tones compared to other Ngwi languages. 
Central Ngwi is characterized by extensive 
tonal splits, leading to complex tonal systems 
including contour tones. 


Table 5.4 Burmish languages in the Kachin culture complex 


Own name    Jinghpaw name    Burmese name    Chinese name 
Lawngwaw    Maru             Maru            Langsu 
Tsaiwa      Atsi             Zi              Zaiwa 
Lachik      Lashi            Lashi           Leqi 
Ngochang    Ngochang         Maingtha        Achang 

In China, the Yi nationality includes seven lan- 
guage clusters, three of which (Nosu or Northern Yi, 
Nasu or Eastern Yi and Nisu or Southern Yi) are 
closely related to each other and form Northern Ngwi. 
Three of the remaining four, South-eastern Yi 
(including Sani, Axi, Azhe and possibly Azha), Cen- 
tral Yi (Lipo and Lolopo among others) and Western 
Yi (including Lalo and others), form part of Central 
Ngwi. The Pula cluster includes Pula, Muji and so on, 
which are South-eastern Ngwi languages also classified 
in China as South-eastern Yi. There are a few South- 
ern and South-eastern Yi in northernmost Vietnam 
and a few Central Yi in north-eastern Laos. Other 
nationalities in China whose members speak Ngwi 
languages are the Hani, Lisu, Lahu and Jinuo. 


5.1.2.5.4.1 Northern Ngwi 


The three main languages in the Northern Ngwi 
group, Nosu, Nasu and Nisu, all use the autonym 
‘black people’; the form of the word ‘black’ of course 
differs. All three, as well as the Central Ngwi Sani, had 
traditional logographic scripts known to traditional 
religious practitioners. Revised versions of these 
scripts are now being disseminated. 

Nosu is the Ngwi language with the largest 
number of speakers; there are about 2.1 million in 
Liangshan Prefecture and adjacent areas of southern 
Sichuan, and about 200,000 more, mainly in Ning- 
lang County in north-western Yunnan province, but 
also a few in north-eastern Yunnan and a couple of 
villages in north central Yunnan and south into 
Fumin and Anning counties around Kunming. Over 
the last fifty years, many villages of Nosu speakers 
have moved further west, reaching nearly every 
county of north-western Yunnan; most of these speak 
the Central or Shengza dialect. There are three main 
dialects: Northern with subdialects Tianba and Yinuo, 
Central Shengza, and Southern with subdialects Sondi 
and Adur; the northern and central dialects are fairly 
similar to each other, while the southern dialects 
are more different. Map 46 shows these three main 
dialects within Nosu. The largest group, over half of 
the total, is the Shengza, and the Xide local variety of 
Shengza has been selected as the standard, with a new 
syllabic orthography based on the traditional Nosu 
characters in extensive use since 1978; a Romaniza- 
tion is used occasionally. 

There are several different Ngwi groups who are 
called Shuitian ‘irrigated field’ or Shui ‘water’ Yi in 
Chinese. One is the 10,000 Muhisu of Xichang, Xide 
and Mianning counties in south-western Sichuan, 
briefly described in Zhu and Zhang (2005); only 
about 1,000 older people can speak the language; all 
speak Chinese and some also speak Nosu. They 
report that they came from Yunnan, where the Nasoid 
language Naluo (also known in Chinese as Shuitian) 
is spoken, but Muhisu is a Nosoid language, that is, it 
is most closely related to Nosu, not Nasu. A second 
Nosoid group is Ghomozo, which is spoken in north- 
eastern Heqing County in Yunnan by about 5,000 
people, presumably originating from the Nosu area in 
Sichuan. The third Nosoid group is the 1,200 Nyisu 
who live along the Pudu River west of Kunming in 
one village in north-western Anning County and a 
cluster of three villages in north-western Fumin 
County; Nyisu is now moribund, with only about 300 
speakers remaining; it is quite similar to the southern 
Adur dialect of Nosu, and some suggest that the 
Nyisu are escaped slaves of the Nosu. The locations of 
the Nyisu are in a very complex area with a range of 
Nasoid languages around them, and are not indicated 
on the map. 

Nasu is fairly closely related to Nosu, but they are 
not mutually intelligible. This group is widely spread, 
with nearly 850,000 in western Guizhou, over 600,000 
in north-eastern Yunnan, and 9,712 in north-western 
Guangxi. Of its roughly one million speakers, more 
than half are in Yunnan; there are several hundred 
thousand more nonspeakers, mostly young people 
in all three provinces but especially in Guizhou, who 
are members of the group. Dialect differences are 
extremely complex, and there are three main sub- 
groups: south-eastern (known also as Panxian, from 
the county in Guizhou where they are concentrated); 
north-eastern (most of the other Nasu in Guizhou, 
and some in extreme north-eastern Yunnan and 
south-eastern Sichuan; four main subdialects); and 
western (all in Yunnan, with three main subdialects: 
western Naisu, also known as Red Yi; central Nasu, 
also known as Black Yi; and south-eastern Gepo, also 
referred to in the missionary literature as Képu). The 
distribution of Naisu, Nasu and Gepo varieties is 
shown on the map. The traditional characters are 
being brought back into use among the Nasu in 
Guizhou; separate Pollard missionary scripts existed 
for all three Yunnan subdialects, and these still enjoy 
some very limited use among Christians. In Guizhou, 
the traditional Nasu script is being promoted. In 
Yunnan, a new combined Yi script was created in the 
1980s and is intended to be used by all Yi groups in 
Yunnan except the Nosu, but this has encountered 
resistance and lack of enthusiasm, and its use is 
limited. 

There are various small groups speaking Nasoid 
languages similar to Nasu and classified as Eastern Yi 
by Chinese linguists. These include Aluo, Ayizi, 
Chesu, Samei, Samataw and Sanie. 

Aluo goes under a wide range of other names. 
One is the Nasu name Laka, used also in Christian 
missionary work in a Pollard script for this language. 
Another is the vague Chinese exonym Gan Yi, which 
also refers to a number of other Ngwi groups in the 
same area. Their autonym is Ala in some dialects, 
and Aluo in others. They are also referred to as Yala 
(local Chinese exonym), Lila or Niluo in Harrell 
(2001: 265-7) in Miyi County, Sichuan, where some 
hundreds of them live in Hengshan and Malong 
townships. Most of the Aluo live in 
northern Wuding, Luquan and Yuanmou counties of 
Yunnan, and across the Upper Yangtse in Huili and 
Miyi counties of Sichuan. There are about 40,000 
group members, but probably only about 25,000 
speakers. 

Ayizi is spoken in Aimailong village of Beidacun 
Township in north-western Shilin County, where 
there are under 500 people of Yi nationality who call 
themselves Ge. The language is moribund, with fewer 
than fifty old speakers, and undocumented. There 
may be other Ayizi nearby in other counties. Their 
autonym suggests that they may form part of the 
Gepo subgroup of Nasu. 

Chesu is spoken in north-western Xinping, 
south-eastern Shuangbai and south-western Eshan 
counties by about half of a group of 6,600 people. 
This is the south-westernmost Nasoid language, and 
is in very close contact with Nisu, which is replacing 
it, and with Central Ngwi Hlersu in the same area. 

Samei is the language of a large group immedi- 
ately south-east of Kunming city, mainly in Guandu 
District and centred on Ala Township, but also 
extending eastward as far as western Yiliang County. 
It has about 20,000 speakers out of about 35,000 in 
the group, and is moribund in many areas closest to 
Kunming and in decline elsewhere. Their autonym 
is Sani, but they are usually known by the Chinese 
exonym Samei, which avoids confusion with the Sani 
to their east and the Sanie on the other side of 
Kunming. 

Samataw is the language of Zijun village (also 
known as Da'er ‘big ear’ village; the inhabitants are 
said to have large ears) in the north-western part of 
Guandu District, just south of Kunming city. While 
there are 2,800 people of Yi ethnicity there, only 
about 500 people over 50 can speak the language. 
However, the authorities are attempting to revive 
the language in the local school, and have set 
themselves up as a Yi cultural tourist attraction. 
The language is Nasoid, and has nothing to do with the 
similarly-named Samatu group who speak a Central 
Ngwi Laloid language far to the west. 

Sanie is spoken by about 8,000 of an ethnic 
group of 17,300 who live mainly in Xishan 
District just west of Kunming, but also extend slightly 
into northern Anning County and south-western 
Fumin County. Again, the language is moribund in 
villages closest to Kunming, such as Chejiabi, and 
better-maintained further away in Tuanjie and Gulu 
Townships. The easternmost dialects maintain a con- 
servative autonym, Sa'ngwie, with an initial velar in 
the second syllable of the autonym. This provides the 
best support for the hypothesis that the autonym of 
the Ngwi groups contained this cluster. For further 
information on Sanie, see Bradley (2005). 

The Kazhuo language is spoken by a concen- 
trated group of Mongol nationality in Xinmeng 
Township of Tonghai County in central Yunnan; their 
name means ‘Ka people’. They claim descent from a 
Mongol army, but their language has no vestige of 
Mongol; it is simply another Northern Ngwi language. 
There are 6,341 Mongols in this area, but only about 
4,000 speakers, as many younger people use only 
Chinese. For a description, see Mu (2003). 

Nisu or Southern Yi is also rather diverse; in 
addition to about 800,000 speakers, there are some 
nonspeaker members of the group. It has a wide 
range of dialects. This includes Nasu (not to be con- 
fused with the Eastern Yi Nasu) in Yimen, Jinning, 
Hongta and northern Eshan counties or districts; 
Nisu in Xinping, Shuangbai, Yuanjiang, Mojiang and 
Shiping counties; and Niesu in Jianshui, Honghe, 
Luchun and other counties of south central Yunnan. 
A few Nisu live in Vietnam where they are known as 
Lolo. There is a logographic Nisu orthography still 
used by indigenous religious practitioners, the vehicle 
of a very rich religious and other traditional litera- 
ture. A surprising number of Nisu manuscripts are in 
overseas libraries in the UK, France and elsewhere. 


5.1.2.5.4.2 Central Ngwi 


The South-eastern Central subgroup of Central Ngwi 
comprises four named groups, Sani, Axi, Azhe and 
Azha, which are spoken in east central Yunnan, just 
to the south-east of Kunming. The Chinese official 
classification groups these as ‘South-eastern Yi’ along 
with Pula, Muji and other languages of the South- 
eastern Ngwi group. The best described is Sani, for 
which a traditional logographic writing system exists; 
see Ma (1951). Sani is the main speech form of the Yi 
nationality in Shilin County, and is also spoken in a 
number of surrounding counties by a total of about 
120,000 people. The autonym of this group is actually 
Ni; Sani is a Chinese exonym. The Shilin County 
government promotes the use of traditional Sani 
logographic characters and research on Sani topics. 
Axi is spoken just to the south of Sani in north- 
eastern Mile County by about 100,000 people. Azhe is 
spoken in north-western Mile County by another 
60,000 people; it is written with a logographic script 
similar to Nasu. Azha is spoken in southernmost Mile 
County and beyond by about 40,000 people; it may 
be that Azha will turn out to be a South-eastern Ngwi 
language, once a better description is available. 

There are three languages in the Lisoid subgroup: 
Lisu, Lipo and Lamu. Lisu is spoken by a large and 
widely dispersed group totalling about 850,000, with 
over 600,000 in China (most in Yunnan, especially the 
north-western part, but about 11,000 in southern 
Sichuan as well); nearly 300,000 in Burma, mainly 
in the north; about 40,000 in Thailand, and 2,700 
in north-eastern India, where they are also known 
as Yobin from the former Burmese name Yawyin, 
in turn derived via Jinghpaw from Chinese yeren 
‘wild people’. Dialect differences are substantial; the 
Thailand dialect has extensive Chinese loans. Various 
orthographies exist; those now used include the 
Fraser script, devised by a missionary of that name, 
which uses upper-case Roman letters, upright and 
inverted, and punctuation marks for tones; and the 
new Chinese Romanization, which uses numerous 
digraphs, and uses extra consonants after the vowel to 
indicate tones. Use of the latter is on the decline. For a 
brief description of Lisu, see Bradley (2003). 

Lamu is a small group of 295 people in two 
villages in north-eastern Binchuan County who are 
classified as Lahu, but who speak a language fairly 
similar to Lisu. All are multilingual, speaking also the 
local varieties of Lipo and Chinese; many also speak 
local Lolopo. A very small number of people with a 
Lipo father and a Lamu mother can also speak Lamu; 
the Lipo in this area, who are the westernmost Lipo 
speakers, are classified as Lisu nationality. 

Lipo and Lolopo are the languages of two groups 
classified by the Chinese as Central Yi. 
The Lipo in Binchuan, Yuanmou, Wuding and Luquan 
counties of Yunnan and Huidong, Huili, Miyi and 
nearby counties in Sichuan are classified as Lisu 
nationality, but the rest are all classified as Yi national- 
ity. Lipo has about 250,000 speakers and is linguistic- 
ally very close to Lisu. Some scholars have even called 
this an eastern dialect of Lisu; however, the two are 
not mutually intelligible. Lipo is spoken by about 
212,000 people in Yunnan from Binchuan and 
southern Yongsheng counties in the west, across 
Dayao, Yao'an and Yongren counties, to Yuanmou, 
Wuding and Luquan counties in the east; it is also 
spoken in several southern counties of Sichuan by 
about 8,000 more people classified as Lisu nationality, 
and another 30,000 classified as Yi nationality, mainly 
in Panzhihua City. Lolopo is spoken by about 550,000 
members of the Yi nationality, mainly in western 
Chuxiong Prefecture, but also widely scattered to the 
west and south, even as far as north-eastern Laos, 
where this is a recognized ethnic group with 1,407 
people; it goes under a bewildering variety of exo- 
nyms including Xiangtang. Some young people of 
these groups do not speak the group languages. These 
two languages are surrounded by Nosu or Northern 
Yi on the north, Nasu or Eastern Yi to the east, Nisu 
or Southern Yi to the south, and Lalo or Western Yi 
to the south-west; hence their designation as Central 
Yi. Some of the Lipo use a Pollard missionary script; 
there was no traditional Lipo or Lolopo script, but 
some Lolopo are reported to write using Nasu script. 

Micha are widely scattered from Fumin, Luquan 
and Wuding counties to the north-west of Kunming 
in central Yunnan and further west and south in 
Yongren, Dayao, Yao'an, Nanhua, Jingdong, Zhenyuan 
and Jinggu counties. In many areas the language is 
extinct, moribund or endangered, but it still probably 
has some 30,000 speakers among a group population 
well over 50,000. Some older Micha speakers who had 
married Lahu wives reached Thailand in 1972, but 
Micha is no longer spoken there. Other names for 
them include Mielang, Migie and so on; they are 
classified as part of the Yi nationality. 

Hlersu is spoken by about 15,000 members of 
the Yi nationality in north-western Xinping County 
and adjacent areas of Shuangbai, Eshan and other 
nearby counties. It is also known as Shansu or Sansu 
from the Nisu name for this group. Hlersu, Talu and 
Kucong are the only Central Ngwi languages to retain 
a voiceless lateral. 

Naluo is spoken by about 15,000 members out of 
a group of about 20,000 people of Yi nationality, 
mainly in southern Huaping County in Yunnan, with 
3,500 in south-eastern Yongsheng County in Yunnan 
and about a thousand in Pingjian and Futian town- 
ships on the western edge of Panzhihua City in 
Sichuan (Harrell 2001: 283-91). Because of phono- 
logical variation in south-western Mandarin Chinese, 
their name also appears in the literature as Laluo and 
Naruo. In Chinese, they are also known in Sichuan as 
Shuitian ‘irrigated field’ Yi and in Yunnan as Shui 
‘water’ Yi. This group should not be confused with 
the Muhisu further north in Sichuan who are also 
called Shuitian Yi, nor with the Lalo of western 
Yunnan. Naluo is moribund in Sichuan and better- 
maintained in Yunnan. 

Maci is the speech of the Yi nationality in Maci 
village, Yongren County, about 10 km (6 miles) 
north-east of the county town. There are 102 people 
there, but their language has not yet been investi- 
gated; local sources report that it is quite different 
from the speech of other nearby Yi. 

Talu is spoken by 10,138 people of Yi nationality 
in north-western Yunnan; nearly half live in Liude 
Township, Yongsheng County and the rest live nearby. 
It is well-described in Zhou (2004). Tagu is spoken by 
about 3,500 people in Taguping village, Dongshan 
Township, Yongsheng County and nearby in Ning- 
lang County, just south of Talu. Tazhi is the former 
language of a cluster of villages near Puwei Township 
of Miyi County in far southern Sichuan. They are 
classified as members of the Yi nationality, but little is 
left of their language (Harrell 2001: 270). Given the 
similarity of their autonyms and their proximity, it is 
likely that these three languages are related. 

Yongsheng County in western central Yunnan 
also has three other very small Central Ngwi groups 
among its Yi nationality. Apart from about 10,000 
Lolopo, who are locally called Xiangtang, about 3,500 
Naluo and some Lipo, there are some of the Talu and 
Tagu and all of the Nazan, Liude and Lang'e. Nazan, 
also cited as Nazhan or Naza, is spoken by about 
1,600 people in two clusters west of Tagu in south 
central Yongsheng County. Liude is spoken by about 
1,000 people just north-west of Talu in Yongsheng 
County. Lang'e, also seen in the literature as La'u, is 
spoken in south-western Yongsheng by about 2,000 
people. Kuangsi is spoken by about 3,000 people in 
eastern Heqing County, just to the west of Yongsheng 
across the Upper Yangtse. The only one of these four 
languages for which any information at all is available 
is Lang'e, which is a typical Central Ngwi language. 

Tanglang is spoken by about 2,000 people of 
Yi nationality in southern Tai'an Township, south- 
western Lijiang County, and a few nearby in 
Jianchuan County. It is briefly described in Gai (2002). 

There are two Laloid languages: Lalo and 
Samatu. The Western Yi language Lalo has a very 
wide range of dialects under a variety of autonyms 
and exonyms and a total of about 600,000 speakers, 
but in many areas the language is being replaced by 
Chinese; there are many ethnic Lalo who do not 
speak Lalo. Björverud (1998) provides an excellent 
description of one variety. Closely related is the 
Samatu language with about 7,500 group members, 
but there are only a few elderly speakers in Zhenkang 
and Yongde counties in western Yunnan; this should 
not be confused with the Northern Ngwi Nasoid 
language Samataw spoken near Kunming. 

There are two Lahoid languages, Lahu and 
Kucong. The Lahu total about 725,000, with 453,705 
in China, over 200,000 in Burma, over 60,000 in 
Thailand, and about 8,700 in Laos; there is also a 
small group of about 1,200 refugees from Laos in 
the United States. Lahu is recognized as a nationality 
of China and as an ethnic group of Vietnam, Laos, 
Thailand and Burma. The main dialect difference is 
between Black Lahu (Lahu Na) and Yellow Lahu 
(Lahu Shi), but there are extensive smaller differences 
within each; for details see Bradley (1979b). The 
Baptist Lahu use a Baptist Romanization; there is also 
a Chinese Romanization which is in limited use in 
China. In recent years, there have been two attempts 
to create separate Yellow Lahu orthographies based 
on the Baptist Black Lahu orthography modified for 
the Banlan dialect, but these are regarded with great 
suspicion by other Lahu. For a superb grammar of Black 
Lahu, see Matisoff (1982). 

The people recognized as the Lahu ethnic group 
in Vietnam were formerly known as Cosung; this 
latter name is from the Chinese term Kucong also 
used to refer to them in south central Yunnan, mainly 
far to the east of any Lahu speakers. This group was 
unclassified for nationality in China and requested 
nationality status in the early 1980s, but was 
amalgamated into the Lahu nationality in 1989. They 
speak a range of dialects, none of which is mutually 
intelligible with Lahu. Their autonym is Lahlu; 
Kucong is a Chinese exonym. Kucong is linguistically 
quite close to Lahu, and has about 47,000 speakers, 
over 40,000 in Jinping, Xinping, Jinggu, Luchun, 
Mengla, Jiangcheng, Yuanjiang and other counties in 
China and 6,874 in Vietnam. The language is poorly 
documented and has a great deal of internal diversity. 

The compact Jinuo nationality in far southern 
Yunnan had 20,899 members in 2000 but only about 
10,000 speakers. It became a separate nationality in 
China in 1979, the last nationality recognized 
in China. There is no orthography. Concerning its 
linguistic position in Central Ngwi, see Bradley 
(1982). Most live in the Jinuoshan area of north- 
eastern Jinghong County, but a few who speak a 
divergent dialect live to the south of Jinghong town. 

The small Zaozou group of about 2,100 lives 
mainly in south-western Lanping County in north- 
western Yunnan, China; they are classified as Nu 
nationality, but the Zaozou language is Central Ngwi. 
For a description, see Sun et al (2002). 

Nusu speakers, over 12,000 of them, live in 
southern Fugong and northern Lushui counties in 
north-western Yunnan. There are northern, central 
and southern dialects with substantial tonal differ- 
ences. They are also classified as Nu nationality. All 
Nusu speakers are bilingual in Lisu, and use Lisu 
as a medium of literacy and lingua franca. For a 
description, see Sun and Liu (1986). 


5.1.2.5.4.3 Southern Ngwi 


As with the mainly Northern Ngwi Yi nationality, 
there is a Southern Ngwi nationality in China whose 
members speak a variety of distinct languages: Hani. 
Chinese linguists break this into three main subgroups: 
Hani-Akha, Haoni-Baihong and Biyue-Kaduo. 
The first two subgroups are fairly closely 
related within the Akoid subgroup of Southern Ngwi, 
spoken by quite large populations in south central 
Yunnan and also spread across south-western 
Yunnan, north-western Vietnam, northern Laos, 
north-eastern Burma and northern Thailand. The 
third forms a separate cluster of languages, the Bi-Ka 
subgroup, mainly in south-western Yunnan but also 
in Laos, Vietnam and Thailand. Also included here 
are the Bisoid languages of northern Laos, north- 
western Vietnam, northern Thailand, north-eastern 
Burma and nearby areas of far south-western Yunnan 
in China. 

In the Akoid group are the over 565,000 Akha 
speakers — 240,000 in southernmost China, over 
200,000 in Burma, 66,108 in Laos and 56,616 in 
Thailand. Dialect differences within Akha are fairly 
minor. According to their traditions, in which the 
Akha ritual group name is [za’'ni’'], they migrated 
from the north-east. To the north-east, mainly in 
southern central Yunnan, the roughly 760,000 Hani 
[xani’'] still live, including 17,535 in Vietnam and 
1,122 in far north-eastern Laos, and 737,399 in 
Honghe Prefecture and Jiangcheng County. Hani is a 
cluster of very similar varieties, nearly mutually 
intelligible with Akha. Genealogies suggest that the Akha 
migration took place about 25 generations or some 
600 years ago, at a time when incoming Chinese 
migrants would have put the Hani under demo- 
graphic pressure. To the north-west of the Hani are 
the 140,000 Haoni [xo”'ni’] and Baihong, speaking 
somewhat less similar dialects. The main innovation 
shared by Hani and Akha — but not Haoni or 
Baihong — is the merger of voiceless aspirated and 
unaspirated stops and affricates, which are now in 
complementary distribution: aspirated in non- 
creaky tone syllables and unaspirated in creaky tone 
syllables. Haoni is spoken as far north as Wuding and 
Luquan counties, north of Kunming, but mostly in 
areas south-west of Kunming in western Yuxi, eastern 
Simao and south-eastern Chuxiong prefectures. For 
Akha there are many competing Romanizations, with 
three in fairly wide use: Catholic, Baptist and a third 
based on the Hani pinyin Romanization used in 
China. For Hani in China there is a Romanization that 
was first proposed in 1957 and then introduced in 
revised form in 1984. Hani is a recognized ethnic 
group of Vietnam and Laos; Akha is a recognized 
ethnic group of Laos, Thailand and Burma, but is 
included in the Hani nationality in China. There are 
various divergent languages included within the Akha 
ethnic group, including Akheu, Chepya and probably 
others not yet located. Akheu is spoken by about 
5,000 people in Jinghong and Menghai counties in 
China, in north-western Laos, in far north-eastern 
Burma and in far northern Thailand, in some cases 
mingled with Akha but elsewhere in separate villages. 
It differs from Akha in that the pitch of non-creaky 
tones is reversed: Akha high tone corresponds to 
Akheu low tone, and vice versa. Chepya is spoken by 
about 2,000 people in the western part of Phongsaly 
Province in north-eastern Laos. Both Akheu and 
Chepya are endangered, and being replaced by Akha. 

The Bi-Ka subgroup of Southern Ngwi is named 
from the Chinese term for its two main components: 
Biyue (Piyo) and Kaduo (Khatu). Nearly 300,000 of 
them are included in the Hani nationality of southern 
Yunnan; a small number of Khatu live in northern- 
most Laos as well. The 2,000 Mpi live in two villages 
in Thailand; their tradition reports that they were 
brought as war captives from the north about 200 
years ago — presumably from the Biyue area. All Mpi 
are now bilingual in Northern Thai and culturally 
assimilated, but most people in the villages retain 
their language. Between these groups, there are the 
Muda, the Phana, Bana or Pana and the Sila. Sila is a 
recognized ethnic group of north-eastern Laos with 
1,772 people, and of north-western Vietnam with 840 
people. Phana is spoken in Mengla County, China; 
also in one village nearby in Namtha Province of 
north-western Laos and one further south on the 
Mekhong. They total about a thousand; the language 
is closely related to Sila further east, but is being 
replaced by Akha, Lahu or other languages. Muda is 
spoken by about 2,000 people in a few villages in 
Nanpianshan Township, Jinghong, where they are 
classified as Hani nationality and can also speak Akha. 

There are four clusters of Bisoid languages: 
1) Bisu, 2) Laomian, 3) Sangkong and 4) Phunoi. 
Bisu is spoken in two villages, Doi Chomphu and Doi 
Pui in Chiangrai Province, northern Thailand by 
under 1,000 people classified in the Lawa hill tribe, in 
a small cluster of villages in the north of the eastern 
Shan State of Burma (classified as the Pyen ethnic 
group from an alternative exonym, also seen in the 
Chinese term Laopin) and in Laopinzhai village in 
Menghai County of China (with 240 people of 
unclassified nationality) — a total of under 2,000 
speakers. There are two other villages where Bisu 
used to be spoken in Thailand and more in Burma. 
Contact among the remaining villages in the three 
countries has recently been re-established, a Thai- 
based orthography created, and language revitaliza- 
tion work is under way (Person 2005). 

Laomian is spoken in Laomianzhai and two 
other nearby villages in Lancang County (483 people), 
six villages in western Menglian County (1,400 
people, with young people in one village losing the 
language) and nearby in several villages in Burma; a 
total of around 2,500 speakers. In China, they were 
unclassified for nationality until 1990, when they 
were added to the Lahu nationality; most ethnic 
Laomian also speak Lahu, and many have assimilated 
into the Lahu population and speak Lahu but not 
Laomian (Xu 2005). 

Sangkong is spoken in Jinghong County by 
about 2,000 people of Hani nationality; they are also 
locally called Buxia in Chinese. A recent grammar by 
Li (2002) provides extensive documentation. 

Phunoi, also known as Sinsali, Singsali or Singsili 
for a few years in the 1990s, is a recognized ethnic 
group of 35,635 people in Phongsaly Province of 
north-eastern Laos and the recognized Céng ethnic 
group of 1,676 in Laichau Province in north-western 
Vietnam. It is divided into six subgroups. Of these six 
groups, Laopan is linguistically close to Bisu and 
Laomian, but the speech of the other five subgroups — 
including two Phunoi varieties as well as Pisu, 
Phongku and Lawseng — differs from them and from 
each other. The speech of the Céng ethnic group of 
Vietnam is another variety like Phunoi. 

The Bisoid languages share various major innov- 
ations, such as the change of certain *prefixed nasals 
into voiced prenasalized stops or stops, as in ‘fire’ 
[mbi”'] or [mi’'], which is usually [mi] elsewhere in 
Ngwi and TB. They are more conservative for final 
consonants than most other Ngwi subgroups, with 
final p, t, m, n and in some languages also k and ng. 


5.1.2.5.4.4 South-eastern Ngwi 


The South-eastern Ngwi group, also known to Shafer 
as Tonkin Lolo, includes a variety of Ngwi languages 
of south-eastern Yunnan and far northern Vietnam. 
They are classified as the Pula portion of South- 
eastern Yi in China, and as the Phula ethnic group 
of Vietnam (with 9,046 people there). Descriptions 
are very limited, but South-eastern Ngwi appears to 
include various languages including Pula, Muji 
and others. Preliminary work by Jamin Pelkey indi- 
cates that there are about 250,000 speakers of Pula 
and about 76,000 speakers of Muji, both including 
a very diverse range of dialects or languages and 
both with some speakers in Vietnam. Other lan- 
guages which belong here are Mo'ang of Longxiang, 
Longmai, Mula and Ligong villages in southern 
Funing County in far south-eastern Yunnan and in 
three nearby villages in Guangxi with 5,000 speakers 
(Wu 1993), Kathu of Balong Township, Guangnan 
County, south-eastern Yunnan with 5,000 speakers 
(Wu 1994), and Laghuu or Xapho in Vietnam with 
less than 500 speakers included in the Phula ethnic 
group (Edmondson 2003) and speaking a language 
similar to Muji, as well as several other languages not 
yet well known. 


5.1.2.6 Karen 


The Karenic languages, mostly in Burma but extend- 
ing into western Thailand, are clearly TB but have a 
number of distinct characteristics. The main one is 
that all are SVO, which is near-unique in TB, paral- 
leled only by Bai. There is also a substantial non-TB 
element in the Karenic lexicon, as demonstrated in 
Luce (1985). Benedict (1972) treated Karenic as a co- 
ordinate subgroup of Sino-Tibetan with TB and 
Sinitic, but later revised his view and placed Karenic 
within TB, and indeed closest to the Burmic portion 
of TB (Benedict 1976). Shafer (1974) places Karenic 
within TB as a separate group co-ordinate with 
Burmic, Bodic and Baric. There is also some dis- 
agreement about the internal subgrouping of Karenic, 
which includes a number of languages, many with 
several alternative names. 

Karen population was reported at 2,122,825 in 
Burma in 1983 and 353,574 in Thailand in 1999, plus 
about another 120,000 refugees from Burma; Kayah 
are separately counted at 141,028 in Burma. These 
figures are grossly underenumerated; many Karens 
try to pass for members of the majority group, 
Burman or Thai, and many lived in areas not directly 
controlled by the Burmese government in 1983 and 
thus were not counted. Moreover, many of the 
smaller Karen groups are not included in these totals. 
The Karen who are scattered in the Lower Burma 
delta region, enumerated at about 1.5 million in 
1983, are mostly Sgaw with a smaller population of 
Pho; many of these are in the process of becoming 
Burmanized and not all speak a Karen language. The 
‘official’ balance in Burma, about 0.8 million, is an 
absurd underenumeration. Manson (2003) estimates 
between six and ten million ethnic Karen, over five 
million of whom speak some Karen language. There 
is also a small Karen community in the Andaman 
Islands of India. The Karen of Burma are now 
officially classified into 23 ethnic groups, but some of 
these are alternative names for the same subgroup or 
for a dialect of one of the sixteen languages listed 
below. 

The consensus is that there are three sub- 
groups of Karenic: Northern Karen, Central Karen and 
Southern Karen. Pa-O, Kayan and Gekhu among 
other languages are Northern Karen; Kayah or Bwe, 
Blimaw, Bre, Geba, Yintale and Manu among others 
are Central Karen; the main languages in Southern 
Karen are Sgaw and Pho. Lehman differs from Solnit 
and Jones in grouping Gekhu with Bwe rather than 
Kayan, and Jones has a different view about the 
position of several groups. Manson (2003) uses lexical 
and phonological information that supports this 
grouping, though of course more work is needed, 
both descriptive and comparative. 

The Pa-O were formerly called Taungthu 
(Burmese ‘hill people’) and Shan Tonghsu; these 
names are now regarded as pejorative. This group 
probably totals over half a million speakers, nearly all 
in Burma and 257 now in Thailand. There is a Pa-O 
orthography. The Kayan, formerly known as 
Padaung, are the group whose women traditionally 
wear neck and knee rings; there are about 500,000 
speakers. The new autonym is much preferred by the 
group to the former name, regarded as pejorative. 
Also included here linguistically are Gekhu (‘upper’, 
also seen as Gheko, Geko, Gekho, etc.), Yinbaw 
(autonym Ka-ngan), Zayein (Latha) and probably 
Sawntung, with a total of about 85,000 speakers. 

Central Karen includes Kayah, sometimes still 
known as Karenni or Red Karen, and better known to 
its speakers as Bwe. This group has substantial dialect 
differences between east and west. It is a separate 
nationality with a separate state in Burma; there are 
some speakers in north-western Thailand as well. 
Officially it has over 140,000 speakers in Burma, but 
this is much underenumerated (and includes Manu, 
Yintale and perhaps some other Karenic languages 
within the Kayah State). Including Thailand, there are 
at least 250,000 speakers. Solnit (1997) provides 
documentation of one Kayah variety. 

Other Central Karen languages include the small 
western subgroup Manu; their Burmese name is 
Manumanaw, the Kayah name is Punu. This means 
‘western’ (dialect of Kayah); it may have 10,000 or 
more speakers. The name of the subgroup Yintale has 
been folk-etymologized into the Burmese Yin-Talaing 
or Mon Karen. It is a south-western variety of Kayah, 
with perhaps 10,000 speakers. The Blimaw subgroup 
of Karen is sometimes simply known as Bwe ‘central’ 
Karen. Studied by Henderson, it has about 20,000 
speakers. The language formerly known to its 
speakers as [bre'] and hence Bre or Brek, has about 
25,000 speakers who now prefer the autonym Kayaw. 
Geba is another Central Karen group, with about 
10,000 speakers. 

Sgaw is the Karen language with the most 
speakers; it may have up to 2.1 million. It is known 
to its speakers as [syo’] from the word for ‘person’. 
Just under half of the Sgaw are in the delta region 
of Lower Burma; they are also nearly half of the 
Karen in Thailand. There is a Burmese-based ortho- 
graphy using extra vowel and tone symbols, and a 
traditional ‘chicken track’ orthography not widely 
used any more. For an early but still useful grammar, 
see Gilmore (1899). Various divergent dialects have 
appeared in the literature as separate Karen lan- 
guages; for example Mopwa (also known as Palaychi 
from one of the villages where it is spoken; studied 
by Jones) in the north-west, Paku (which is the 
Kayah word for Sgaw and also refers to a north- 
eastern dialect of Sgaw), as well as Wewaw, 
Monnepwa and probably others. Briefly reported by 
Jones, the Lekeh language also has a traditional 
orthography derived from Burmese, but not widely 
known or used. The number of Lekeh speakers 
is unknown; some at least live in the delta region 
north-west of Rangoon. 

The other main Southern Karen language is Pho, 
also known to its speakers as Sho, to the Sgaw as Pwo 
or Pho, and in Thailand (from the word for ‘person’ 
there) most often as Phlong. This language has about 
1.6 million speakers, with nearly one-third of these in 
the delta region of Burma, and about half of the 
Karen in Thailand. This language extends quite far 
to the south, virtually to Burma’s southern end, and 
onto some adjacent islands. There are very sub- 
stantial dialect differences; the standard dialect has a 
Burmese-derived orthography with additional sym- 
bols and conventions for the vowels and tones of Pho, 
and a Thai-based orthography for a variety spoken in 
Thailand as well. Internal dialect differences are very 
substantial. For a recent summary description, see 
Kato (2003). 

Table 5.5 shows the numbers of speakers of all 
TB languages in East and South-East Asia; it does not 
include those in South Asia or elsewhere, nor does it 
include ethnic group members who do not speak 
their traditional language. To avoid cluttering the 
table with the names of large numbers of TB sub- 
groups and languages spoken only in the South Asian 
area, these have been omitted. 
Table 5.5 Tibeto-Burman languages (speaker totals in 
East and South-East Asia only) 


Languages Number of speakers 
Western (Bodic) 
Bodish 
Tibetan (Central Bodish) 
Western Tibetan 
all Ngari 100,000 
Central Tibetan 
all gTsang 800,000 
all dBus (U) 1,400,000 
including Lhasa 
Sherpa 800 
Southern Tibetan 
Groma (Tromowa) 10,000 
North-eastern Tibetan (Amdo) 
hBrogpa 660,000 
Rongba 125,000 
Rongba-hBrogpa 140,000 
rTahu 75,000 
Eastern Tibetan (Khams) 
Western 175,000 
Eastern 1,000,000 
Northern 100,000 
Cone 80,000 
hBrugchu 30,000 
Southern 200,000 
East Bodish 
Cona Menba/Central Monpa 700 
Motuo Menba/Eastern Monpa 8,000 
Sal (Baric) 
Bodo-Garo 
Northern Naga 
Khyamnyungan 40,000 
Wancho 5,000 
Nocte 20,000 
Tangsa, Rangpan 30,000 
Haimi 20,000 
Htangan 20,000 
Jinghpaw (Kachin) 
Jinghpaw 900,000 
Luish (Sak) 
Kadu 1,000 
Ganan 500 
Taman extinct? 
Malin extinct 
Sak 3,500 
Kuki-Chin 
Ao Group 
Yimchungrii 20,000 
Tangkhul Group 
Tangkhul 50,000 
Maring 2,000 
Kuki 
Anal 5,000 
Chawte 200 
Mara 500 
Chin 
North Chin 
North Chin 150,000 
Thado (Thadou) 10,000 
Central Chin 
Laizo 400,000 
Mizo 5,000 
South Chin 
South Chin 200,000 
Khumi, Khami 50,000 
Central 
Bugun-Khowa 
Sulung 2,000 
Adi-Mising-Nishi 
Nishi 
Bangni 200 
Adi 
Bokar 500 
Digarish ‘Mishmi’ 
Idu 60 
Taraon 710 
Mijuish ‘Mishmi’ 
Geman 200 
Zaiwa 700 
Rawang/Nungish 
Dulong 14,000 
Gvnong 
Zerwang 15,000 
Dvru 35,000 
Anung 4,050 
Rvwang 
Dvngsar 15,000 
Mvtwang 50,000 
Lungmi 30,000 
North-eastern 
Core North-eastern (Qiangic) 
Baima 10,000 
Qiang 150,000 
Jiarong 170,000 
Ergong 35,000 
Zaba 7,700 
Guichong 7,000 
Ersu 15,000 
Choyo 7,000 
Muya 15,000 
Shixing 1,800 
Namuyi 1,700 
Pumi 70,000 
Na 
Naxi 285,000 
Moso (Na) 50,000 
Tujia 
Biji 60,000 
Mozi 1,000 
Bai 
Bai 1,100,000 
Laemae (Lama) 5,000 
Burmese-Ngwi 
Mru 40,000 
Gong 80 
Burmish 
Burmese 35,000,000 
Arakanese (Rakhine) 1,700,000 
Tavoyan 400,000 
Beik (Merguese) 250,000 
Danu 100,000 
Intha 90,000 
Taungyo 40,000 
Yaw 20,000 
Hpun extinct? 
other Burmish 
Achang (Ngochang) 45,000 
Lashi (Lechik) 30,000 
Atsi (Zaiwa) 150,000 
Maru (Langsu) 100,000 
Chintau 70 
Bola 400 
Ngwi 
Northern Ngwi 
Nosu 2,300,000 
Muhisu 1,000 
Ghomozo 5,000 
Nyisu 300 
Nasoid 
Nasu, Naisu, Gepo 1,000,000 
Samei 20,000 
Sanie (Sa'ngwei) 8,000 
Samatao 400 
Chesu 3,300 
Aluo 25,000 
Ayizi 50? 
Nisu 800,000 
Kazhuo 4,000 
Central Ngwi 
South-eastern Central 
Sani 120,000 
Axi 100,000 
Azhe 60,000 
Azha 40,000 
Lisoid 
Lisu 945,000 
Lamu 300 
Lipo 250,000 
Lolopo 500,000 
Micha 30,000 
Hlersu 15,000 
Laloid 
Lalo 600,000 
Samatu 50 
Lahoid 
Lahu 725,000 
Kucong, Cosung, Lahlu 47,000 
Kuangsi 3,000 
Talu 10,138 
Tagu 3,500 
Tazhi extinct? 
Naluo 15,000 
Nazan 1,600 
Liude 1,000 
Lang'e 2,000 
Maci 102 
Tanglang 2,000 
Jinuo 10,000 
Zaozou 2,100 
Nusu 12,000 
Southern Ngwi 
Akoid 
Akha 565,000 
Hani 740,000 
Haoni, Baihong 140,000 
Akheu 5,000 
Chepya 2,000 
Bi-Ka 
Kaduo, Khatu 180,000 
Biyue, Piyo 120,000 
Muda 2,000 
Mpi 2,000 
Sila 2,600 
Phana 1,000 
Bisoid 
Bisu 2,000 
Laomian 2,500 
Sangkong 2,000 
Phunoi 37,311 
South-eastern Ngwi 
Pula 250,000 
Muji 76,000 
Laghuu 500 
Mo’ang 5,000 
Kathu 5,000 
Karenic 
Northern Karen 
Pa-O 500,000 
Kayan 500,000 
Gekhu etc. 85,000 
Central Karen 
Kayah (Bwe) 250,000 
Blimaw 20,000 
Bre 25,000 
Geba 10,000 
Yintale 10,000 
Manu 10,000 
Southern Karen 
Sgaw (Mopwa, Paku) 2,100,000 
Pho (Pwo) 1,600,000 


5.2 Mon-Khmer 


The Austro-Asiatic languages are probably the original 
autochthonous languages of mainland South-East 
Asia. There are also some in surrounding areas 
including south-western China and the Nicobar 
Islands; Austro-Asiatic also includes Khasi as well as 
the Munda languages in north-eastern South Asia. 

There is general agreement that the core South- 
East Asian subgroups of Austro-Asiatic as well as 
Khasi and Nicobarese form a group called Mon- 
Khmer (hereafter MK, named from Mon and Khmer, 
two languages with a long historical tradition). This 
includes the following subgroups: Palaungic, Waic, 
Angkuic, Khmuic, Pakanic, Vietic, Katuic, Bahnaric, 
Pearic, Khmer, Monic, Aslian of peninsular Malaysia, 
and (in South Asia), Khasi and Nicobarese. 

Many scholars group Palaungic, Waic, Angkuic, 
Khmuic and Pakanic into Northern MK; most scholars 
group Katuic and Bahnaric into Katuic-Bahnaric. 
Diffloth (2005) prefers to link Katuic with Vietic, and 
Bahnaric with Khmer, which together form his new 
Khmero-Vietic. Some scholars prefer to treat Nico- 
barese as a major branch of Austro-Asiatic, co- 
ordinate with MK and Munda, and some treat Aslian 
as a further major branch. By contrast, Diffloth (2005) 
proposes a new subgroup of Nico-Monic, comprising 
Aslian plus Monic on the one hand, and Nicobarese on 
the other; he also proposes a new linkage of Khasi 
with Northern in his new subgroup Khasi-Khmuic. 
Long ago, there was some controversy about the 
inclusion of Vietnamese, but it is clearly MK as are 
the closely-related Vietic languages. The Chamic 
languages were included in some early proposals of 
Austro-Asiatic or MK, but they are now known to be a 
subgroup of Austronesian. There is an MK element in 
the lexicon of some Austronesian languages of north 
Sumatra, such as Acehnese, and more in Chamic. 

The general typological features of MK languages 
include extensive morphology, including infixation, 
and verb-medial syntax. In this respect, Nicobarese with 
its verb-initial syntax is an exception. In phonology, 
many MK languages of the core South-East Asian 
area have developed a register (phonation-type) 
system, but this appears to be secondary. 


5.2.1 Northern Mon-Khmer 


The classification of these languages is not universally 
agreed by scholars: the traditional approach, as in 
Thomas and Headley (1970), proposes two sub- 
groups, Palaungic (from Palaung, the Burmese name 
for one group included, also sometimes called 
Palaung-Wa) and Khmuic (from the autonym Khmu, 
the largest group). Ferlus (1974a) proposes to include 
both in one group, his Northern MK, and to recognize 
the Mang language (tentatively included in Palaungic 
by Thomas and Headley 1970) as an additional major 
subgroup of MK; this subgroup is now better known 
as Pakanic. It includes two additional languages, 
Paliu, Palyu or Lai and Pakan or Ben'gan, and is 
grouped by some as an additional Northern MK sub- 
group. Diffloth (1980) further subdivides Palaungic 
or Palaung-Wa into three subgroups, Palaungic, Waic 
and Angkuic. 

The general locations of the subgroups are geo- 
graphically consistent with the genetic relationships. 
Palaungic as narrowly defined is furthest north, 
mainly west of the Salween; Waic groups are to the 
south, mostly between the Salween and the Mekhong, 
and Angkuic groups further south, mainly along the 
Mekhong, including Lamet furthest east in north- 
western Laos. Further south still, along and east of the 
Mekhong, Khmuic includes Mal/Pray and then 
Mlabri is south of the Lamet. Pasing or Bit, Khao, 
Khang and Xinhmul are to the north-east. The 
Khmuic language with the most speakers is Khmu in 
the south-east; Iduh was recently located by Suwilai 
Premsrirat further south-east. There are probably 
some more small Northern MK groups in Burma and 
China that have not yet been described, and the exact 
classification of some languages into Palaungic, Waic 
and Angkuic is in some cases unclear. All these lan- 
guages are verb-medial and have less morphology 
than most other MK languages. For a description of 
Khmu, see Premsrirat (1987). 

Most of the Palaungic languages are virtually 
undescribed, and the situation in Burma remains par- 
ticularly nebulous. Palaung is a Burmese exonym; the 
autonym is Ta-ang. The name of this group was 
changed from Benglong to De'ang in China in the 
mid-1980s to recognize this autonym. Palaung is 
sometimes divided into Golden Palaung further west 
and Silver Palaung further east. Mitani (1978) evalu- 
ates old sources to suggest that Palaung or Ta-ang (as 
spoken by about 150,000 around Nam Hsan in 
northern Shan State, Burma and also by nearly 9,000 
members of the De'ang nationality of 17,935 in 
China, including the Pale ‘standard’ variety in China) 
is somewhat different from Rumai (about 150,000 
speakers, of whom 2,500 are in China). Speakers of 
Palaungic languages are included in the De'ang 
nationality in China. There is no orthography 
(though one is being developed for the ‘standard’ 
dialect, Pale). About 1,600 Palaung speakers from 
Burma have recently moved into Thailand. Danau is a 
further poorly-documented language, spoken in one 
village near Inle Lake in Burma (Robinne 2000). 
Phuman or U/Hu has about 30,000 speakers, 6,000 in 
Shuangjiang and more in surrounding counties in 
China, and widely scattered further south; they are 
classified as Bulang, Wa, or in some areas, unclassified 
for nationality; some scholars prefer to classify Phu- 
man as an Angkuic language. 

Waic (Wa + -ic; two syllables, see Diffloth 1980) 
languages include Wa itself, with very diverse and 
numerous dialects. There are about 900,000 speakers, 
with 400,000 speakers in China (a few of whom are 
included in the Bulang nationality, but the rest all in 
the Wa nationality; the 2000 census figure for Wa was 
396,610) and about 500,000 speakers in the north- 
eastern Shan State of Burma. Some Wa have recently 
moved to the Mong Hsat area of the eastern Shan 
State, next to the Thai border, and a few also live 
inside Thailand north-west of Chiangdao. Paraok is a 
widespread autonym, both in China and in Burma; 
this term is also used to refer to the ‘standard’ dialect, 
which is a medium of literacy. There is a missionary- 
developed Romanization in Burma and a 1950s 
Chinese-developed Romanization, both of which are 
widely used. Varieties of Wa include Awa Wa (70,000, 
included in the Wa total above) and Bulang Wa 
(30,000, included in the Wa total above) among 
others; older literature cites Son, En, La, Khala and 
other more or less divergent Wa subgroups. Zhou and 
Yan (1984) provide a brief description. Wa has many 
loanwords from the TB language Lahu due to long- 
term contact. Rianglang or Riang (also known to 
others as Yanglam or Black Karen, or also Yang Sek) 
has a number of fairly similar dialects spoken by over 
70,000 people, mostly in the eastern Shan State, 
Burma, but also about 4,000 in China; a Romaniza- 
tion for Rianglang has been used in Burma. Fairly 
closely related to Wa is the Lavua language, east of 
Mae Sariang in north-western Thailand, spoken by 
about 15,000 of a group of 20,000 with surprisingly 
great dialect differences; a Thai-based orthography 
exists. 

The Angkuic subgroup includes, from north to 
south, 1) Plang or Phang with about 40,000 speakers 
of the Bulang nationality in Menghai and surround- 
ing counties, China, a few in Burma, and one village 
in Thailand, Kien Ka, where the language is no longer 
spoken; 2) Mok in Jinghong County, China, also 
Burma and Ya-ang village in Thailand; 3) Loi in the 
eastern Shan State of Burma and north-western Laos; 
4) Phalo just west of Mae Rim in northern Thailand 
and 5) Lamet. In China, most Angkuic language 
speakers are classified in the Bulang nationality 
(91,822 people in 2000), but not all members of this 
nationality can speak their traditional languages. Mok 
and Loi may be one language, with perhaps 30,000 
speakers. Phalo is probably extinct. Lamet is a 
language with about 14,500 speakers, mostly con- 
centrated in a small area of north-western Laos 
(14,355 people), and one village near Wiang Pa Pao in 
Thailand where the language is moribund. 

Khmuic includes five subgroups: Mal/Pray, 
Mlabri, Pasing (Bit, Khao), Xinh Mul and Khmu 
itself. Khmu has about 470,000 speakers (389,694 in 
northern Laos; most of 56,542 in Vietnam, 13,674 in 
Thailand, about 10,000 in China, a few in Burma, and 
some refugees in western countries) and very exten- 
sive dialect differences, both phonological and lexical; 
it has a Roman orthography, which is not widely 
used, and a Thai-based orthography. The Mal/Pray/ 
Phay subgroup is one language with minor dialect 
differences but without a group name; speakers of 
this language call themselves Mal, Pray or Phay 
according to dialect. Thai authorities usually call this 
group T'in [t'in”], but locally this name is not used. 
Of just under 53,000 speakers, 38,823 live in Thailand 
and 13,977 in adjacent areas of Laos. Again, a 
Thai-based orthography exists. To the south of this 
group are the elusive Mlabri, also known somewhat 
pejoratively as Phi Tong Luang (‘spirits of the yellow 
leaves’), a small nomadic group of 125 in Thailand 
and 24 or more in Laos; some older literature also 
cites another group, Yumbri, but this is the same. 
Even Mlabri has several dialects; they are still mainly 
hunter-gatherers. In northern Laos and Vietnam, 
another Khmuic subgroup includes the Pasing (also 
called Bit, 1,530 in Laos and 202 in China), Khang 
(10,282 in Vietnam and a few in China) and 
Khao (about 5,000, included in the Khmu total in 
Vietnam). The easternmost Khmuic subgroup are the 
Xinhmul, also known as Phong in Laos and Puoc 
in Vietnam (18,018 in Vietnam, 2,164 in Laos). The 
latter is not to be confused with two Vietic groups 
further south in Laos with similar names. The final 
and most recently located Khmuic language is Iduh 
with 30 speakers among a group of 200 people, the 
furthest south-east of the Khmuic languages. 

Mang, as noted above, was the first language 
of the newly established Pakanic subgroup of MK to 
be described. There are about 2,300 in Lai Chau 
Province, Vietnam and 1,200 in Jinping County, 
China. Paliu (also known by the exonym Lai) is in 
north-west Guangxi, the furthest north-east MK 
language; many of the 1,771 members of this group 
now speak other languages, but 500 speakers remain. 
The third language of this subgroup is Pakan 
(Ben'gan) with about 3,000 people in south-eastern 
Yunnan. 


5.2.2 Vietic 


Within this subgroup of MK are Vietnamese, the 
national language of Vietnam, and a number of 
closely related smaller groups in north-western 
Vietnam and eastern Laos. Most of the latter have 
relatively few speakers, some as few as hundreds; 
some of these comprise mutually intelligible dialect 
chains. All, especially Muong itself, have undergone 
centuries of influence from Vietnamese. 

The Vietnamese language is undoubtedly MK, 
though it has undergone millennia of Chinese influ- 
ence. It has three main regional dialects, Northern, 
Central and Southern. Formerly written in a Chinese- 
derived orthography, it now uses a Romanization 
notable for its use of diacritics to represent the six 
tones and to distinguish vowel height. It is the 
national language of Vietnam, and is spoken by most 
other groups living in Vietnam as a second language. 
Apart from about 66 million first-language speakers 
and up to 10 million second-language speakers in 
Vietnam, there are large and long-established com- 
munities in Cambodia, Thailand, Laos and China 
(the Jing nationality, over 22,000 speakers). It is also 
spoken by a large post-1975 refugee community (well 
over a million) in the United States, France, Canada, 
Australia and elsewhere. Though about half of the 
‘Vietnamese’ refugees outside Asia are ethnic Chinese, 
they can also speak Vietnamese. Thus Vietnamese 
may have over 78 million speakers in all. 

Vietnamese was initially written with Chinese 
characters, and has a very large quantity of Sino- 
Vietnamese loanwords from the Han, Song and Tang 
dynasties during which it was under Chinese rule, 
and continuing after independence in AD 939. The 
Jesuit Alexander of Rhodes devised a Romanization in 
the early seventeenth century, and this became the 
sole official orthography under French colonial rule 
between 1862 and 1954. The use of Chinese char- 
acters continued among scholars and those connected 
with the Vietnamese court, but has nearly ceased 
now. 

The Muong languages are non-literary and fall 
within the cultural orbit of Vietnamese; they show 
varying degrees of Vietnamese influence. The largest 
of these by far is Muong itself, one of the recognized 
nationalities of Vietnam, with some 1.1 million 
speakers. The second is Nguon, with about 2,000 
speakers within the Muong ethnic group of Vietnam. 
There are many other small groups or subgroups, 
none with more than a few thousand speakers, mostly 
in southern Laos, some along the western border of 
central Vietnam, nearby, and a few in north-eastern 
Thailand. In Vietnam, these groups are collectively 
known by the Vietnamese exonym Chut (3,829 
people in 1999, mostly along the Lao border in 
central Vietnam) and, in Laos, as the Arem ethnic group. 
For a discussion, see Ferlus (1974b). These can be 
grouped into six clusters: 1) Ruc, Sac and May (about 
2,250 speakers in total); 2) Hung, Tum and Phong 
(about 3,000 speakers); 3) Maleng (including Kri with 
31 speakers, another group known as Pong and the 
speech of the former village of Pakatan, about 4,000 
speakers in total); 4) Thavung (1,500 speakers); 
5) Aheu (also sometimes known as Phonsung from a 
village name, 500 speakers) and 6) Arem proper 
(under 1,000 speakers). Languages of groups 1 and 2 
are spread across the Vietnamese–Lao border, while 
the rest live in Laos; there are a few Thavung who 
have moved to north-eastern Thailand. There is a 
wide range of exonyms and alternative names; 
for example, one autonym for Ruc is Cha Cui; it is 
also known as Tac Cui, Cuoi or Tho as well as the 
Vietnamese name Ruc; similarly, Tum is also known 
as Tay Tum, or to its speakers as Ktum; Hung is also 
known as Khong Kheng. Pong and Phong are both 
also known as Kha Pong, and neither is the same as 
the Khmuic Phong further north. 


5.2.3 Katuic-Bahnaric 


Some scholars prefer to keep these two groups separ- 
ate, but most suggest that they have a close genetic 
linguistic relationship to each other and form a sub- 
group within MK. For many of these languages there 
are several named dialects, which have sometimes 
been cited in the literature as separate languages, but 
with no name for the language or group as a whole. 
Also, many of these groups have been cited according 
to geographical locations, clan names or names used 
by other groups, as well as by their own name. 

The Katuic languages of southern Laos, western 
central Vietnam, north-eastern Thailand and north- 
ern Cambodia are quite diverse and mostly poorly 
described. A former division into North Katuic and 
South Katuic is now rejected; some scholars (e.g. 
Thomas and Headley 1970) suggest that Katu itself is 
divergent from the rest of Katuic, while Ferlus and 
Diffloth both make a division into West Katuic and 
East Katuic, as followed here. 

West Katuic languages have over a million 
speakers, including about 800,000 Kuy (known in 
Thai as Suay), with many dialects widely scattered, 
mainly in Thailand but also about 50,000 in Laos; 
160,000 So speakers, mostly in Laos, but some also in 
Thailand; and about 80,000 speakers of Bru (formerly 
known as Vankieu in North Vietnam, and now called 
Bru-Vankieu there), 55,559 in Vietnam but also in 
Laos and two villages in Thailand. Some Bru dialects 
are sometimes reported as separate languages: Leun, 
Galler and so on. Other languages such as Tri, 
Khua and Mangkong are also quite close to Bru and 
are included in the Bru-Vankieu ethnic group in 
Vietnam. There are Thai-based orthographies for Kuy 
and Bru. 

East Katuic comprises many more named groups, 
but many of them seem to be mutually intelligible 
dialects. Linguistic work, mostly by members of the 
Summer Institute of Linguistics, suggests that Pacoh 
and Phuong (Phuang), with a total of about 21,000 
speakers, are close to each other, that Katang, Ta-oih, 
Klor/Ngeh (autonym Kriang) and Ong/Yir with over 
150,000 speakers are similar, and that the various 
Katu/Kantu dialects including Tareng, Pilu, Tampril, 
Taluy and high and low Katu, with over 52,000 
speakers altogether, are closely related despite lexical 
differences, especially within Katu. Recent fieldwork 
by Theraphan L. Thongkum in southern Laos has 
located several additional East Katuic languages 
including Chatong, Dakkang and Triw with very 
small speaker populations. Romanizations were 
devised for some East Katuic languages in Vietnam, 
but are not used. 

Again, much of the descriptive and comparative 
work on Bahnaric languages has been done by 
members of the Summer Institute of Linguistics. The 
most recent classification divides Bahnaric into West, 
Northwest, North, Central and South Bahnaric. West 
and Northwest Bahnaric languages are spoken in 
southern Laos, North Bahnaric languages inland in 
central Vietnam, and South Bahnaric languages 
inland in southern Vietnam south of the Austronesian 
Chamic groups Rhade and Jarai. The two Central 
Bahnaric languages are spoken to the west and north- 
east of the northernmost Chamic groups. The arrival 
of the Chamic groups on the coast about 2,000 years 
ago and their later movements inland have intruded 
between the Bahnaric subgroups. 

West Bahnaric includes Brao (about 35,000 
speakers with many dialects scattered along the Lao- 
Cambodian border: Kravet, Krung, Su (Sou, Suq) and 
probably others; an alternative name for Brao is Love 
or Lave). Loven (autonym Jru', about 28,000 speakers, 
also known as Boloven from its location) is close to 
Nyaheun (about 4,000 speakers). Oi, Sok, Sapuan and 
Cheng with a total of about 25,000 speakers appear 
to be one language. Further West Bahnaric lan- 
guages recently found by Theraphan L. Thongkum in 
southern Laos are Swoeng and Juk. 

The Northwest Bahnaric languages, all in 
southern Laos, are Tareng (6,000 speakers), Yaeh 
(5,000 speakers), Alak (13,200 speakers) and Kaseng 
(1,200 speakers). There was earlier some disagree- 
ment about the classification of Alak and Kaseng. 
Early sources include them within Bahnaric, and 
Huffman (1976) includes Alak within West Bahnaric. 
However Thomas and Headley (1970) included them 
within Katuic. 

Central Bahnaric includes Bahnar itself, with 
155,000 speakers in central Vietnam, and Tampuan, 
with about 15,000 speakers in north-eastern 
Cambodia. 

North Bahnaric languages (Smith 1972), mostly 
in Vietnam, include Jeh, Halang, Duan (Halang 
Doan), Rengao, Hre and Sedang, with about 263,000 
speakers in all; Todrah (dialects Modra and Didra) 
and Monom, 12,000; Kayong and Cua, 42,000; Takua 
6,000 and Katua 4,000 speakers. Kaco' (1,000 
speakers; this also appears in the literature as Lamam, 
a local clan name) is in north-eastern Cambodia, 
isolated from other North Bahnaric languages and 
surrounded by West Bahnaric Brao, Central 
Bahnaric Tampuan, and Chamic Jarai. 

There is somewhat less diversity within South 
Bahnaric, but there are very many named groups 
speaking related dialects that are often treated separately in 
the literature. Chrau has about 15,000 speakers out of 
22,567 ethnic-group members, Stieng about 100,000 
(about a third in Cambodia) and Koho 125,000. 
Koho dialects include Sre, Nop, Tola, Pru, Lac, Ma 
(Maa), Kodu, Tring, Kil and others. The Kil dialect 
of Koho is not the same as the Kil dialect of Eastern 
Mnong. Mnong can be divided into three languages: 
Eastern with Gar, Kil, Kwanha and Rolom dialects; 
Southern, with Bunong, Prang and other dialects; 
and Central with Biat, Bunar, Dihbri, Preh and other 
dialects. Mnong is an exonym, the Chamic word for 
montagnard; the Khmer word, also sometimes used in 
Cambodia, is Phnong. Population statistics are 
imperfect: there are 92,451 Mnong in Vietnam and 
about 150,000 in Cambodia, but it is difficult to 
separate these into the three language clusters. 
There is no other collective term for the three clusters 
of dialects, other than the word for ‘person’ in most 
Mnong dialects, Bunong. For many Bahnaric 
languages of Vietnam, there are Vietnamese-based 
Romanizations, but these are hardly used. 


5.2.4 Pearic 


Some scholars include Pearic and Khmer as a sub- 
group within MK; others prefer to regard the similar- 
ities as due to millennia of cultural dominance of 
the Pearic group by the Khmer. ‘Pear’ or ‘Pérr’ is a 
Khmer word for various small non-Khmer groups. 
Chong in Thailand is the most viable, with about 
500 speakers in a group of 2,000 and a language 
maintenance programme. Of the 200 Song or 
Kasong in Thailand, there are about 40 speakers. 
Su-ung, formerly known as Saoch from their Khmer 
exonym (meaning ‘skin disease’ and disliked by the 
group members) is spoken by about twenty people in 
Cambodia and elderly members of the so-called 
‘Khmer Padong’ group transported to western central 
Thailand in 1833. Samre is a group of about 200 
people, half in Pursat Province of Cambodia and half 
in Thailand; there may be as few as 120 speakers left. 
All other Pearic languages are or were spoken entirely 
within Cambodia. A group that has lost its autonym 
and now calls itself Pear has about 250 speakers, 
down from 1,000 pre-1975. There are several other 
Pearic languages that have recently become extinct, 
including Samray, Suoy and Samre of Siem Reap; 
their former location is indicated on Map 48. 


5.2.5 Khmer 


Khmer is a major literary language and was the lan- 
guage of one of the dominant empires in early South- 
East Asian history, centred on the area where it is still 
spoken but also including much of what is now 
north-eastern Thailand and southern Vietnam. There 
are probably over 800,000 speakers of Khmer in 
Thailand and 1,055,174 in Vietnam, as well as many 
more descendants of Khmer speakers there who 
speak only Thai or Vietnamese. 

Khmer is written in an Indic-derived orthography 
which is also the source for the Thai orthography. 
The earliest dated inscription is from AD 609; 
much of the early inscriptional corpus is in Sanskrit 
rather than in Khmer. Khmer has very extensive dialect 
differences, especially in vowels, with the central 
area more innovative and the periphery, especially in 
Thailand, more conservative; some scholars even 
claim that Northern Khmer in Thailand is a separate 
language, though it shares a writing system with 
standard Khmer. Many of the vowel-system changes 
are related to the former presence of a register 
(phonation-type) contrast, which split many of the 
long and short vowels before it later disappeared. 
Starting from a system with nine long and short 
vowels and three long and short diphthongs, the 
extreme is reached in standard Phnom Penh Khmer 
which has nine short and ten long monophthongs 
and thirteen diphthongs, including contrasts of five 
vowel heights of front vowels. There is also a great 
deal of sociolinguistic and regional variation in the 
vowel system. For a description, see Lim and Purtle 
(1972). 

There are probably over ten million speakers of 
Khmer, including refugees now settled in France, the 
United States, Australia, Canada and elsewhere, as 
well as those in Thailand and Vietnam. The total of 
over a million post-1975 refugees from Cambodia 
includes many overseas Chinese; they speak Khmer as 
well as a variety of Chinese, usually Yue, and often 
choose to identify as overseas Chinese in their new 
homes. 


5.2.6 Monic 


Monic is another major subgroup within MK. It 
now comprises two languages, Mon with substantial 
dialects and literature, and Nyah Kur (known to the 
Thai as Chao Bon ‘upper people’) which is non- 
literary and moribund. It is clear that the language 
of the Dvaravati kingdom — in the sixth to tenth 
centuries AD in what is now central Thailand — 
was Mon; other areas of Mon inscriptions include 
part of north-eastern Thailand, the Haribhunjaya 
(Lamphun) Mon kingdom in the north, and the 
Thaton and later Mon kingdoms in Lower Burma. 
Mon orthography is Indic-based and phonologic- 
ally quite conservative; it was also the source for 
the Burmese orthography. Mon is one of the recog- 
nized nationalities of Burma, and there are also 
substantial numbers of Mons living in western and 
central Thailand. There are about 800,000 speakers 
of Mon, mostly in Burma; there are also many 
Mons who do not speak Mon in Burma, and many 
more in Thailand who have simply become part of 
the Thai population. Mon has many local dialects, 
which can be divided into two main types in Burma, 
Mon Ro (north-western) and Mon Rao (elsewhere); 
various of these are also spoken by recent arrivals 
in Thailand; and there is a third dialect type of 
long-term residents of Thailand, scattered around 
Bangkok. 

Nyahkur is spoken by about 10,000, entirely in 
the hilly region between the central plain and the 
north-east of Thailand by a small part of the popula- 
tion of the area, and is in the process of being 
replaced by local varieties of Thai. It has three 
main dialects, Northern (rather distinct with three 
subdialects), Central (with the largest number of 
speakers and little internal diversity), and Southern, 
with two subdialects; see Diffloth (1984) and 
Thongkum (1984) for more data. 


5.2.7 Aslian 


The term Aslian comes from the Malay term Orang 
Asli ‘original inhabitants’, the Negrito groups in 
peninsular Malaysia, with a few spreading into southern 
Thailand. Their languages form a subgroup of MK, 
with three subgroups. While some dialects and lan- 
guages are extinct, fourteen survive, including one 
spoken only in Thailand, two spoken in Thailand and 
Malaysia, and eleven spoken only in Malaysia. The 
maps indicate only those areas where these languages 
are still spoken; their traditional range was much 
wider. 

Within Northern Aslian, the five surviving lan- 
guages have many dialects. Maniq, Tonga’ Mos or 
Ten'en has at least two dialects and about 300 
speakers in Thailand. This is closely related to Kensiu 
with about 500 speakers of dialects Kensiu Batu', 
Kensiu Siong, Kensiu Pemsed, Kenta’ Nakil, Kenta’ Bong 
and some extinct dialects; some sources suggest that 
Kenta’ (Kintaq) is a separate language. Jehai has 
about 1,200 speakers of dialects including Menri' 
(Mendriq, Menraq, sometimes cited as a separate 
language). Kensiu and Jehai are spoken on both sides of 
the Thai-Malay border. Batek has about 960 speakers 
of dialects including Batek Té', Batek Dé’, Batek Iga, 
Batek Nong, Mintil (sometimes cited as a separate 
language) and at least one extinct dialect. Che’ Wong 
has two dialects including Kled and about 400 
speakers. 

The four languages of Central Aslian have many 
local dialects, especially Semai with about 29,000 
speakers and Temiar with about 17,000; for a descrip- 
tion of the latter, see Benjamin (1976). Jah Het has 
about 3,200 speakers, and Lanoh has about 350 
speakers. Southern Aslian contains three surviving 
languages; all have dialects. Semelai has about 4,100 
speakers and includes Temo' (Temoq, sometimes 
cited as a separate language). Semaq Beri has about 
2,500 speakers, and Besisi (also known as Mah Meri) 
has about 2,200 speakers of Betise' and Sisi’ among 
other dialects. The general Malay terms Semang and 
Sakai refer to the Aslian groups as a whole in a pejora- 
tive way; the term Jakun is worse. Nearly all Aslian 
language speakers also speak Malay, and some former 
groups speak only Malay. Like most MK languages, 
Aslian languages are verb-medial and have substantial 
noun and verb morphology. 


5.2.8 Nicobarese 


Nicobarese is another of the subgroups of MK, and 
differs somewhat from island to island. On Great 
Nicobar there are two languages: one, Shom Peng, 
is spoken by a tribal group; otherwise, the rest of 
Nicobarese forms a dialect chain including Southern 
(Milo, Great Nicobar, Little Nicobar and Condul 
subdialects), Central (Camorta, Katchall or Tehnu, 
Nancowry and Trinkat subdialects), Teressa/Bom- 
poka, Chaura (Chowra) and Car (or Pu) dialects. 
There is substantial internal lexical diversity, and 
a fairly low proportion of cognates with other MK 
subgroups. This may be related to lexical tabooing 
of names on death. Unlike other MK languages, 
Nicobarese languages are usually verb-initial, often 
VOS. The total number of speakers is about 23,000, 
with very few remaining speakers of Shom Peng — 
probably fewer than 100. The Nicobars were very 
severely affected by the tsunami of 26 December 
2004, and some varieties such as Chaura have lost a 
large part of their speaker population. For a descrip- 
tion of one variety, see Braine (1970). 


5.2.9 Khasian 


Khasian languages are spoken mainly in the eastern 
half of Meghalaya in India, also nearby in Bangladesh. 
Khasi is divided into quite diverse dialects, some not 
entirely mutually intelligible. However, the agreed 
standard Sohra dialect is based on the speech of 
Cherrapunji town, which has been written in a 
Welsh-influenced Romanization since 1841; it was 
earlier written in a Bengali script from 1813. The 
main internal differences are between Eastern or Syn- 
teng and Pnar, Southern or War, and Central or 
Sohra; the latter also includes Western or Lyngnam. 
Map 51 shows the locations of the three main sub- 
groups. Table 5.6 summarizes the speaker populations 
of MK languages in East and mainland South-East 
Asia, but does not include Munda, Nicobarese or 
Khasian speakers in South Asia. 


Table 5.6 Mon-Khmer languages (East and South-East Asia, speakers) 


Languages                             Number of speakers 
Northern Group 
Palaungic 
  Palaung/De'ang/Pale                 150,000 
  Rumai                               150,000 
  Phuman                              30,000 
  Danaw                               1,000 
Waic 
  Wa/Paraok                           800,000 
  Awa Wa                              70,000 
  Bulang Wa                           30,000 
  Rianglang                           70,000 
  Lavua                               15,000 
Angkuic 
  Plang/Phang                         40,000 
  Mok/Loi                             30,000 
  Phalo                               extinct 
  Lamet                               14,500 
Khmuic 
  Khmu                                470,000 
  Mal/Pray                            52,800 
  Mlabri                              149 
  Pasing (Bit)                        1,732 
  Khao                                5,000 
  Khang                               10,500 
  Xinhmul/Phong                       20,182 
  Iduh                                30 
Pakanic 
  Mang                                3,500 
  Paliu (Palyu, Lai)                  500 
  Pakan (Ben'gan)                     3,000 
Vietic Group 
  Vietnamese                          78,000,000 
  Muong                               1,100,000 
  Nguon                               2,000 
  Ruc                                 250 
  Sac                                 1,000 
  May                                 1,000 
  Hung                                1,000 
  Pong                                1,000 
  Tum                                 1,000 
  Maleng (Pong, Kri)                  4,000 
  Thavung                             1,500 
  Aheu                                500 
  Arem                                1,000 
Katuic-Bahnaric Group 
Katuic 
West Katuic 
  Kuy                                 800,000 
  So                                  160,000 
  Bru-Vankieu (Tri, Khua, Mangkong)   80,000 
East Katuic 
  Katu/Kantu                          52,000 
  Pacoh                               15,000 
  Phuong                              6,000 
  Katang                              80,000 
  Ta-oih                              50,000 
  Klor/Ngeh                           10,100 
  Ong/Yir                             12,000 
  Chatong                             580 
  Dakkang                             1,198 
  Triw                                1,328 
Bahnaric 
West Bahnaric 
  Brao                                35,000 
  Loven                               28,000 
  Nyaheun                             4,000 
  Oi/Sop/Sapuan/Cheng                 25,000 
Northwest Bahnaric 
  Tareng                              6,000 
  Yaeh                                5,000 
  Alak                                13,200 
  Kaseng                              1,200 
North Bahnaric 
  Jeh                                 12,000 
  Halang                              15,000 
  Duan                                2,500 
  Rengao                              18,000 
  Hre                                 115,000 
  Sedang                              100,000 
  Todrah                              6,000 
  Monom                               6,000 
  Kayong                              25,000 
  Cua                                 17,000 
  Takua                               6,000 
  Katua                               4,000 
  Kaco'                               1,000 
Central Bahnaric 
  Bahnar                              155,000 
  Tampuan                             15,000 
South Bahnaric 
  Chrau                               15,000 
  Stieng                              100,000 
  Koho                                125,000 
  Mnong                               142,000 
Khmer                                 10,000,000 
Pearic Group 
  Chong                               500 
  Song/Kasong                         200 
  Pear                                250 
  Samre                               120 
  Chu-ung                             30 
  Samray                              extinct 
  Suoy                                extinct 
  Samre of Siem Reap                  extinct 
Monic Group 
  Mon                                 800,000 
  Nyahkur                             10,000 
Aslian Group 
Northern 
  Maniq/Tonga'/Mos/Ten'en             300 
  Kensiu                              500 
  Jehai                               1,200 
Central 
  Lanoh                               350 
  Semai                               29,000 
  Temiar                              17,000 
  Jah Het                             3,200 
Southern 
  Semelai                             4,100 
  Semaq Beri                          2,500 
  Besisi                              2,200 


5.3 Austro-Thai 


Benedict has demonstrated a remote relationship 
between three groups of languages: Tai-Kadai 
(sometimes also called Thai-Kadai, with subgroups 
Kam-Thai and Kadai), Austronesian and Miao-Yao, with the 
Kadai languages often providing the link cognates; 
this is summarized in Benedict (1975). Benedict also 
provided the new term Austro-Thai to refer to this 
family. Not all scholars accept this grouping, though 
Benedict has reconstructed a large number of etyma, 
many of them convincing. The basic principle is that 
reconstructed two-syllable etyma have Tai-Kadai and 
Miao-Yao cognates reflecting mainly the second syl- 
lable, Austronesian cognates reflecting both syllables, 
and Kadai cognates reflecting parts of both syllables. 
The classic examples are Tai-Kadai *taa, Austronesian 
*mata ‘eye’, and Tai-Kadai *taay, Austronesian *matay 
‘die’. Benedict (1990) proposes the addition of 
Japanese to this group, but the Austro-Thai lexical 
element in Japanese appears to be due to contact 
during the Yayoi culture period in southern Japan. 
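
The syllable correspondence described above can be illustrated with a small sketch. It uses only the two pairs of reconstructions quoted in the text; the helper names, the naive syllable-splitting rule and the decision to disregard vowel length are assumptions made for illustration, not part of Benedict's method.

```python
import re

VOWELS = "aeiou"


def final_syllable(form: str) -> str:
    """Naively return the last syllable of a reconstructed form."""
    word = form.lstrip("*")
    # Optional onset consonants, one or more vowels, optional coda, at word end.
    match = re.search(rf"[^{VOWELS}]*[{VOWELS}]+[^{VOWELS}]*$", word)
    return match.group(0) if match else word


def ignore_length(form: str) -> str:
    """Collapse doubled vowels ('aa' -> 'a') so that length is disregarded."""
    return re.sub(rf"([{VOWELS}])\1", r"\1", form.lstrip("*"))


# (gloss, Austronesian, Tai-Kadai), as quoted in the text.
ETYMA = [
    ("eye", "*mata", "*taa"),
    ("die", "*matay", "*taay"),
]

for gloss, austronesian, tai_kadai in ETYMA:
    # The mainland form is expected to reflect the second syllable only.
    matches = final_syllable(austronesian) == ignore_length(tai_kadai)
    print(f"{gloss}: {austronesian} ~ {tai_kadai} -> {matches}")
```

Both pairs match under these assumptions; a real comparison would of course rest on regular sound correspondences rather than on string matching.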

A more recent development is the proposal by 
Sagart (2004) that Tai-Kadai descends from the 
North-eastern sub-branch (including Kavalan) of the 
Eastern branch of Austronesian in Taiwan, which 
migrated to the mainland some 4,000 years ago, and 
then diversified and spread westward. This branch, 
which he calls Formosan Ancestor of Tai-Kadai, is 
related to the same subgroup of Taiwan Austronesian 
which he believes to have migrated southward about 
2500 BC, Proto-Malayo-Polynesian, with its very 
widespread daughter languages, including Yami in Taiwan 
and many others elsewhere. 

Typologically, the Austro-Thai languages are 
similar, with a tendency to prefixing (though the 
mainland languages tend to lose morphology, thus 
conforming to the areal pattern). While Austronesian 
has been reconstructed as VSO, many modern 
Austronesian languages, and all of those in the 
mainland linguistic area, are SVO; similarly, all 
Miao-Yao and Kadai languages, and all but the 
north-westernmost Kam-Thai languages, are SVO. 
These latter are in an SOV area, surrounded (and 
being replaced) by the SOV Indic language, 
Assamese. 

The Chinese classification links Miao-Yao as a 
co-ordinate subgroup of Sino-Tibetan with TB and 
Sinitic, and places Kam-Thai (including Kadai) in the 
Sinitic subgroup of Sino-Tibetan. In both cases this is 
clearly wrong, as the similarities of vocabulary are 
based on Chinese loanwords of non-core vocabulary 
into Miao-Yao and separately (but more extensively) 
into Kam-Thai, both of which have long been within 
the cultural orbit of Sinitic. The most conservative 
current view is that Miao-Yao is an isolated family, 
that Kam-Thai (including the Kadai languages) is 
another, and that the very widespread Austronesian 
family is a third. However, Benedict’s original view 
about Austro-Thai is gaining widespread acceptance 
among scholars, notably through the work of Sagart, 
though all agree that much more work is needed 
to determine the exact correspondences between 
languages. 


5.3.1 Tai-Kadai 


The Kam-Thai languages, known in China as the 
Zhuang-Dong languages, form a closely related group 
co-ordinate with Kadai in Benedict’s Tai-Kadai com- 
ponent of Austro-Thai. Especially closely related are 
the Thai (also known as Tai or Dai) languages, which 
Li (1977) divided into three subgroups, Northern, 
Central and South-western. South-western Thai 
languages have many distinct Indic-based writing 
systems, while other Thai languages traditionally 
either used modified Chinese characters or no writ- 
ing, but some now have Roman orthographies in 
China and Vietnam. Many Thai languages form 
mutually intelligible dialect chains, especially in the 
south-western area. More distinct are the Kam-Sui 
languages. All are SVO and tonal, with little or no 
morphology or tone sandhi (unlike some surround- 
ing languages). 

The internal classification of Thai is not gener- 
ally agreed; some scholars reject Li’s basic Northern/ 
Central/South-western divisions and propose other 
groupings; others accept them but have fundamental 
differences in subgrouping within divisions, especially 
South-western; and some feel that the whole is a con- 
tinuum. The position of some specific languages is 
also not agreed: Li and Hashimoto classify Be as a 
Central Thai language, others (including Chinese 
scholars) group it with the Northern Thai languages, 
Ostapirat makes it a separate but co-ordinate sub- 
group within Kam-Tai, and Benedict believed it is 
genetically quite distant from Thai and Kam-Sui — 
perhaps even closer to Kadai. Most Chinese scholars 
include Lajia and Li (also known as Hlai) with the 
Kam-Sui languages, while some Western scholars 
place Lajia in Northern Thai. Ostapirat (2000) 
suggests that Li forms a separate subgroup, co-ordinate 
with Kadai and Kam-Tai, while Be forms a separate 
subgroup, co-ordinate with Kam-Sui and Thai within 
Kam-Tai. Here, Li and Lajia are treated as Kam-Sui 
languages, while Be is regarded as a Northern Thai 
language. 

There is also a tendency in the literature on Thai 
languages to identify languages by place names or 
official exonyms rather than autonyms, which adds to 
the confusion. The Chinese predilection for grouping 
languages into nationalities is exemplified by the Dai 
nationality, which includes all the South-western Thai 
languages of China: the largest groups there are Dai 
Mao or Dehong Dai and Dai Lue or Xishuangbanna 
Dai, but also substantial are Dai Neua, Dai Sa (includ- 
ing Dai Duan, Dai Luomi and Bumang), Yongren Dai, 
and smaller numbers of Shan, Black Tai, White Tai 
and so on. 


5.3.1.1 Kam-Sui (Dong) languages 


The Kam-Sui languages are called the Dong languages 
in China, from the Chinese name of the largest group, 
also known as Kam. Most of these languages are 
spoken in south-eastern Guizhou and northern 
Guangxi, with one spoken in Hainan and one in 
Guangdong. The five nationalities whose languages 
are Kam-Sui are Dong (Kam) of eastern Guizhou, Li 
or Hlai of south-west Hainan, Shui (Sui) of south- 
eastern Guizhou, Mulao of north-eastern Guangxi, 
and Maonan of north central Guangxi. Several of 
these languages have Romanizations used since the 
1950s. Three other languages in south-eastern 
Guizhou without separate nationality status are Rao 
(Ten, Yanghuang), Jin and Mak (Mo), which are all 
classified as Buyi. In western Guangdong are Biao, 
with speakers classified as Han Chinese. Finally, in 
north-eastern Guangxi are the Lajia (Lakkia), which 
are classified as Yao. Ostapirat (2005) suggests that Li 
or Hlai should be classified as an independent sub- 
group of Tai-Kadai. It has a great deal of internal 
diversity. 

Dong or Kam had 2,960,293 group members in 
the 2000 census, but less than half of these can speak 
the language, which has two major varieties, various 
further subdialects and a standard Romanization; see 
Liang (1980) for details. Li had 1,247,814 people, plus 
another 60,000 or so classified as Han and locally said 
to speak Cunhua ‘village speech’; but the total speaker 
population is only about 756,000. Li or Hlai is classi- 
fied into five major varieties: four Ha subvarieties, 
spoken by nearly 450,000 including the Han Cunhua 
speakers; three Qi subvarieties spoken by about 
179,000; two Bendi subvarieties spoken by 45,000; 
Meifu spoken by about 30,000 and Jiamao by about 
52,000. A Romanization for Li was prepared, but has 
not been used. Shui had 406,902 people in 2000, but 
only about 300,000 speakers. Mulao had 207,356 
people but fewer than 30,000 speakers, while Maonan 
had 107,166 people in 2000, but under 40,000 
speakers. 
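
The Li figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below simply sums the five variety totals given in the text and compares the result with the combined group membership; the variable names are illustrative, and the rounded inputs ("nearly 450,000", "about 179,000") are taken at face value.

```python
# Speaker figures for the five major Li (Hlai) varieties, as quoted in the text.
li_varieties = {
    "Ha": 450_000,      # includes the Han Cunhua speakers
    "Qi": 179_000,
    "Bendi": 45_000,
    "Meifu": 30_000,
    "Jiamao": 52_000,
}

total_speakers = sum(li_varieties.values())

# Group membership: Li nationality plus those classified as Han (Cunhua).
group_members = 1_247_814 + 60_000

# Share of group members who speak the language.
retention = total_speakers / group_members

print(f"total speakers: {total_speakers:,}")
print(f"share of group members speaking Li: {retention:.0%}")
```

The variety totals add up to the 756,000 speakers cited, which on these figures is somewhat under three-fifths of the combined group.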

Rao or Ten, also known by the exonym Yang- 
huang, is spoken by about 20,000 out of the group 
total of about 25,000 people. Lajia (or Lakkia, the 
term used in Haudricourt 1967) has about 9,000 
speakers out of some 12,000 group members of Yao 
nationality. Mak or Mo has about 5,000 speakers out 
of 10,000 in the group. The least-known Kam-Sui 
languages are Biao, which is spoken by about 70,000 
people in Zhaoqing Prefecture, Guangdong Province, 
and Jin, also known as Jiamu or Aicham and spoken 
by about 2,400 people in Jialang Township, Libo 
County, southern Guizhou. 


5.3.1.2 Northern Thai languages 


The speakers of the Northern Thai Buyi language are 
classified as Buyi nationality in Guizhou and parts of 
eastern Yunnan province, and as part of the Zhuang 
nationality in north-western Guangxi; there are also 
1,864 Bo-y ethnic group members in far northern 
Vietnam. Their autonym is [pu”’ ji*'], with the first 
syllable also pronounced with [ou] instead of [u] and 
the second syllable also pronounced with [ui], [ei] or 
[oi] instead of [i]. Nearly all the Buyi nationality of 
2,971,460 people and about two-thirds of the Zhuang 
nationality of 16,178,811 (including the ‘standard’ 
Wuming dialect) in north-western Guangxi and parts 
of south-eastern Yunnan province, as well as the Bo-y 
group of Vietnam, are Buyi speakers. There are 
extensive dialect differences in both areas. The Buyi 
and Zhuang orthographies were devised during the 
1950s in co-operation with Russian linguists, and 
contain various Cyrillic letters. Over 12 million people 
speak Buyi as their first language. Many more people, 
including speakers of ‘Southern Zhuang’ and various 
Kam-Sui, Kadai, Miao, Bunu and other languages, 
speak Buyi as a second language; there are also some 
members of the Buyi nationality who speak Kadai or 
Kam-Sui as in-group languages but also speak Buyi. 
Descriptions of Buyi and Zhuang are found in Yu 
(1980) and in Wei and Qin (1980), among other 
places. 

The speakers of Be or Ong-Be live in the 
north-west of Hainan Island in an area centred 
on Lingao town; there are about 500,000 speakers. 
While they are officially part of the Han (Chinese) 
nationality, the existence of their separate language 
is now recognized; but it has no orthography and 
is not taught in schools. As noted above, there is 
some controversy about the genetic position of 
Be (Hashimoto 1980, Ostapirat 2000, 2005), with 
Ostapirat (2005) suggesting that it forms a separate 
major subgroup of Tai-Kadai and is not a Northern 
Thai language. 

The Yay language, also known as Nhang in 
Vietnam, or Giay in a Vietnamese representation of 
the autonym, is spoken by small and scattered groups 
totalling 49,098 in northern Vietnam, over 12,000 
more in adjacent areas of China, and 3,447 people 
in Laos, a total of nearly 65,000. Within China they 
are considered one of the ‘northern’ subdialects of 
Zhuang. There are internal dialect differences; one 
dialect is known as Yoy. 

Saek is spoken in southern Laos and adjacent 
areas of north-eastern Thailand by about 25,000 
people; the language is in a state of decline, being 
replaced by Lao. Its current geographical location is 
an anomaly, probably the result of the practice of 
transporting the population of conquered territory 
closer to the power base of the conqueror. There may 
be additional undescribed Northern Thai languages 
spoken by members of the Buyi nationality in 
Guizhou. 


5.3.1.3 Central Thai languages 


The Nung live in north-eastern Vietnam and adjacent 
areas of south-western Guangxi and south-eastern 
Yunnan, China; for a grammar, see Saul and Wilson 
(1980). They form the Nung nationality of Vietnam 
(856,412 people in 1999) and the southern part of the 
Zhuang nationality in China, nearly a third of the 
Zhuang total population. Traditionally, they had a 
Chinese-based orthography; in south-eastern Yunnan 
a separate Romanization was implemented in the 
early 1980s to replace the northern Zhuang ortho- 
graphy in use since the 1950s there. In Vietnam there 
has been a joint Nung-Tay Romanization since 
1954, approved officially in 1961 and improved since. 
There are nearly six million speakers, the majority in 
China. 

The Tay nationality of northern Vietnam 
(1,477,514 in 1999), also known as Tho, has nearly 
two million speakers, with over two-thirds in north- 
eastern Vietnam and more within the Zhuang 
nationality in south-western Guangxi, China. Like 
Nung, they had a Chinese-based orthography, now 
replaced by the joint Nung-Tay Romanization in 
Vietnam. 

The Caolan, a Sinicized group, consists of 
147,315 speakers now classified as the Sanchay 
ethnic group of Vietnam; they are mainly in various 
areas of the delta of northern Vietnam. There is a 
much smaller number in China, where, like the other 
Central and Northern Thai groups of Guangxi, they 
are included in the Zhuang nationality. 


5.3.1.4 South-western Thai languages 


South-western Thai is very widespread and diverse; 
these are the Thai groups who have migrated from 
Yunnan into South-East Asia over more than a 
millennium. Their names usually include a first 
syllable derived from Proto-Thai *dai, which is voice- 
less unaspirated in most modern South-western Thai 
languages, but aspirated in some languages such as 
Thai. The largest population of South-western Thai 
speakers is in Thailand, where the national language 
and the three major regional languages are all South- 
western Thai. In China they are grouped in the Dai 
nationality with 1,158,989 people in 2000; the name 
of the nationality reflects the pinyin form with a 
voiceless unaspirated [t] initial. In Vietnam, they are 
grouped in the Thai ethnic group with 1,328,725 
people in 1999. Burma uses the overall term Shan to 
refer to all South-western Thai languages; this name is 
related to Siam, the former term for Thailand. The 
Lao national majority group and several other large 
minority groups in Laos speak South-western Thai 
languages. In north-eastern India, the Ahom language 
was a South-western Thai language, and three other 
small groups still speak languages of this subgroup, 
with at least two further languages now extinct. 

Like most South-western Thai groups, the 
Yongren Dai speakers usually call themselves just Tai; 
they are the northernmost Kam-Tai group in China, in 
river valleys of north central Yunnan in Yongren, 
Wuding, Dayao, Huaping and Yongsheng counties, and 
also in Renhe District of far south-western Sichuan. 
About 35,000 Dai nationality people live there, but 
the language is only spoken by about 15,000 of them. 
Unlike other South-western Thai groups, they have 
no Indic-based orthography. They were probably 
brought there as soldiers to guard the northern 
borders of Yunnan over 500 years ago. 

The Dai Neua ‘northern Tai’ live in south-western 
central Yunnan along the Mekong and adjacent river 
valleys; they number about 240,000, but there are 
only about 200,000 speakers. They should not be 
confused with the Tai Daeng or Tai Neua of the Samneua 
area of north-eastern Laos. Dai Neua is the source 
language for Thai loanwords into Lahu and Laomian. 

The Dai Mao, sometimes called Mao Shan, and 
in Burmese called Shan Tayok ‘Chinese Shan’, is a 
group of over 400,000, about 380,000 in Dehong and 
Baoshan prefectures in China and the rest in the 
northern Shan State of Burma. They have a distinct- 
ive ‘square’ orthography with a different appearance 
from those of other Thai groups; this has been 
reformed for use in China. 

Khamti (‘gold place’) is the Thai language of far 
northern Burma, with a few speakers in north-eastern 
India; for more details on this and other South- 
western Thai languages spoken in India, see Terwiel 
(1980-1) and Morey (2002, 2005). Khamti has been 
cited as the only currently-spoken Thai language 
which is verb-final. In fact the word order is largely 
pragmatic, but SOV is a frequent possibility in 
Khamti and two other South-western Thai languages 
spoken in India, Phake and Khamyang, along with 
SVO and less frequently OSV. SVO is usual and OSV 
is possible in other Thai languages. Khamti uses a 
slightly different version of the Shan orthography, Lik 
Tai. There is also a revised script (Morey 2005: 196). 
Of about 70,000 speakers, some 11,000 are scattered 
over north-eastern India; some were resettled in 
Assam, away from the border, by the British after 
rebellions during the nineteenth century, but most 
still live in Arunachal Pradesh. This group is also 
known as Khamti Shan; their language appears closer 
to Dai Mao than to Shan. 

There are three further groups of Thai speakers 
found in north-eastern India. The Phake group, with 
over 2,000 speakers, are Buddhist and maintain con- 
tact with Burma; they also use a version of the Lik Tai 
orthography; in Assamese they are known as Phakial. 
The Khamyang are a small group of about 50 mainly 
elderly speakers adjacent to the Phake; there are far 
more members of the ethnic group, but most no 
longer speak their traditional language. It is quite
similar to Phake; both are syntactically similar to
Khamti in having SOV as a frequent possibility. There 
are perhaps as many as 1.8 million Ahom and smaller 
numbers of Turung, Nora and other groups who no 
longer speak their South-western Thai languages. The 
Turung may have formerly spoken a South-western 
Thai language, most possibly similar to Aiton, but 
some now speak a very interesting divergent variety 
of Jinghpaw/Singpho under investigation by Stephen 
Morey. While there are a few Ahom deodhai ‘priests’ 
who have some ability to read religious texts, it 
appears that Ahom has not been an everyday spoken 
language for nearly two centuries; however, revival 
attempts are under way. The Aiton group, with up to 
2,000 speakers, are isolated near Jorhat in India and 
also use another orthography derived from Lik Tai; 
their language has phonological characteristics simi- 
lar to languages further south, and much less ten- 
dency to SOV than Khamti, Phake and Khamyang. 

Shan, an extremely large, widespread and diverse 
group, is one of the major nationalities of Burma. 
The total Shan population is well over three million, 
but this includes the Khamti, Khyn, Ly and Mao 
groups within Burma, and many ethnic Shan who 
speak only Burmese. Including relatively small 
numbers in China and north-western Thailand, there 
are about 2.6 million Shan speakers; dialect differ- 
ences are very substantial, especially between the 
more Burmanized Shan of south-western Shan State 
and outside the Shan State, versus those of northern 
Shan State. The traditional Lik Tai orthography, 
derived from that of Burmese, is quite inadequate for 
the vowels and tones of Shan, but is still used in Shan 
Buddhist activities. The 1970 revised orthography is 
also taught and used in some Buddhist temples in the 
Shan State, but not in government schools. The Shan 
name for themselves is [tai]; Shan is a Burmese 
exonym. 

Khyn is the language of the Kengtung area, the 
eastern corner of Burma, with its own distinct writing 
system. It has about 150,000 speakers, including a few 
in north-eastern Laos and northern Thailand. It is 
fairly similar to the adjacent southern dialects of 
Shan, but the script is more similar to that of Kam 
Myang in adjacent Thailand. 

Yuan (Nyuan) or Kam Myang ‘town words’, 
better-known in English as Northern Thai, is the lan- 
guage of north-western Thailand, the traditional area 
of the Lanna (‘million rice fields’) kingdom centred 
on Chiangmai. It extends slightly into parts of Burma 
and north-western Laos (33,940 people), but nearly 
all of its eight million speakers are in Thailand. Its 
traditional orthography, similar to that of Khyn, 
almost went out of use, but is now staging a 
comeback. There are substantial dialect differences 
between Chiangmai and the rest of the Ping valley as 
contrasted with areas further east such as Lampang, 
Chiangrai, Nan and so on, especially in the tone 
system; for details see Brown (1985) and Davis 
(1970). 

Lue, variously Romanized (also Ly, Lyy), is a lan- 
guage centred on Jinghong, the Sipsongphanna 
(‘12,000 rice fields’; also represented in Chinese as 
Xishuangbanna) in southern Yunnan, China. Some 
speakers are in northern Laos, far north-eastern 
Burma or far northern Thailand, and a few in north- 
western Vietnam. Some peripheral dialects have con- 
verged with Khyn, with Shan, or with other adjacent 
Thai languages. Its orthography, similar to Yuan, has 
been revised and improved for use in Xishuangbanna 
Dai Autonomous Prefecture around Jinghong. There 
are about 320,000 in China, 102,670 in Laos, 6,472 in 
Thailand and 3,700 in Vietnam, and a total of over 
430,000 speakers. See Hartmann (1984) for more 
information. 

The Dai Sa group also uses an Indic orthography,
the furthest north-east example, and is composed of a
variety of named subgroups along the rivers of south
central Yunnan in Yuxi and Honghe prefectures,
especially in Xinping, Yuanjiang, Mile, Jinping and
Yuanyang counties. From north to south, these 
include Dai Duan, Dai Luomi and Wumeng; they total 
about 170,000 people and about 100,000 speakers. 


The Tai Dam (‘black Tai’) are centred around the 
Dienbienphu area in north-western Vietnam, but also 
extend into nearby areas of Laos. There are about 
7,000 in Jiangcheng County in China, and they are 
scattered far to the south-west into central Thailand 
owing to conquest and deportation by the Bangkok 
Thai in the early nineteenth century. In Thailand they 
are usually known as Lao Song. With the Tai Khaw
and Tai Daeng, they form the Thai nationality of
Vietnam (total population 1,328,725 in 1999); these 
three groups also share a Romanized orthography 
devised in the 1950s and recently revised. There is 
also an earlier Indic orthography in use. They total 
about 750,000 people, but only about 700,000 
speakers; many of the Lao Song in Thailand no longer 
speak the language. 

The Tai Khaw (‘white Tai’) are also sometimes 
known as Tai Don; they live along the Black River in 
north-western Vietnam, and in very small numbers in 
adjacent areas of China and Laos. They total about 
650,000. 

The Tai Daeng (‘red Tai’) live mainly in Samneua 
Province of Laos and nearby in western central Viet- 
nam; they total about 200,000 speakers. Those in 
Samneua Province of Laos are also known as Tai 
Samneua or Tai Neua. This group should not be con- 
fused with the Dai Neua of western central Yunnan, 
China; nor with the Yuan of north-western Thailand, 
often known in English as Northern Thai. 

Phu Tai ‘Tai people’ is a group of 441,497 people
in the river valleys in southern Laos; more than 
100,000 further Phu Tai are widely scattered over 
north-eastern Thailand as well. They total nearly 
550,000 people, but only about 500,000 speakers. This 
is a composite group of various South-western Thai 
groups from north-eastern Laos, with extensive 
internal dialect differences. 

Yo, whose autonym is Nyo, is usually known as 
Yo from the Thai pronunciation of this name. The 
50,000 Yo are scattered over north-eastern Thailand 
and Laos. The Phuan group is scattered around 
central Thailand and, as with many such groups,
the language is being replaced by Thai. They total 
about 80,000. 

Overall, Lao is the variety of Tai-Kadai with the 
largest number of first-language speakers: over 28 
million, with nearly 25 million in Thailand, about 3 
million in Laos, 11,611 in Vietnam, a smaller number 
in Cambodia, and about 100,000 post-1975 refugees 
in France and other Western countries. The Lao 
speakers in Thailand use standard Thai as their 
medium of literacy, but Lao is the national language 
of Laos and has a separate orthography used there. 
There is also a Lao Tham script closer to Khmer, used 
for religious purposes. The local Lao speech of 
the north-east of Thailand is usually called Isan in 
Thailand; it is spoken by about 35 per cent of the 
nation’s population, representing nearly 90 per cent 
of all Lao speakers. While Isan songs and music are 
very popular in Thailand, Isan is not usually written 
with Lao script there. As for many South-western Thai 
languages, there is considerable mutual intelligibility 
between Lao and Thai; especially, of course, for those 
Lao in Thailand who are constantly exposed to Thai. 
Within Lao there are great dialect differences, espe- 
cially in the tonal system, but Brown (1985) showed 
that these are recent. Lao themselves tend to cate- 
gorize themselves as Lao Isan (from Thailand) versus 
Lao Wiang (from Wiangcan ‘moon city’ or Vientiane; 
that is, from Laos), but linguistically the basic division 
is into 1) north-western (Luang Phrabang, the old 
royal capital of Laos; also some dialects in north 
central Thailand), 2) southern (most of the rest, 
including the Mekhong valley from Vientiane south- 
wards as well as most of Thailand’s north-east), and 
3) Korat (centred around Nakhon Ratchasima, locally 
known as Korat). Here, the distinctive Korat variety is 
treated as a separate major variety with two million 
speakers. It is transitional between Lao and Central 
Thai. 

Thai is the national language of Thailand, 
spoken natively in the central region around Bangkok 
by about 26 million people; as a second Thai variety it 
is also spoken by about another 42 million. It is also
the medium of literacy, education, government and 
the media throughout the country. There are also 
more than 100,000 Thai speakers living overseas in 
Western countries, mostly the United States. Its 
orthography is Indic and derived from the Khmer 
orthography. The famous King Ramkhamhaeng 
inscription of AD 1283 is said to be the first inscrip- 
tion, though there is now some doubt about its 
authenticity. Thai is gradually spreading to the north, 
replacing Yuan around Tak and elsewhere, and influ- 
encing all other Thai languages of Thailand. The 
western central region of Suphanburi, Kanchanaburi 
and Ratburi merges the rising tone into the high fall- 
ing tone, but otherwise there is little internal dialect 
diversity. For a full description of Thai, see Iwasaki 
and Ingkaphirom (2005). 

From Chumphon and southward into southern 
Thailand, and in the southernmost part of Burma 
and in north-eastern Malaysia, a rather different 
variety of South-western Thai with extensive local 
dialect diversity is spoken by about seven million 
people. Known as Pak Tay ‘southern dialect’ or in 
English Southern Thai, it is historically quite separate 
from standard Bangkok Thai with many tonal, seg- 
mental and lexical differences; however, there is no 
separate Pak Tay script. Of its speakers, over two mil- 
lion are bilingual in northern Malay, especially those 
in Satun, Pattani, Yala and Narathiwat, the four 
southernmost provinces of Thailand. There is also 
a divergent variety, known as Tak Bay, spoken in 
the south of Pattani and Narathiwat provinces in 
Thailand and in Kelantan in Malaysia. This is locally 
known as Chehe, and has a divergent tonal system. 


5.3.1.4.1 Bangkok 


Bangkok is the capital of Thailand; it is located in 
the lower reaches of the Chao Praya River. After the 
Burmese conquest and sack of the previous upriver 
capital of Ayudhya in early 1767, the Thai general 
Taksin re-established a Thai kingdom based at 
Thonburi, across the river to the west of Bangkok, 
later in 1767; in 1782 his general, Chakri, the founder 
of the present Chakri dynasty, moved the capital 
across the river to Bangkok (Krungthep in Thai) and 
established himself as King Rama I. The present king 
is Rama IX; the absolute monarchy was replaced by a 
constitutional monarchy in 1932. 

Thailand has never been colonized, but before 
and during the Chakri dynasty it welcomed large 
numbers of foreigners, including Portuguese and 
French, Mons from Burma, Persians, Arabs and 
various Indians, and also (from the early nineteenth 
century) large numbers of Chinese from the south- 
east coast of China, primarily Chaozhou. They ini- 
tially settled mainly in Bangkok. After successful wars 
against various neighbours, large populations of war 
captives were brought and settled in areas that are 
now part of greater Bangkok, including many Khmers 
and Laos who have now totally assimilated, some 
Chams from north-eastern Cambodia, and various 
others. 

There are many districts of Bangkok that still 
have concentrations of particular ethnic groups: 
the Indians, especially Punjabis, in Pratunam; the 
Chinese in Yaowarat; Mons at various locations, such 
as along Khlong Mon (‘Mon canal’) in Thonburi, as 
well as downriver and upriver on the west bank; the 
Muslim Chams in the area of Pathumwan; various 
other Muslim groups including Arabs around Yan- 
nawa, and so on. Some types of business are domin- 
ated by particular non-Thai ethnic groups: Punjabi 
Sikhs in the cloth and tailoring business, Arabs in the 
gem business, Chams in the silk industry, Hindu 
astrologers, and Chinese in the gold and rice milling 
businesses and most other areas of economic life. 
Some of these ethnic groups continue to bring in 
spouses from their original countries and maintain 
their languages and religions in addition to Thai; 
others have intermarried, are assimilated and speak 
only Thai. The greatest degree of intermarriage was 
between Chinese men and Thai women, and most of
the economic activity of Thailand is now controlled
by Sino-Thai families. 

Apart from Central Thai, the three main regional 
varieties of Thai are widely spoken in Bangkok by 
people from those regions and usually their Bangkok- 
born children: Yuan or Kam Myang from the north, 
Lao or Isan from the north-east and Pak Tay from the 
south of the kingdom. Most people from those 
regions who live in Bangkok are bidialectal and use 
Central Thai when speaking with members of other 
groups. 

The definition of Bangkok varies. Minimally, 
there is the Bangkok Metropolitan Area on the east 
bank of the Chao Praya. There is also the urban cluster
consisting of this plus five other provinces surrounding
it: Thonburi on the west bank, Nakhon
Pathom further west, Pathum Thani to the north of
Thonburi, Samut Prakan to the south-east, and
Samut Sakhon to the south-west. The urbanization of
these five provinces varies somewhat: Thonburi is
entirely within the urban area, as are the eastern part
of Nakhon Pathom, the southern part of Pathum
Thani, the port area of Samut Prakan and the north- 
eastern part of Samut Sakhon. The spread of Bangkok 
into surrounding provinces over the last fifty years 
left many villages embedded within the urban area, 
but most have been redeveloped, leaving behind a few 
Buddhist temples and some old houses surrounded 
by wide roads and new commercial and housing 
developments. The spread paused for a few years 
after the economic shock of 1997, but has resumed 
and threatens to engulf parts of additional nearby 
provinces such as south-eastern Suphanburi, western 
Chachoengsao and western Nakhon Nayok. 

For official statistics on Bangkok and Thailand in 
general, see the website of the National Statistical 
Office, <nso.go.th>. On the narrow definition
of Bangkok, its population at the 2000 census was 
6.32 million; on the wider definition, the six prov- 
inces centred on Bangkok had a 2002 population of 
9,668,872 according to Ministry of Interior registra- 
tion statistics. These figures somewhat understate the 
population; there are many illegal workers from sur- 
rounding countries, large numbers of foreigners with 
residence permits of various types, and many Thais 
registered elsewhere but now living and working in 
Bangkok. An estimated total for the urban area, 
including the five surrounding provinces listed above 
and people who are not counted in official statistics, 
would be at least ten million. 

Thai official statistics do not separate urban 
people according to ethnicity, so there are no official 
figures for populations of Chinese or part-Chinese, 
Indians, Arabs, or any other ethnicity. The figures 
given in Table 5.7 are all estimates. The Thai hill tribe 
category includes the TB languages Akha, Karen, 
Lahu and Lisu, also Gong and Bisu (included in the
Lawa category), as well as the MK languages Khmu,
Htin and Lawa, and the Miao-Yao languages Miao 
and Yao; all are present in Bangkok in very small 
numbers. 

Among the Sino-Thai population, there are a few 
Yunnanese Muslims who speak south-western Man- 
darin, but those are concentrated more in the north 
of Thailand. There are also some Hakka, but the 
vast majority come from the Chaozhou region of 
the north-east coast of Guangdong, and speak the 
southern Min variety known as Teochiu, from the 
local name for Chaozhou. Sino-Thais who are literate 
in Chinese use traditional characters. There is a small 
sprinkling of other Chinese varieties, but the Teochiu 
are well over 95 per cent of the total and their variety 
is the lingua franca among the Chinese community. 
However, many younger Sino-Thai are now several 
generations from their Chinese roots and have Thai 
relatives as well; some have passive understanding 
of Teochiu, but many do not. There are also small 
numbers of recent arrivals from China. Among the 
Hindu population there are small numbers of 
speakers of a variety of languages: Indic Nepali, 
Hindi, Bengali, Dravidian Tamil and so on. Some are 
long-established and assimilated, others maintain 
languages and contacts overseas. The small Persian 
community is long-established and mainly assimi- 
lated. Much of the Burmese and Lao population and 
some recent Chinese arrivals are working in Thailand 
illegally. Among the Khmers, some are completely 
assimilated and identify as Thai, some are long- 
established citizens of Thailand from the north-east 
who still identify as Khmer, and some are recent 
illegal arrivals. 


5.3.1.5 Kadai 


The Kadai languages form one of two branches of 
Tai-Kadai, a component along with Austronesian 
and Miao-Yao of Benedict’s Austro-Thai; the term 
Kadai was coined by Benedict from the human 
prefix *ka- and the general term for the Dai group. 
More recently, Ostapirat (2000, 2005) has proposed 
to replace Kadai with the term Kra. Most of the 
Kadai languages are spoken by small populations in 
widely scattered areas, and some have extensive 
internal diversity. All are SVO and tonal; none is 
currently written. For a reconstruction and genetic 
classification of Kadai, which he prefers to call Kra, 
see Ostapirat (2000). This groups Gelao, Qaw and 
Lachi in Western Kadai, which together with Laha 
forms South-western Kadai. Buyang, Yerong and 
Pubiao together form Eastern Kadai, which with 
the Paha variety of Buyang forms Central-Eastern 
Kadai. 

Table 5.7 Bangkok: ethnicity and language (including five surrounding provinces, 2005 estimates; numbers in 1000s)

                    Speakers of ethnic/      Speakers of
                    regional language        Central Thai       Bangkok total
                    number       %           number      %      %
Thai
  Central           n/a          n/a         4,700       100    47
  Northern          400          80          500         100    5
  Isan              900          90          1,000       100    10
  Southern          200          90          200         100    2
Sino-Thai           1,800        60          3,000       100    30
Mon                 3            2           150         100    1.5
Cham                0            0           5           100    0.05
Arab                30           100         30          100    0.3
Punjabi             45           90          50          100    0.5
Hindi/Urdu          20           67          30          100    0.3
Burmese             50           100         40          80     0.5
Khmer               50           50          90          90     1
Lao                 20           100         20          100    0.2
Thai hill tribe     10           100         10          100    0.1
Thai Malay          5            100         5           100    0.05
other               150          100         100         67     1.5
Total/% of total    3,683        36.8        9,930       99.3   100

The Gelao (Chinese) or Kolao (Vietnamese)
group are widely scattered, with a few in southern
Sichuan, most in western Guizhou, and a few in
Guangxi, Yunnan and north-eastern Vietnam. Of 
597,357 Gelao enumerated in the 2000 census of 
China and 1,865 in Vietnam in 1999, there are only 
about 5,000 speakers of the four surviving lan- 
guages, mainly elderly. Curiously, some Gelao lan- 
guages survive better among communities that have 
moved south from the original Gelao area in 
Guizhou to Guangxi, Yunnan and especially to Viet- 
nam. The four languages included within Gelao are 
Aqaw with a few elderly speakers, Hakhi or Green 
Gelao with about 2,000 speakers, Tolo or White 
Gelao with about 3,000 speakers and A-uo or Red 
Gelao with about 50 speakers. Speakers of some 
other Kadai languages (Qaw and Pubiao) are also 
classified as Gelao in China. For an overview, see He 
(1983). 

Buyang is a small group of about 3,000 people, 
living in Gula Township, Funing County, south- 
eastern Yunnan Province and classified in the Zhuang 
nationality, who speak four different Kadai varieties. 
The Paha variety of Buyang is quite distinct, though 
ultimately related as Central Kadai. 

Pupeo and Pubiao are the Vietnamese and 
Chinese names for an ethnic group of 705 people in 
Dong Van District, Ha Giang Province, north-eastern 
Vietnam and 307 members of the Gelao nationality 
in Malipo County, Yunnan, China, but there are 
only about 400 speakers of Pupeo/Pubiao, mainly in 
Vietnam. It was formerly known as Laqua; the 
group’s autonym is Kabeo. 

Laha is spoken by about 1,000 of the 5,686 Laha 
enumerated in Vietnam in 1999; another name for 
them is Khlaphlao. Laha in Son La Province is mori- 
bund, but in Lao Cai Province it is still spoken; the 
two dialects are similar but not identical. 

There are 1,643 Laji included in the Yi nation- 
ality in Maguan County, Yunnan, China and an 
ethnic group of 10,765 Lachi in various areas along 
the northern border of Vietnam. There are about 200 
speakers of Lachi/Laji in China and about 5,000 
speakers of Lachi/Laji in Vietnam. Their autonym is 
Lipulio; early French sources refer to them as Lati. 
There is considerable internal diversity within Lachi/ 
Laji; Ostapirat recognizes three subvarieties. 

Mulao is a group of about 20,000 widely scat- 
tered people in Majiang and surrounding counties in 
eastern Guizhou, China, who call themselves Ayo. 
This is not the same as the Kam-Tai Mulam or Mulao 
language further to the south with a similar name. 
Most of the Kadai Mulao are unclassified for nation- 
ality, but some are classified in the Mulao Kam-Tai 
nationality. Nungven or Ainh is spoken in one village 
of 200 people in Ha Giang Province, Vietnam. It 
is less moribund than many other similar Kadai 
languages. Ostapirat does not include Mulao or 
Nungven in his classification of Kadai. 

Qaw is the northernmost Kadai language, with at 
most a few elderly speakers among a group of 3,000 
people classified as Gelao in Gulin County, Sichuan 
and Bijie County, north-western Guizhou. It is closely 
related to Gelao. 

Yerong is spoken by over 300 people in Napo 
County, Guangxi, China; the speakers are classified as 
members of the Yao nationality, and some also speak 
Yao. Their language is closely related to Pupeo/ 
Pubiao. 


5.3.2 Austronesian 


The Austronesian languages spoken in this area 
include the indigenous languages of Taiwan and two 
subgroups of Western Malayo-Polynesian: Chamic
and Malay. The Austronesian languages of Taiwan are 
extremely genetically diverse and can be grouped into 
nine major subgroups, plus the Yami language. All 
other Austronesian languages, including those dis- 
cussed under Australasia and the Pacific as well as 
Chamic, Malay, and the indigenous languages of 
Madagascar, are in one subgroup most closely related 
to the East Coast Linkage of Taiwan, and are some- 
times instead known as Malayo-Polynesian. The Yami 
language of Taiwan is most closely related to Malayo-
Polynesian languages of the Batanic or Ivatan group
in the north Philippines. 

The eight Chamic languages are mainly in 
southern Vietnam, with a few speakers of Cham 
in adjacent areas of Cambodia, one language on 
Hainan Island, and a few refugees from Cambodia in 
Malaysia and the United States. Within Austronesian, 
it has been suggested that the Chamic group fits most 
closely with Aceh of North Sumatra. For more details 
of the other Austronesian languages, see the maps for 
Australasia and the Pacific. 

Malay is of course the local language of much 
of Malaysia, especially the coasts, and extends to 
adjacent areas of Indonesia and Brunei. It has been a 
very widespread lingua franca for many centuries, 
and is the basis for Bahasa Malaysia and for Indone- 
sian. It has various local dialects, mostly with very 
minor differences apart from some varieties spoken 
by a few groups of fishermen collectively known as 
Orang Laut (Malay) or Chao Thalee (Thai), both 
of which mean ‘sea people’, along the west coast of 
Thailand and southern Burma. Even these are not too 
divergent. 


5.3.2.1 Taiwan Austronesian 


The Taiwan Austronesian languages are grouped in 
the Gaoshan ‘high mountain’ nationality in China, 
where there were 4,461 people in 2000 from a variety 
of language groups, many of whom no longer
speak their traditional languages. On the mainland, 
the largest concentration is 18 per cent in Fujian 
Province, across the straits from Taiwan. In Taiwan 
they are classified into eleven groups and two residual 
‘other’ categories, mainly living on the east coast of 
Taiwan and the adjacent mountains, with a total 
population of 464,272 on 30 November 2005. For a 
historical picture of the distribution of these groups, 
see Wurm and Hattori (1981: Map 30). Most of the 
languages of the west coast and adjacent plains are 
now extinct; this atlas shows the current situation. 
Population statistics from Taiwan include indigenous 
people and groups who do not speak their trad- 
itional languages. Fortunately, many of the extinct 
languages are fairly well described, thanks to the 
efforts of Taiwanese, Japanese, Dutch and other 
scholars. 

The nine subgroups of Taiwan Austronesian lan- 
guages are Atayalic, Eastern, Puyuma, Paiwan, Rukai, 
Tsouic, Bunun, Western Plains and North-western, as 
well as the Yami language of Lanyu Island to the 
south-east. Various subgroupings among these nine 
groups have been proposed. The Atayalic group 
includes the various dialects of Atayal and Seediq, 
with a total group population of 75,309, about two- 
thirds of whom are Atayal. The Eastern group 
includes Amis with 155,897 people; Kavalan with 901 
people; Basay and Trobiawan, languages extinct 
about 1940; and Siraya, a language extinct about 1820 
but recorded by Dutch missionaries and recently 
described from those materials by Adelaar (2004). 
The Puyuma group has 9,046 people, Paiwan has 
77,381, and Rukai has 10,437; Rukai has a number of 
rather distinct dialects. The Tsouic group includes 
Tsou with 6,094 people and various fairly distinct dia- 
lects, as well as Kanakanavu and Saaroa, which are 
not enumerated separately but which are small 
groups of a couple of hundred people each. Bunun 
has 45,495 people, and the Western Plains languages 
include extinct Taokas and Babuza, Papora and 
Hoanya whose languages became extinct in 1969, and 
Thao with 569 people. The North-western group 
includes Saisiyat with 5,242 people and smaller 
unenumerated groups Pazeh and Kulon; the latter 
language is extinct. The Yami are enumerated at 
2,709; for an excellent recent grammar, see Rau 
and Dong (2006). Taiwan population statistics list 
20,564 aboriginal people as members of unspecified 
West Coast groups and another 54,658 as ‘other’. 
There are insufficient data to classify the extinct 
Ketangalan, Taivuan, Makatao and Qauqaut lan- 
guages linguistically. 

Many of the people who have indigenous status
in Taiwan do not speak their traditional Austronesian
language; this is also a politically contentious issue, 
with the government wanting to institute a language 
test to determine indigenous status. The most 
endangered language is Pazeh with one speaker who 
is 92. Thao has about 10 speakers, Saaroa has about 
12, and Kanakanavu has under 20; all these are aged 
over 60. Kavalan has middle-aged speakers in one 
village, Saisiyat has middle-aged speakers in several 
villages. In most other communities, there are far 
fewer speakers than group members, and few children 
using the language; for example, the Mantauran dia- 
lect of Rukai has one village of over 300 people, but 
only 50 speakers and no children (Zeitoun 2003). On 
the other hand, there are some villages of various 
groups where children do speak the language, for 
example one Yami village. 


5.3.2.2 Chamic 


The Chamic languages can be traced back to the 
Champa kingdom of what is now southern Vietnam 
in the fifth to eighth centuries AD; this kingdom was
gradually overcome by the Vietnamese and Khmer
over the following millennium. This Austronesian
cluster was clearly a fairly late arrival in the area, as it 
intrudes between closely-related branches of Bahnaric 
MK languages; the arrival of the Chamic groups by 
sea can be dated to approximately 2,000 years ago. 
There are eight Chamic languages spoken now, seven 
in a compact area of Vietnam and Cambodia, and 
one in China. 

Cham, Roglai (two languages, northern and 
southern), Chru, Jarai, Haroi and Rhade are spoken 
by montagnard groups of southern central Vietnam, 
with Jarai and Rhade extending into Cambodia. 
Apart from Jarai and Rhade, which are further 
inland, the Chamic languages of Vietnam are spoken 
in a narrow strip behind the coast to the south of Nha 
Trang. The western dialect of Cham is also spoken by 
a substantial Muslim community in eastern central 
Cambodia (substantially reduced by the Khmer 
Rouge in the late 1970s), with a few thousand refu- 
gees from the Cham community in Cambodia now in 
Malaysia and the United States. Tsat is spoken in 
Hainan, China. 

The Champa Kingdom used an Indic-based 
Cham orthography. After their conversion to Islam 
about 500 years ago, an Arabic script for Cham 
came into use, and several Roman orthographies 
were developed for various Chamic languages in 
Vietnam from the 1960s. The total number of 
Chamic language speakers is nearly a million, of 
whom over 250,000 (including 132,873 in Vietnam 
and the rest in Cambodia) speak varieties of Cham, 
about 330,000 speak Jarai (including 317,557 in
Vietnam), with smaller numbers of Rhade (about 
275,000 including 270,348 in Vietnam, also known 
as Ede in Vietnamese), North Roglai (40,000), South 
Roglai (30,000), Chru (14,978, not to be confused 
with the nearby MK language Chrau), and Haroi 
(27,000, included in Vietnamese census statistics 
with Roglai). 

Tsat is spoken by about 4,000 members of a 
group of 5,000 people in two villages of Sanya 
County, Hainan, China; it has diverged substantially 
from the other Chamic languages, for example by 
becoming tonal. The migration of the Tsat to Hainan 
from southern Vietnam can be dated to AD 982, with
further arrivals about AD 1471 (Thurgood and Li
2003). The Tsat are classified as part of the Hui 
(Chinese Muslim) nationality as they are Muslim, and 
are locally called Huihui in Chinese, which is simply a 
reduplication of the term for Chinese Muslims 
(Ouyang 1983). Most Chamic language speakers are 
more or less bilingual in the national languages 
where they live, which are Vietnamese, Khmer or 
Chinese. 


5.3.2.3 Malay 


The core area of Malay is the straits between Malaysia 
and Indonesia, but Malay has spread much further 
afield, along coasts far to the east in East Malaysia
and Indonesia. The most divergent languages of this 
subgroup in this area are Moklen and Moken, 
spoken by two small groups of nomadic fishermen 
further north than any other Malay variety. One is the
2,000 Moken, along the south coast of Burma and 
into Thailand as far south as Phuket; they are known 
as Saloun to the Burmese. The other is the 4,000 
Moklen to the south, along Thailand’s west coast. 
Just to the south of these and down to the Malaysian 
border are more such fishermen who speak a 
divergent Para-Malay language called Urak Lawoi' 
‘sea people’; there are about 3,000 speakers (Hogan 
1988). Some of all three groups are found on the 
Thai island of Phuket; in Thai, all three are called 
Chao Thalee ‘sea people’. These and various other 
itinerant fishermen are also known by the collective 
Malay name Orang Laut ‘sea people’; some of these 
other groups also speak divergent Para-Malay lan- 
guages, such as the Duano' living just to the north- 
west of Singapore; but others speak local varieties of 
Malay. There are also many fully bilingual speakers 
of Pak Tay (Southern Thai) and local Malay, espe- 
cially North-western Malay in Satun Province in 
the southern part of the west coast of Thailand, but 
also North-eastern Malay further east. In addition, 
the somewhat different Malay dialects spoken by 
Aboriginal groups (Orang Asli ‘Aboriginal people’: a 
pejorative alternative name is Jakun) should be 
noted. Of course many Orang Asli retain their 
original MK languages, which are briefly discussed 
here but mainly shown in the Australasia and the 
Pacific section. In Malaysia, the national language has 
been superposed on all MK and Para-Malay varieties, 
as has Indonesian on the various Austronesian lan- 
guages of Indonesia. In Indonesia, at least, the local 
languages are nearly all quite different; but in pen- 
insular Malaysia the similarity has led to interesting 
processes of code- and dialect-mixing for many 
speakers. 

Malay has been written for many centuries, 
beginning with Jawi, an Arabic script, and more 
recently in various British and Dutch Romanizations. 
Indonesian and Bahasa Malaysia are now written in 
an identical compromise Romanization, introduced in
1972, but Jawi is also used, mainly for religious pur- 
poses. The vast majority of Malay speakers live in 
the area of adjacent maps, in peninsular Malaysia, on 
the coast of East Malaysia, in Brunei and Indonesia, 
on the islands between Malaysia, Singapore and 
Sumatra, and elsewhere. As the national language of 
Malaysia, Indonesia and Brunei, Indonesian/Malay 
in the wider sense is also spoken by most of the 
population of those countries, at least as a second 
language. Malay nationalism is also on the increase 
in southern Thailand, where the local dialect of 
Satun Province is similar to adjacent North-western 
Malay, while in Pattani, Yala and Narathiwat 
provinces, the local dialects are similar to adjacent 
North-eastern Malay. 


5.3.3 Miao-Yao 


Older sources classify these languages as a subgroup 
of Sino-Tibetan, but this is incorrect. One hypoth- 
esis, that of Benedict (1975), groups them instead 
with Tai-Kadai and Austronesian in Austro-Thai; 
other scholars prefer to regard Miao-Yao as a 
separate family, unrelated to any of the surrounding 
groups. There are two major subgroups within 
Miao-Yao: Miao and Yao; She and Bunu are add- 
itional subgroups. Miao is more conservative for 
initials, while Yao is more conservative for rhymes. 
The genetic position of She languages appears to be 
closer to Miao, but some scholars group them with 
Yao instead. The fourth language cluster in this 
group, Bunu, is included within the Yao nationality 
in China, but the Bunu languages are also lin- 
guistically closer to Miao. Internal differences within 
each language cluster, especially Miao and Bunu, are 
very substantial. This is quite usual in China, where 
the criteria for defining yuyan ‘language’ and 
fangyan ‘dialect’ are parallel to the classification
of Chinese, which is based on linguistic similarity
and historical connection, rather than mutual 
intelligibility. 

By the criteria of intelligibility and group atti- 
tudes, there are three subclusters of Miao varieties 
with a total of twelve or more languages, five clusters 
of Bunu languages, one including four closely-related 
languages and the other four with one each, and two 
remaining She languages. Within Yao, the internal 
diversity is less than in Miao or Bunu, but there are 
still two clusters with a total of six languages. The 
overall total is thus at least 28 Miao-Yao languages. 
All are SVO, and in general syntactically similar to 
Chinese. In some recent Western literature, the new 
term Hmong-Mien is used instead of Miao-Yao. This 
new term is derived from the autonyms of the Miao 
in Laos, Hmong, and the autonym of the largest Yao 
group, Iu Mien, rather than from Chinese exonyms. 
Mapping the internal diversity of the Miao-Yao 
languages is not possible with currently available 
information, so the presentation in this atlas is 
simplified. 
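The overall total of at least 28 languages given above can be checked by adding the per-cluster counts stated in the text. The short Python tally below is only an illustrative sketch: the dictionary name and its layout are assumptions made for this example, and the figures are the minimum counts quoted above.

```python
# Minimum language counts per cluster, as stated in the text above.
# Bunu: one cluster of four closely related languages,
# plus four further clusters with one language each.
miao_yao_languages = {
    "Miao": 12,
    "Bunu": 4 + 4 * 1,
    "She": 2,
    "Yao": 6,
}

total = sum(miao_yao_languages.values())
print(total)
```

Because the Miao figure is given as "twelve or more", the sum is a lower bound rather than an exact count.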

The Miao-Yao groups were a very large com- 
ponent in the Man or ‘southern barbarian’ tribes, 
pushed south-westwards over millennia by the 
Chinese. In fact the term Man is sometimes still used 
to refer to the Yao, whose autonym, which also means 
‘people’, may be related to this term. The arrival of 
Miao-Yao in South-East Asia is relatively recent, only
over the last century or two. Since 1975, substantial 
numbers of refugees have also come from Laos to 
Western countries. Many more members of these 
ethnic groups have, over the centuries, become part 
of the Han Chinese population. At present well over 
five million people who do not speak a Miao-Yao
language are still recognized as members of these
nationalities in China: over four million Miao, about
a million Yao, some Bunu and all but a thousand of
the She. Most of the non-speakers who are members
of these nationalities speak a local variety of Chinese,
including South-western Mandarin, Kejia or Yue, and
many Bunu speak the Tai language Zhuang as well.
Non-speakers account for much of the apparent
population increase of these groups between the 1982 
and 2000 censuses. 


5.3.3.1 Miao 


Miao, with over 5.3 million speakers, is divided into 
at least twelve languages, in three main groups; for 
more details, see Niederer (1998) and Wurm, T'sou 
and Bradley (1987). Qoxiong (two languages) is 
spoken by about 800,000 Miao in south-western 
Hunan and north-eastern Guizhou provinces; Hmo 
varieties (three or more languages) are spoken by 
about 1.5 million Miao scattered across Guizhou 
Province; and Hmau or Hmong varieties (up to 
seven languages) are spoken by about 1.8 million 
Miao in western Guizhou, Yunnan and southern 
Sichuan provinces in China. There are over 1.2 mil- 
lion more speakers of Hmong outside China: in 
northern Vietnam (787,604) where they are known 
as Meo; in northern Laos (over 230,000) and Thai- 
land (over 125,000) where they are known as Maew; 
a couple of thousand in north-eastern Burma, where 
they are known as Myaing; and over 100,000 refugees 
from Laos living in the United States, France, French 
Guiana, Canada, Australia and so on, who prefer to 
be known as Hmong. It is not possible to map the 
Miao adequately on Map 46, as they are extremely 
widely scattered in very small groups; nearly every 
county in the western two-thirds of Yunnan has a 
couple of small villages, and there are even a few in 
southern Sichuan. Of the 8.9 million Miao in the 
2000 China census, nearly half do not speak any kind 
of Miao. 

The most widespread Miao variety, which is 
the only one spoken outside China, is Hmong. This 
includes the Hmong Daw ‘white Hmong’ dialect of
Laos, also spoken by refugees from Laos. There is
also the Mong Njua ‘blue/green Mong’ dialect as
spoken by most of the Mong in Thailand, who have 
lost voiceless nasals, as seen in their autonym. 


There were various Pollard scripts (Enwall 1994) 
developed for different varieties of Miao from the 
early 1900s, and some are still in use among Miao 
Christians in China. A Romanization developed in 
the late 1950s, for Hmong as spoken in Laos, is 
in general use among the refugee community out- 
side Laos, and a number of Romanizations were 
developed for varieties of Miao in China from the 
1950s onward, with three still in use. Another 
Romanization came into use in Vietnam in 1961. An 
indigenous Hmong script was developed in Laos 
starting in 1959 by the messiah Shong Lue (Smalley et 
al 1990), and is used in some overseas communities 
(Eira 2002). 


5.3.3.2 She 


There were 709,592 She in 2000, but of those only 
about 1,000 in two areas, one cluster on the border 
of Boluo and Zengcheng counties about 100 km
(60 miles) north of Hong Kong and another on the 
border of Huidong and Haifeng counties about 
100 km (60 miles) north-east of Hong Kong, all in 
Guangdong Province, still speak the two She lan- 
guages, called [huo nte]. The rest speak only Chinese, 
including Kejia, Yue and Mandarin. 


5.3.3.3 Yao 


Yao includes at least six languages with a total of 
about 1.65 million speakers worldwide. While all the 
Bunu, the Tai-Kadai Lajia and over a million ethnic 
Yao who do not speak Yao are included in the 2000 
China Yao population of over 2.6 million, the 50,000 
Miao of Hainan Province who actually speak a variety 
of Yao are not. There are a further 620,538 Yao in 
Vietnam, known as Dao; over 18,000 in Laos and over 
48,000 in Thailand, and as many as 25,000 post-1975 
refugees from Laos in Western countries. The Western 
variety (with two subvarieties: Iu Mien and Kim 
Mun) is spoken by about 85 per cent of the total; for 
details see Mao et al (1982). Other varieties are 
closely related Biaoman and Jinmen, and less closely 
related Biaomin, Jiaogongmin and Zaomin. Mien, 
mun, man, men and min are different forms of the 
word for ‘people’. Yao men traditionally used written 
Chinese for liturgical and other purposes. From the 
1930s, various different Thai-based scripts for the Iu 
Mien dialect of Yao were created in Thailand. From 
the 1950s, various Romanizations were developed in 
China and Thailand, and in 1984 a unified Iu Mien 
Romanization was agreed for use inside and outside 
China (Purnell 1987). 


5.3.3.4 Bunu 


The eight Bunu languages are spoken in China 
by about 700,000 members of the Yao nationality, 
mostly in northern Guangxi and south-western 
Hunan provinces, and by about 1,600 people of the 
Pathen ethnic group of 5,569 in Vietnam. One cluster 
is Bunu itself, which has four closely related lan- 
guages: Bunu, Baonao, Numao and Dongmeng. 
Within Bunu proper, also known as Dongnu, there 
are Nunu and Bunuo dialects, with a total of nearly 
600,000 speakers. Baonao is also known as Nao Klao, 
Nao Gelao or Baiku Yao (‘white trouser Yao’) and has 
about 25,000 speakers. Numao has about 1,500 
speakers, and Dongmeng has about 600; both are 
endangered. The other four more distinct languages 
are Baheng, Ngnai, Younuo and Jiongnai. Baheng has 
about 50,000 speakers, mainly in China but also most 
of the Pathen in Vietnam. Ngnai has about 8,000 
speakers, Younuo has fewer than 4,000 and Jiongnai 
has only 1,500; the last three are endangered. No 
Bunu language has an orthography. 

Table 5.8 shows the numbers of speakers in the 
areas covered by Maps 45 to 50; this does not include 
speakers in the Australasia and the Pacific area, for 
example the much larger number of speakers of 
Indonesian/Malay, but does include migrants from 
the areas covered here who have settled elsewhere in 
the developed world. 
Table 5.8 Austro-Thai languages


Languages Number of speakers 
Tai-Kadai 
Kam-Tai 
Kam-Sui 
Kam (Dong) 1,400,000 
Li 756,000 
Lajia 9,000 
Maonan 40,000 
Sui (Shui) 300,000 
Ten 20,000 
Mak 5,000 
Mulam (Mulao) 30,000 
Tai 
Northern 
Buyi 10,000,000 
Be 500,000 
Yay 65,000 
Saek 25,000 
Central 
Nung 6,000,000 
Tay (Thô) 2,000,000
Caolan 150,000 
South-western 
Yongren Dai 15,000 
Dai Mao (Dehong Dai) 400,000 
Dai Neua 200,000 
Dai Lue (Ly, Xishuangbanna Dai) 430,000
Khyn 150,000
Yuan (Kam Myang, Northern Thai) 8,000,000
Shan 2,600,000
Dai Sa/Dai Duan/Dai Luomi/Bumang 100,000
Tai Dam 700,000 
Tai Khaw 650,000 
Tai Daeng 200,000 
Phu Tai 500,000 
Yo 50,000 
Lao 28,000,000 
Korat 2,000,000 
Phuan 80,000 
Thai 26,000,000 
Pak Tay (Southern Thai) 7,000,000 
Khamti 70,000 
Phake 2,000 
Khamyang 50 
Aiton 2,000 
Ahom extinct 
Nora extinct 
Kadai 
Gelao (Aqao, Green, White, Red) 5,000 
Buyang 3,000 
Lachi or Laji 5,200 
Laha 1,000 
Mulao 500 
Nungven 200 
Pubiao or Pupeo 400 
Qaw 210 
Yerong 300 
Austronesian 
Taiwan Austronesian 
Atayalic 
Atayal 40,000 
Seediq 20,000 
Eastern 
Amis 125,000 
Kavalan 100 
Basay extinct 
Trobiawan extinct 
Siraya extinct 
Puyuma 6,000 
Paiwan 60,000 
Rukai (dialects) 8,000 
Tsouic 
Tsou (dialects) 4,000 
Kanakanavu <20 
Saaroa 12 
Bunun 35,000 
Western Plains 
Taokas extinct 
Babuza extinct 
Papora extinct
Hoanya extinct
Thao 10 
North-western 
Saisiyat 200 
Pazeh 1 
Kulon extinct 
Malayo-Polynesian 
Ivatan (Batanic) 
Yami 2,700 
Chamic 
Cham 250,000 
Chrau 12,000 
Haroi 27,000 
Jarai 330,000 
Rhade 275,000 
N. Roglai 27,000 
S. Roglai 30,000 
Tsat 4,000 
Malayic 
Moken 2,000 
Moklen 4,000 
Urak Lawoi' 3,000 
Malay 2,000,000 
North-western 700,000 
North-eastern 1,300,000 
Miao-Yao (Hmong-Mien) Group
Miao 5,300,000 
Qoxiong 800,000 
Hmo 1,500,000 
Hmau/Hmong 3,000,000 
She 1,000 
Yao 1,650,000 
Bunu 690,800 
Bunu 600,000 
Baonao 25,000 
Numao 1,500 
Dongmeng 800 
Baheng (Pathen) 50,000 
Ngnai 8,000 
Younuo 4,000 
Jiongnai 1,500 


5.4 Japanese 


Japanese is the national language of Japan; the stand- 
ard variety is based on educated Tokyo speech and is 
used in education, government and the media. There 
are also many regional dialects, as discussed below. 
The speech of the Ryukyu Islands is regarded as a 
distinct Ryukyuan language by most Japanese lin- 
guists. Standard Japanese is spoken as a second dia- 
lect by nearly everyone in Japan and the Ryukyus, and 
among much of a widespread Japanese diaspora in 
the United States and Latin America. The total popu- 
lation of Japan was just under 127.7 million at the 
2004 census. This includes some 1.6 million ethnic 
Koreans, about 1.65 million speakers of Ryukyuan 
and the Ainu. All of them speak Japanese; many 
ethnic Koreans and most Ainu do not speak their 
traditional languages at all. 

While knowledge of Japanese is declining among 
the diaspora, there is still extensive contact and 
some movement back to Japan, especially from Latin 
America. The second generation or nisei overseas 
Japanese usually maintain the language, but many of
the third generation or sansei do not. Some of
the youngest overseas Japanese are fourth generation 
or more. Overall, there are about 800,000 ethnic 
Japanese citizens of the United States including over 
250,000 in Hawaii alone, and nearly 900,000 in Latin 
America, with over 700,000 in Brazil, over 80,000 in 
Peru (which had an ethnic Japanese president for 
some time), over 30,000 in Argentina, over 10,000 in 
Bolivia and so on. There are also many Japanese who 
work or study overseas throughout the world, but will 
ultimately return to Japan. 

Japanese is genetically rather closely related to 
Korean, though this is not a popular view in Japan 
or in the Koreas. It may also be more distantly related 
to Manchu-Tungus. Miller (1972) and some other 
scholars suggest a connection with Altaic, though 
even the connection within that putative family is
debatable; the Japanese and Korean link with
Manchu-Tungus is fairly clear, however. Benedict 
(1990) instead proposes a genetic connection with 
Austronesian, but as we have seen, this is not widely 
accepted. The lexical similarities he cites were most 
probably borrowed into Japanese from Austronesian 
languages whose speakers came to southern Japan at 
the time of the Yayoi culture; for example Japanese 
me ‘eye’ from Proto-Austronesian *mata. 

Japanese has had deep and long-term influence 
from Chinese; indeed, the phonology of the various 
Chinese loanword strata in Japanese is one of the 
means for dating various phonological changes 
within Chinese. Prior to 1945, formal written 
Japanese for government, religion and similar 
high-prestige purposes was normally in a diglossic 
literary High Sino-Japanese style approximating to 
classical Chinese. From the eighth century AD
onwards, popular literature started to be written in a 
style more like spoken Japanese. This popular style 
spread and was adopted more widely in the late nine- 
teenth century in some more formal contexts such as 
newspapers, but diglossia continued up to 1945 when 
the post-war occupation government stopped the use 
of the literary High Sino-Japanese style.

The modern Japanese writing system combines 
two subsystems: a logographic system borrowed from 
Chinese and a syllabic system with two subvarieties. 
Until 1945, a very large number of Chinese characters 
or kanji were in use, but the number was restricted
to 1,850 in 1946. The Japanese kanji are mostly written
like traditional Chinese characters, but a few are
simplified. From the eighth century AD, the various
kana or syllabic systems developed, with each kana 
symbol derived from a cursive version of a Chinese 
character. Modern Japanese writing uses two sets of 
kana: one is hiragana for Japanese words and gram- 
matical morphemes, and the other is katakana for 
borrowed words and onomatopoeia. The katakana 
are somewhat further simplified from the hiragana. 
When reading Japanese kanji, various readings are 
possible: a purely Japanese reading or some Sino- 
Japanese reading. For example, the Chinese character 
for ‘mountain’, shan in Mandarin, is used in Japanese 
kanji with the Japanese reading yama and the Sino- 
Japanese reading san. For some kanji, there are two 
or more different Sino-Japanese readings. Written 
Japanese does not indicate whether a particular kanji
should be read with its Japanese reading or one of its
Sino-Japanese readings. It is thus sometimes not possible to
know how names should be read. 

Japanese is a verb-final language, with SOV 
order. Modifiers precede heads, so the possessor 
precedes the possessed, relative clauses precede their 
head noun, and subordinate clauses precede the main 
clause; marking is on dependents, not on heads. 
When counting nouns, a classifier and genitive
marker follow the numeral, and the whole precedes
the head noun, as in ni-hiki no inu ‘two dogs’ (two- 
head Gen dog). Apart from the presence of the 
genitive marker, this is parallel to the Chinese con- 
struction. Japanese is a zero anaphora language; that 
is, a pronoun is not obligatory where the nominal 
referent is clear from the context. There is very exten- 
sive morphology, mainly suffixed. Noun case and 
topic markers follow the noun, and various verbal 
markers follow the verb stem. These include deriv- 
ational markers, aspect, tense and mood, in this order. 

There are very extensive politeness and honorific 
structures, also known as keigo. There are normal 
forms such as the copula da and unmarked verbs as 
opposed to general polite forms, also known as desu/ 
masu from the polite copula desu and the polite verb 
suffix -masu. There are also honorific forms that 
express respect for referents, such as the honorific 
copula gozaru, which is normally combined with the 
polite suffix to give gozaimasu. Honorific forms 
include many verbs and nouns; nouns can also be 
made honorific by the addition of the honorific pre- 
fixes o- or go-. In general, polite and honorific forms 
are longer than the corresponding normal forms: iku 
‘go’, ikimasu ‘go (polite)’, irassyaru ‘go (honorific)’. 
There are also humble forms, which are used of 
the speaker. In general, females use keigo more than
males. 

Standard Japanese has a relatively small consonant
and vowel inventory, with 14 consonants p t k b d
g m n s z h r j (commonly transliterated y) and w, and
five vowels i e a o u. Some of the consonants have
different allophones, depending on the following 
vowel. A syllable-initial consonant can be followed by 
a palatal approximant j. Syllables can be short or long; 
long syllables consist of a long vowel or a vowel fol- 
lowed by i, a nasal homorganic with the onset of the 
following syllable, or an obstruent that is fully assimi- 
lated to a following initial obstruent. Apart from the 
medial -j- and word-medial homorganic nasals and 
geminated obstruents, there are no consonant clus- 
ters; there are also no syllable-final consonants. In 
addition to other conditioned allophones, there is 
intervocalic voicing or rendaku of t k s h, such as k > g
seen in hiragana. These processes also apply to Sino- 
Japanese words such as ‘mountain’ san > zan. There 
are considerable regional and social dialect differ- 
ences in phonology; for example, the high back vowel 
is pronounced unrounded in Tokyo and standard 
Japanese, but rounded in Kyoto. A social example is 
the non-prestige Tokyo replacement of intervocalic g 
by a velar nasal. 

Japanese has a pitch-accent system with very 
substantial dialect differences. For verbs, there are two 
pitch-accent categories, for one-syllable nouns there 
are three, for two-syllable nouns there are four, for 
three-syllable nouns including suffixes there are five, 
and so on. In standard Tokyo Japanese, for example, 
hashi ‘chopsticks’ is HL, while hashi ‘bridge’ is LH
and hashi ‘edge’ is also LH, where H indicates high
pitch and L indicates low pitch. However, the Kyoto 
forms are LH, HL and HH respectively. 


5.4.1 Japanese dialects and Ryukyuan 


Japanese linguists have done very extensive and 
detailed dialect surveys. The main dialect boundary 
within Japanese is between northern and southern, 
with the northern or Tokyo-type dialect spoken from 
Nagoya north-eastward, and southern or Kyoto-type 
dialects spoken to the south-west of this area. There 
are many phonological, morphological and lexical 
differences, as briefly summarized in Shibatani 
(1990). For a more detailed overview and map, see 
Uwano (1981). Standard Japanese is based on the 
Tokyo dialect, which is a northern variety. The differ- 
ences are found in many highly frequent and basic 
lexical items and morphemes, such as the informal 
copula, northern da and southern zya/ja. The most 
stereotyped difference is in the pitch values of the 
pitch-accent system. As we have seen for hashi, in an 
area around Kyoto and also on much of Shikoku 
Island, the pitch values are reversed when compared 
to the rest of Japanese, including northern and the 
other southern dialects. This is stereotyped, as is the 
greater range of pitch values used within the pitch- 
accent system in southern Kyushu (Kagoshima and 
the surrounding area). The origin and development 
of the pitch-accent system is one of the main contro- 
versies among Japanese linguists. 

Most linguists in Japan consider Ryukyuan (as 
spoken in the Ryukyu Islands between Japan and 
Taiwan) as a distinct language, and there are minutely 
detailed surveys and maps of the Ryukyuan dialects 
(Nakasone et al 1981). The Tokara Islands, south of 
Kyushu, are on the Japanese side of the dialect 
boundary, and the Amami Islands are the northern- 
most Ryukyuan-speaking area. Nearly all Ryukyuan 
speakers can also speak standard Japanese, and are 
enumerated as Japanese in census statistics. 

The main dialect boundary within Ryukyuan is 
between the Amami-Okinawan dialect to the north, 
with about 1.4 million speakers and eight main subdialects,
and the Sakishima dialect to the south, with
about 250,000 speakers of the Miyako, Yaeyama and
Yonaguni subdialects. The main island, Okinawa, is
at the southern end of the Amami-Okinawan area
and has two main subdialects, northern or Kunigami 
and southern. One typical isogloss between northern
and southern Ryukyuan is that northern Ryukyuan
preserves proto-Japanese *p as p but changes
proto-Japanese *k to h, while southern Ryukyuan preserves
proto-Japanese *k as k but changes proto-Japanese *p
to h, as also seen in most of the rest of Japanese, for
instance ‘flower’ pana in northern Ryukyuan, but 
hana in Japanese and southern Ryukyuan. 


5.5 Korean 


Korean is spoken by the populations of the Republic 
of Korea (South Korea) (48.4 million in 2006) and the 
Democratic People’s Republic of Korea (North 
Korea) (21.4 million in 1990). Korean is also spoken 
by most of the Korean national minority of China 
with 1,923,842 people in 2000 in various areas of 
north-eastern China, a substantial minority of about 
1.6 million in Japan with over half able to speak 
Korean, a small minority in far eastern Russia, a larger 
group in various Central Asian republics totalling 
over 400,000 people but only just over half still able to 
speak Korean, and a very large overseas Korean com- 
munity, especially in the United States but also in 
many other countries. In the USA, the Koreans are 
concentrated in southern California and the New 
York metropolitan area, in suburbs such as Fort Lee, 
NJ. A world-wide estimate is 75 million speakers and 
over a million additional ethnic Koreans. 

There are substantial lexical differences between 
North and South Korea, which have developed since 
1945. Genetically, Korean is most closely related to 
Japanese and perhaps more distantly to the Manchu- 
Tungus languages. 

Koreans are also Japan’s largest ethnic minority. 
The Korean community in Japan is mainly the result 
of Japanese rule in Korea from 1910 to 1945, when 
large numbers of Korean workers were brought to 
Japan, mainly to work in cities such as Osaka, Tokyo, 
Kobe and so forth. Many remain concentrated in 
Korean neighbourhoods in these and other cities. 
About half of the 1.6 million Koreans in Japan have 
assimilated and are hard to enumerate, but nearly half 
are registered as Korean (and so do not have Japanese 
citizenship), maintain Korean identity, the Korean 
language and even some contact with one or other of 
the two Koreas. Some live in Korean areas of major 
Japanese cities such as Tokyo and Osaka, but others 
are scattered. About a quarter of Korean-registered 
children go to Korean schools, mostly with a North 
Korean allegiance; but many other Korean-registered 
children of the second and third generation do not 
speak Korean. Conversely, in Korea elderly people 
educated before 1945 speak Japanese. 

As in Japan and Vietnam, writing in Korea 
started with the use of classical Chinese, with the 
earliest texts dating from the twelfth century AD.
Chinese characters were later used to represent
Korean phonetically, initially to transcribe hyangga
poetry, the earliest of which may date from the
Southern Silla dynasty (AD 886-918), and for some
earlier Korean place names. Korean continued to be
written entirely with Chinese characters until AD 1446,
when King Sejong devised the han'gul phonetic system.
This uses 28 elements (later reduced to 24), which 
represent the segments of Korean. These are com- 
bined into squares the size and shape of a Chinese 
character, representing a syllable. However, classical 
Chinese continued to be the main formal written 
language until the Japanese occupation. Even when 
writing Korean using han'gul, it remained normal to 
write Chinese loanwords using Chinese characters, 
known as hanja in Korean. In North Korea, these 
hanja were officially abolished in 1949, but have been 
partly reintroduced in education and technical
publishing. In South Korea, hanja continue in full use 
alongside han'gul. 
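The way han'gul elements are combined into syllable-sized blocks can be illustrated with the modern Unicode encoding. Unicode is not discussed in this atlas and is brought in here purely as an illustration. In that encoding each precomposed syllable is computed from the positions of an initial consonant, a vowel and an optional final consonant. The Python sketch below assumes that arithmetic; the function name compose and the three lookup tables are hypothetical names chosen for this example. The tables list doubled and compound letters separately, so they are longer than the 24 basic elements.

```python
# Illustrative sketch: build a han'gul syllable block from its elements
# using the Unicode arithmetic for precomposed syllables (base U+AC00).
INITIALS = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"        # 19 initial consonants
VOWELS = "ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ"      # 21 vowels
FINALS = ["", "ㄱ", "ㄲ", "ㄳ", "ㄴ", "ㄵ", "ㄶ", "ㄷ", "ㄹ", "ㄺ",
          "ㄻ", "ㄼ", "ㄽ", "ㄾ", "ㄿ", "ㅀ", "ㅁ", "ㅂ", "ㅄ", "ㅅ",
          "ㅆ", "ㅇ", "ㅈ", "ㅊ", "ㅋ", "ㅌ", "ㅍ", "ㅎ"]  # 27 finals plus "none"


def compose(initial, vowel, final=""):
    """Return the single character for one syllable block."""
    i = INITIALS.index(initial)
    v = VOWELS.index(vowel)
    f = FINALS.index(final)
    return chr(0xAC00 + (i * len(VOWELS) + v) * len(FINALS) + f)


# h + a + n, then k + u + l: the two blocks of the word han'gul.
word = compose("ㅎ", "ㅏ", "ㄴ") + compose("ㄱ", "ㅡ", "ㄹ")
print(word)
```

Each call returns one character, matching the description above of segments grouped into a square that represents a syllable.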

Korean is a verb-final language with extensive 
suffixation; the usual order is SOV. Modifiers precede 
heads, so possessor precedes possessed, relative and 
other attributive clauses precede the head noun, and 
subordinate clauses precede the main clause. Noun
case or topic is marked by suffixes. Numerals are followed
by a classifier, as in Chinese; plural marking is
obligatory on definite human NPs (noun phrases),
and may also be used elsewhere. Verbs can be marked
by suffixes for derivational categories such as valency,
and for aspect, tense and mood, in this order, as in Japanese.
Also like Japanese, Korean is a zero anaphora lan- 
guage; that is, a pronoun is not obligatory where the 
context provides the referent. 

Like Japanese, Korean has an extensive politeness 
and honorific system using distinct stems and suffixes 
to indicate status relationships between speaker 
and addressee and between speaker and referent. For 
example, ha-da ‘do (plain)’ (do-PLAIN.INDIC) is
the informal form, ha-mnida ‘do (formal)’
(do-FML.INDIC) is the polite form, and ha-si-mnida ‘do
(honorific-formal)’ (do-SUBJ.HON-FML.INDIC) is the
honorific polite form, indicating respect for the 
referent (in this case, the subject of the verb). 

The phonology of Korean has a typologically 
unusual three-way stop manner contrast in syllable- 
initial position, with voiceless unaspirated p t c k, 
voiceless tense and weakly aspirated pp tt cc kk and
voiceless strongly aspirated p' t' c' k'; this manner
contrast is absent in syllable-final position, where
only p t c k occur. There are also aspirated s, tense ss as
well as r h m n and the glides w and j (commonly 
transliterated y). The distribution of w and j is 
restricted according to the lip and tongue positions of 
the following vowel. 

Normatively, there are ten vowel phonemes: 
three front unrounded vowels i e ae [i e ɛ]; three
central/back unrounded vowels, high ɨ [ɯ], mid ə
(schwa) and a; two back rounded vowels u o; and two
front rounded vowels ui [y] and oe [ø]. The front
rounded vowels are now usually merged with the 
front unrounded or back unrounded vowels, giving 
an eight-vowel system. Some dialects have a pitch- 
accent system, while others have a corresponding 
vowel-length system. The Seoul dialect had a vowel- 
length distinction, but is losing it. 

Syllables combine consonants and vowels in a 
(C)V(C) structure, that is, CVC, CV, VC or V. 
When syllables are combined within a word, various 
assimilation processes affect consonant sequences, 
final consonants may be re-analysed as the initial of a 
following vowel-initial syllable, and various suffixes 
have allomorphs with initial vowels when attached to 
a stem with a final consonant, but allomorphs with- 
out these vowels when attached to a stem with a final 
vowel. The effect of these processes is to maximize 
CV structure. 
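The re-analysis of a final consonant as the initial of a following vowel-initial syllable can be sketched schematically. The Python function below is a toy illustration only: it works on abstract (onset, vowel, coda) tuples rather than real Korean data, the name resyllabify is hypothetical, and it ignores the assimilation processes and suffix allomorphy mentioned above.

```python
def resyllabify(syllables):
    """Move a coda to the onset of a following onsetless syllable.

    Each syllable is an (onset, vowel, coda) tuple of strings;
    an empty string means the position is unfilled.
    """
    result = [list(s) for s in syllables]
    for current, following in zip(result, result[1:]):
        if current[2] and not following[0]:
            following[0] = current[2]  # the coda becomes the next onset
            current[2] = ""
    return [tuple(s) for s in result]


# Schematic CVC + V sequence: the output has CV + CV structure.
print(resyllabify([("C1", "V1", "C2"), ("", "V2", "")]))
```

A coda is left in place when the next syllable already has an initial consonant, so only vowel-initial syllables trigger the shift.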


5.6 Ainu 


Ainu is a language isolate on the verge of extinction, 
with only a few fluent speakers left on Hokkaido, the 
northernmost island of Japan. Official census stat- 
istics give a total of 23,830 ethnic Ainu, but this is a 
major underenumeration, as many Ainu hide their 
ethnicity because of extreme prejudice among the 
majority Japanese population. A better estimate 
would be 100,000 or more people of Ainu descent, 
though not all these identify as Ainu and many have 
intermarried with the Japanese. Ainu was formerly 
spoken throughout Hokkaido, northern Honshu (the 
main island of Japan) and on southern Sakhalin and 
the Kurile Islands, and Ainu place names are found 
throughout this area. Japanese authorities started 
moving Ainu from the Kuriles and Sakhalin to 
Hokkaido gradually from at least 1876 onwards. 
Hattori and Chiri conducted a survey in the mid- 
1950s that managed to collect some lexical data on 
thirteen Hokkaido dialects and three Sakhalin dia- 
lects, mainly from the last speakers (Hattori and Chiri 
1981). The last remaining Ainu in Sakhalin and the 
Kuriles were moved in 1945 before Russia took these 
areas over, but these dialects of Ainu are now extinct; 
the last speaker died in 1994. Kirikae (1997) traces all 
available demographic information on the Ainu. 

The dialects that are still spoken, albeit very 
tenuously, are in south-western Hokkaido, notably
the Saru variety of the area near Biratori town with 
39 per cent of the officially acknowledged Ainu popu- 
lation, and the variety of the area of Shizunai, Chitose 
and Horobetsu where another 31 per cent of the 
enumerated Ainu live. There are about ten fluent 
speakers, all over 70, and a larger number of semi- 
speakers, some slightly younger. Over the last twenty 
years, and especially since a 1997 law giving the 
Ainu improved cultural rights, revival efforts have 
been under way using a modified Japanese katakana 
syllabary to teach Ainu to children and adults, but the 
numbers remain small. 

During the late nineteenth century and the first 
half of the twentieth century, rich collections of trad- 
itional oral literature as well as various folklore and 
other texts in more colloquial language were recorded 
by Japanese and other scholars. Much of this has now 
been republished by the Endangered Languages of the 
Pacific Rim project in Japan. 

The most accessible descriptions of Ainu are in 
Shibatani (1990) and Tamura (2000). Ainu is verb- 
final. There is very substantial morphology, includ- 
ing prefixes and suffixes; Ainu makes extensive use of 
inflectional prefixes, unlike Japanese. For example, 
there is prefixed person and number agreement mark- 
ing for subject and object on verbs. Conversely, there 
is no tense marking on Ainu verbs, unlike Japanese, 
which marks tense with suffixes. As in Japanese, noun 
cases are marked by suffixes. 

Most dialects have a pitch-accent system, a 
binary distinction similar to that of Japanese. The 
most striking difference from Japanese syllable structure
is that Ainu has syllable-final consonants, including
p, t, k, s, r, m, n, w and j. The extinct Sakhalin and Kurile
dialects had consonant inventories that differed slightly
from that of Hokkaido Ainu and were less like that of
Japanese, and a vowel-length contrast, which is absent
from Hokkaido Ainu.


5.7 Turkic 


Turkic languages are agglutinative with extensive 
suffixed morphology. They are verb-final and show 
other typical characteristics of verb-final languages. 
There is extensive vowel harmony, which determines 
the vowel characteristics of the suffixes according to 
the vowels of the word stem: front or back, rounded 
or unrounded. 
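
As a rough illustration of how vowel harmony selects a suffix vowel, the following Python sketch uses Turkish, a Turkic language outside the area covered here, chosen only because its harmony pattern is widely documented. The function names and the convention of writing A for a low suffix vowel and I for a high suffix vowel are assumptions of this sketch, and it simplifies by harmonizing every suffix vowel with the last vowel of the stem.

```python
# Turkish vowels classified by the two harmony features named in the text.
FRONT = set("eiöü")
ROUNDED = set("oöuü")
VOWELS = set("aıoueiöü")


def last_vowel(stem):
    """Return the last vowel of the stem, which governs harmony."""
    for ch in reversed(stem):
        if ch in VOWELS:
            return ch
    raise ValueError(f"no vowel in stem: {stem!r}")


def harmonize(stem, suffix):
    """Attach a suffix template to a lower-case stem.

    'A' stands for a low vowel (a or e, front/back harmony only);
    'I' stands for a high vowel (ı, i, u or ü, front/back and rounding).
    """
    vowel = last_vowel(stem)
    front, rounded = vowel in FRONT, vowel in ROUNDED
    low = "e" if front else "a"
    high = {
        (False, False): "ı",
        (False, True): "u",
        (True, False): "i",
        (True, True): "ü",
    }[(front, rounded)]
    return stem + suffix.replace("A", low).replace("I", high)


if __name__ == "__main__":
    for stem in ["ev", "kitap", "göz", "okul"]:
        # plural -lAr and the 'with, having' suffix -lI
        print(stem, harmonize(stem, "lAr"), harmonize(stem, "lI"))
```

The low-vowel suffix varies only between front and back, while the high-vowel suffix also agrees in rounding, matching the two dimensions mentioned above.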

The eight Turkic languages of China and 
Mongolia are from three of the five major subgroups 
of Turkic: Western or Kipchak (Kazakh, Kirghiz and 
Tatar), Eastern or Karluk (Uighur, Uzbek, Salar and 
Western Yugur) and Northern or Eastern Hunnish 
(Tuvan or Urianhai). Several of these languages are 
more widely spoken elsewhere: Kazakh, Uzbek and 
Kirghiz are the national languages of independent 
Central Asian nations. There are several varieties of 
Tatar in widely dispersed parts of Russia, from the 
Crimea to the far east. Tuvan is spoken mainly in 
the republic of Tuva in Russia, north of western 
Mongolia, but also in some border areas of north 
central Mongolia and far northern Xinjiang in China. 
All eight Turkic languages of the area are spoken in 
China, mainly in Xinjiang Uighur Autonomous 
Region but also in parts of Gansu and Qinghai 
provinces. Kazakh and Uighur are also spoken in far 
north-western Mongolia, and Tuvan in north central 
Mongolia. The Turkic and other languages are 
extremely widely dispersed and intermingled, reflect- 
ing the complex history of western China and its 
status as a major trade route. In Xinjiang, there are 
also substantial populations of Mongol, Xibo, and 
Tajik, smaller numbers of Russians with Chinese citi- 
zenship, substantial numbers of Hui (Chinese Mus- 
lims) and very large and growing numbers of Han 
Chinese. 

Uighur, also Romanized as Uyghur, is the major 
language of Xinjiang, and is widely spoken as a lingua 
franca by speakers of other languages in the area, 
Turkic and non-Turkic. It has nearly nine million
first-language speakers, including about 8.4 million in
China, some 300,000 in Kazakhstan, about 90,000 in 
Kyrgyzstan and Uzbekistan, about 3,000 in Afghani- 
stan, 1,000 in Mongolia, and small immigrant com- 
munities in Turkey, Australia and Germany (Dwyer 
2005: 12). Some urbanized members of other Turkic 
nationalities in China, such as the Tatar, speak mainly 
Uighur instead of or in addition to their traditional 
languages. Included within the Uighur nationality 
(8,399,393 in 2000) are some other smaller groups, 
including the Aynu, also known as Abdal Uighur, who 
speak a mixed language with mainly Iranian basic 
lexicon and mainly Uighur structure (Lee-Smith 
1996a, Ladstätter and Tietze 1994). Chinese linguists
divide Uighur into three dialects: the Central dialect, 
with over seven million widely scattered speakers, 
especially across central Xinjiang from east to west; 
the Khotan dialect of southern Xinjiang, spoken 
mainly in Hotan Prefecture, and in Qiemo (Qargan) 
and Ruoqiang (Qarkilik) counties, by over 1.6 million
speakers; and the Lopnur dialect of Yuli (Lopnur) and
northern Ruoqiang (Qarkilik) counties in south central
Xinjiang with about 30,000 speakers.

The Kazakh nationality had 1,250,458 people in 
China in 2000; this is of course in addition to a 
much larger number in Kazakhstan and surrounding 
countries further west and a smaller number in 
north-western Mongolia. In Mongol they are known
as Hasag; in Mongolia they comprise about 5 per cent
of the total population, or over 130,000 people. About
two-thirds of these are concentrated in north-western
Bayan-Ölgiy Province, where they are 90 per cent of
the total population, with fewer in nearby Khovd 
Province and the rest scattered across Mongolia, 
including urban areas further east. Chinese linguists 
divide the Kazakh language of China into two dialect 
groups, north-eastern with most of the total in 
various areas of northern and eastern Xinjiang and 
into Qinghai, Gansu and Mongolia, and south- 
western with about 100,000 speakers of the Alban 
group in Qapqal, Tekes and Gongliu (Tokkuztara) 
counties of western central Xinjiang. The Kazakhs 
of Mongolia and to a lesser extent the Kazakhs of 
China have extensive contact with the Kazakhs of 
Kazakhstan, especially since the early 1990s; for more 
on the Kazakhs of Mongolia, see Section 5.9. 

The Kirghiz, with 160,823 people in 2000, are the 
third largest Turkic group in China, with far more 
in Kyrgyzstan and elsewhere further west. Chinese 
linguists divide the Kirghiz language in China into 
northern and southern dialects, north and south 
of the Kizil River in south-western Xinjiang, with 
about equal numbers of speakers. They are concen- 
trated in south-western Xinjiang, especially in the 
Kizilsu Kirghiz Autonomous Prefecture bordering on 
Kyrgyzstan. In Kizilsu, where nearly 80 per cent of 
China’s Kirghiz live, they are the majority. Another 6 
per cent are nearby in various areas of western Aksu 
Prefecture, just east of Kizilsu, and nearly 10 per cent 
are in and around Yining, further north. Over 
1,000 Kirghiz are found in Fuyu County in western
Heilongjiang Province in north-eastern China, but 
they no longer speak Kirghiz; they now speak 
Mongolic languages and Chinese. 

The fourth Turkic nationality by population in 
China is the Salar with 104,503 people in China in 
2000; this group is only found in China. Nearly 70 per 
cent of the Salar live in Xunhua Salar Autonomous 
County in eastern Qinghai and over 11 per cent in 
nearby Hualong Hui Autonomous County. Others are 
scattered in nearby areas, notably over 6 per cent in 
Jishishan Bao'an, Dongxiang and Salar Autonomous 
County in Gansu Province and over 4 per cent across 
Xinjiang, including nearly 2 per cent in and around 
Yining town. They are mostly followers of Tibetan 
Buddhism. 

The 12,370 Uzbek in China are a tiny minority 
compared to the number in Uzbekistan and sur- 
rounding nations further west. Most Uzbek in China 
live in Urumqi, Kashgar and Yining towns. There are
also a few Uzbek in Mongolia. Tatar is also a very 
small group in China, with 4,890 people in 2000, 
many speaking Uighur rather than Tatar and most
living in Kashgar and Yining towns. The few Tuvan
speakers in north-eastern Ili Prefecture of Xinjiang in
China are not recognized as a separate nationality 
there; however, Tuvan or Urianhai is a recognized 
nationality of Russia and of Mongolia. Few of the 
estimated 21,000 Tuvans in Mongolia speak Tuvan, 
but some in scattered areas of the north-west of 
Mongolia still do. 

The Western Yugur or Saryg Yugur, also known 
as Yellow Uighur, live in north-western Gansu 
Province in China, with about 7,000 group members 
(out of the total Yugur nationality population of 
13,719), but only some 4,600 are speakers of their 
endangered language. They are traditionally followers 
of Tibetan Buddhism. 

Various versions of Sogdian vertical scripts were 
in use among some Uighurs in China, as for Mongol; 
however the most widespread orthographies for all 
the Turkic languages of Muslim groups in this area 
were based on Arabic. The pan-Turkic Romanization 
introduced in the Soviet Union in the late 1920s did 
not spread into China, but some Cyrillic scripts 
developed from the late 1930s came into use in China 
from the mid-1950s. However, the most popular 
scripts for Turkic languages in China are those based 
on Arabic. These were not in official use between the 
mid-1950s and the late 1970s, but have now come 
back into use in different versions from the pre-1950s 
Arabic scripts. Unlike in the former Soviet states of
Central Asia, there is no movement towards Romanization.

For Uighur, for example, Cyrillic script was in use 
from 1955 to 1958, Arabic script was revived in 1979, 
made official in 1982 and 1987, and reformed some- 
what in the early 1990s (Dwyer 2005: 19). The current 
situation is that Uighur, Kirghiz and Kazakh are 
written in China using Arabic-based scripts. In 
Mongolia, Kazakh is written either with the Arabic 
script used in China or with the Cyrillic script used in 
Kazakhstan. Some Salar and Western Yugur read and 
to some extent write literary Tibetan for liturgical 
purposes. While Uzbek, Tatar and Tuvan are literary 
languages outside China, none of the other Turkic 
languages is officially used for writing in China. 
Within China from 1958 to the mid-1970s, Turkic 
languages were rarely used in written form. Chinese is 
the national language and is widely spoken and writ- 
ten by Chinese-educated Turkic language speakers in 
China, especially younger people. 

Education in Xinjiang and minority-inhabited 
areas of Gansu and Qinghai provinces is under the 
control of local autonomous authorities, and can in 
principle be delivered in Uighur or — in autonomous 
areas of Kazakh, Kirghiz, Mongol, Tibetan, Xibo and 
Tajik and so on — in another appropriate language. 
For the Turkic nationalities, these areas include 
Kizilsu Kirghiz Autonomous Prefecture in the south- 
west, Ili Kazakh Autonomous Prefecture in the north 
and Mori, Barkol and Aksay Kazakh autonomous 
counties in the east and in north-western Gansu 
Province. In south-western Qinghai, Haixi is a mixed 
autonomous prefecture for Mongol, Tibetan and 
Kazakh. In the Salar autonomous county of Xunhua 
in eastern Qinghai, local authorities have chosen not 
to use Salar in education or to have a written form of 
Salar. The official policy for minorities throughout 
China is to have transitional bilingual education 
in the first three years of primary school; however, 
local authorities in many Turkic and other areas are 
increasingly choosing to have all education in 
standard Chinese. 

In addition to increasing knowledge of Chinese, 
speakers of Turkic languages in China are often able 
to speak other Turkic languages. This includes most of 
the Tatar and Uzbek who also speak Uighur, a wide- 
spread lingua franca in towns across central Xinjiang. 
Some Salar and Western Yugur have some knowledge 
of liturgical Tibetan as well as Chinese, and some 
Western Yugur also can speak Mongolic Eastern 
Yugur. Members of other groups living in mainly 
Kazakh areas may also speak Kazakh. There is less 
widespread knowledge of other Turkic languages; for 
example, the Salar in Xinjiang can also speak some 
Uighur or Kazakh, depending on where they live. 

Table 5.9 Turkic speakers in China and Mongolia

Languages                          Number of speakers
Western/Kipchak Group
  Kazakh                                    1,380,000
  Kirghiz                                     160,000
  Tatar                                         1,000
Eastern/Karluk Group
  Uighur                                    8,400,000
  Uzbek                                        12,000
  Salar                                       105,000
  Western Yugur                                 4,600
Northern/Eastern Hunnish Group
  Tuvan                                         5,000


5.8 Manchu-Tungus 


Depending on how one categorizes them, there are up 
to eighteen Manchu-Tungus languages, of which up to 
eight are spoken in China and one is extinct. The clas- 
sification of these languages is somewhat contro- 
versial; for two recent views, see Janhunen (2005) and 
Whaley et al. (1999). The conventional view is that
the basic split is into two branches: 1) Northern 
(also known as North-western or Ewenki) Tungus, 
including the languages of the Oroqen and Ewenke 
nationalities in China, along with five others in 
eastern Siberia, as opposed to 2) Southern with two 
sub-branches: A) Central Tungus with two further 
sub-branches, Udeghe and Nanai, and B) Manchu. 
Nanai languages are spoken in China as well as 
Siberia, and three Manchu languages are or were 
solely spoken in China. 

Janhunen (2005) provides current information 
for the Manchu-Tungus languages of China, and pro- 
poses a new classification in which Northern and 
Udeghe are linked in Northern Tungus, as opposed to 
Nanai and Manchu in Southern Tungus. Janhunen 
(2005: 38) says that the Northern Tungus branch 
could be considered as one language with a dialect 
chain showing large internal differences. Whaley et al.
(1999) also discuss the differences among Northern
Tungus languages spoken in China by members of the
Oroqen and Ewenke nationalities, and indicate that
due to extensive contact it is difficult to determine an 
exact genetic classification within Northern Tungus, 
but that more than one language is involved. 

Of the three languages of the Manchu branch, 
the extinct Jurchen language of the Jin dynasty of 
China (AD 1115-1234) was a precursor of modern
Manchu. Manchu was the official language of the
Qing dynasty of China (AD 1644-1911). The Xibo
language is also a variety of Manchu, now spoken
only by the descendants of a Manchu army sent to
Xinjiang in western China in AD 1764. Since 1599,
Manchu, like Mongol and formerly Uighur, has been 
written with a vertical script adopted from Mongol 
but ultimately derived from Sogdian; Manchu build- 
ing names can still be seen above the doors of the 
Forbidden City palace in central Beijing, alongside 
Chinese. Perhaps due to its high socio-political status 
and extensive contact with Chinese and other lan- 
guages, the Manchu branch is lexically and structur- 
ally aberrant within Manchu-Tungus. The population 
of Manchu (officially known as Man) in China in 
2000 was 10,682,262, but only about twenty people 
aged over 50 in remote areas can still speak the lan- 
guage fluently; they and a few semispeakers are in 
three villages in north-western Heilongjiang Province, 
Siji and Dawujiazi villages, near Heihe City, and 
Sanjiazi village, Fuyu County, just north of Qiqihar. 
They are outnumbered by the scholars, of Manchu 
and Han nationality, who do research on Manchu. In 
fact, much of the indigenous population of Beijing 
and to a lesser extent other cities in northern China 
has some Manchu ancestry, and a near-150 per cent 
population increase for the Manchu nationality 
between 1982 and 2000 is due to the reclaiming of 
this identity over that period. 


Xibo (also transcribed Xibe, Sibo or Sibe) was a 
nationality of 188,824 people in 2000. However, only 
18 per cent of them live in Xinjiang, where the lan- 
guage is still spoken; most live in Liaoning and other 
north-eastern provinces, and have no knowledge 
of the language. The area where Xibo is spoken 
is Qapqal Xibo Autonomous County in far western 
Xinjiang, with about 11 per cent of the Xibo total in 
eight settlements originally based on military units; 
there may be as many as 20,000 speakers of the 27,000 
or so Xibo in Xinjiang, but the number is declining 
and many younger people are non-speakers (Stary
2003). The Xibo of Xinjiang have continued to use 
literary Manchu, written in a slightly reformed 
version of the vertical Sogdian script, taught in 
schools before 1958 and after 1978, and with exten- 
sive local publications from the 1930s to today. The 
Xibo language is said not to be mutually intelligible 
with the current spoken varieties of Manchu, but is 
quite similar to literary Manchu. 

Hezhe is the official Chinese name of the nation- 
ality known as Nanai (Nanay) in Russia, and formerly 
also Goldi. Another name sometimes seen in China is 
Hezhen. While more numerous in Siberia, in 2000 
there were 4,640 Nanai in China. They are con- 
centrated in Heilongjiang Province, especially in the 
north-eastern corner along the Amur River in 
Tongjiang, Raohe and Fuyuan counties. Few of the 
Nanai in China can speak the language; there are 
about forty elderly speakers, all over 60, about half of 
whom speak Nanai proper and half speak Kilen, in 
remote villages in two areas; these two languages are 
not separated on Map 45. In addition, there is a small 
population of a couple of thousand Qiakala in eastern 
Heilongjiang and Jilin provinces, whose ancestors 
spoke Udeghe but who now speak only Chinese. 

The Northern Tungus languages of China are 
classified in two nationalities, the Ewenke with 30,505 
people and the Oroqen with 8,196 people in 2000.
There is also a small group in Mongolia classified 
as Khamnigan, which in China is included within 
the Ewenke nationality. The three Manchu-Tungus 
languages included within the Ewenke nationality 
in China are Solon, Khamnigan and Manchurian 
Reindeer Ewenk. The Solon in China live mainly in 
Morindawa Daur Banner and nearby areas; there are 
more Solon in Siberia. Of about 27,000 Solon among 
the Ewenke nationality in China, less than half speak 
the language: speakers are mainly those over 50, 
and most also (or instead) speak Daur Mongol and 
Chinese. The few Khamnigan in China live in north- 
eastern Inner Mongolia and in a small area of eastern 
central Mongolia; those in Mongolia do not speak 
Khamnigan, but some of those in China do. Most 
Khamnigan speak Mongol, and those in China also 
speak Chinese; Hamnigan is the general Mongol term 
for all Northern Tungus groups. The Ewenk language 
has about 1,000 speakers in Aoluguya village of Chen 
Barag Banner of north-western Inner Mongolia 
in China, and far more in Siberia. Map 45 does not 
separate Solon and Ewenk. 

The Oroqen language has under 2,500 speakers
of four dialects (Kumarchen, Orochen, Selpechen and
Birarchen) in north-eastern Inner Mongolia and in
north-western Heilongjiang Province in China. In
Heilongjiang, Kumarchen is spoken in Shibazhen village,
and Selpechen is spoken in Xinsheng village,
both in Heihe City with about 5 per cent of China’s
Oroqen; Birarchen is spoken in Xin Ewenke village in
Xunke County where about 14 per cent of China’s
Oroqen live. Orochen is spoken in the Alihe Oroqen
Autonomous Banner, further west in Inner Mongolia,
where about 40 per cent of China’s Oroqen live.
Kumarchen has the most speakers, including some 
children; Orochen and Selpechen have few if any 
speakers under 50; and Birarchen is even more 
moribund. 

Whaley et al. (1999) provide a brief overview of 
some phonological and morphological characteristics 
of the Manchu-Tungus languages; Vovin (1993) pro- 
vides further details. These languages are verb-final, 
and have a very rich set of nominal case suffixes; 
Whaley et al. (1999: 299) give reconstructions for 18
distinct case markers. There is also extensive suffixed
verb morphology. The lexicon, phonology and 
morphology of Manchu are aberrant within Manchu- 
Tungus, due to contact with Chinese and Mongol. 


Table 5.10 Manchu-Tungus speakers in China

Languages                          Number of speakers
Manchu Group
  Jurchen                                     extinct
  Manchu                                           20
  Xibo                                         20,000
Udeghe-Nanai Group
  Udeghe                                      extinct
  Nanai                                            40
Northern Group
  Ewenk                                         1,000
  Khamnigan                                       200
  Solon                                        13,000
  Oroqen                                        2,500


5.9 Mongolian languages 
Alan Sanders 


5.9.1 Mongolian in Mongolia 


The language of Mongolia is Mongol or Mongolian, a 
language of the Altaic family. The Altaic family has 
three branches: the Mongolic, Turkic (Turko-Tatar) 
and Tungusic (Manchu-Tungus) languages. Vowel 
harmony is a common feature of the Altaic languages; 
the agglutination of suffixes and absence of gender 
are also important indicators. The word order is 
subject-object-verb (SOV). The nominative is the
same as the noun stem. There is no distinction
between substantives and adjectives. Mongolian has
the remnants of inclusive and exclusive forms of the 
first person plural pronoun. Suffixes added to noun 
or verb stems are not interchangeable. Verbs occur as 
finite forms, converbs and verbal nouns. Finite verbs 
are marked for tense, mood and aspect. The impera- 
tive is the same as the verb stem. 

Besides independent Mongolia (the former 
Mongolian People’s Republic and Outer Mongolia), 
Mongolian and its dialects are spoken in Inner 
Mongolia, Xinjiang and other parts of China, and in 
the Buryat and Kalmyk Republics of the Russian 
Federation. The total number of speakers is around 
9.7 million. 

The Mongols are very widely scattered, and great 
confusion has been created by the various transcrip- 
tions of their tribal names, both within the countries 
they inhabit and by the whim of international 
scholars who have studied them. For the Mongols of 
Mongolia, their names are transcribed here from 
Mongol Cyrillic. For the Mongols of Russia, their 
names are transcribed from Russian Cyrillic. For the 
Mongols of China, their names are transcribed in 
their Chinese pinyin forms. Where tribes live across 
international boundaries, these names have been 
cross-referenced. 

Proto-Mongolic is the common ancestor of all 
living and historically attested Mongolic languages, 
according to Janhunen. Ancient Mongol relates to the 
period up to the thirteenth century, before Mongol 
was written. Middle Mongol is the spoken language 
of the thirteenth to fifteenth centuries. The conserva- 
tive ‘old script’ of written Mongol dates to the Middle 
Mongol period, but is not identical with it. Later it 
was one of five official literary languages of the Qing 
state. The old Mongol script retained the features of 
Ancient Mongol even when the Tibetan scriptures 
began to be translated in the classical period (seven- 
teenth to nineteenth centuries). As revised in the 
eighteenth century this remained the literary lan- 
guage until it was displaced by the Mongolian Cyrillic 
alphabet or new script (shine bichig) in the 1940s. It 
retained a great number of features, including archaic 
noun and verb suffixes no longer used in modern 
speech. Thus the classical written and modern spoken
languages differ considerably. For example, ‘seven’ 
is written dologan in the Mongol script while the 
modified Cyrillic script currently in use reflects the 
contemporary spoken language much better: doloon 
(intervocalic g has been lost from the spoken lan- 
guage). With orthographical and morphological 
simplifications, as modern written Mongol script it 
is still in use amongst Mongols in China. 

Mongolic and Turkic languages have many words 
in common; their origins are often obscure and there 
is no proof that similar words are loan words, in 
either direction. During the Yuan period (the Mongol 
Empire), Mongol acquired Chinese words, for ranks 
of the nobility, ‘tea’, etc. With the expansion of 
Tibetan Buddhism it gained much religious termin- 
ology through Tibetan, such as ‘lama’. Manchu 
(Qing) rule from the seventeenth century onwards 
brought in new titles like amban ‘governor’. Later 
neologisms were often coined from Chinese, e.g. galt
tereg ‘railway train’, literally ‘fire cart’, from huo che.
With the increase in Russian influence, especially after 
the Bolshevik revolution, Russian became a more 
important channel, e.g. mashin ‘motor car’, sotsializm 
‘socialism’, and so on. Since the 1990s there has been a
great upsurge in Mongolia of the study of English, 
which has contributed menejer, kompyuter, broker and 
other terms related to modern market and business 
practices. In 2004 the Mongolian parliament resolved 
to make English ‘the second official language’. 


Table 5.11 Mongolia’s nationalities (census of 2000)

Ethnic group           Number    % of total
Khalkha             1,934,674         81.79
Kazakh                102,983          4.35
Dörvöd                 66,706          2.82
Bayad                  50,824          2.15
Buryat                 40,620          1.72
Dariganga              31,909          1.35
Zakhchin               29,766          1.26
Uriankhai              25,183          1.06
Darkhad                19,019          0.80
Ööld                   14,634          0.62
Torguud                12,628          0.53
Khoton                  9,014          0.38
Khotgoid                7,237          0.30
Myangad                 6,082          0.26
Tuvan                   4,778          0.20
Barga                   2,506          0.10
Üzemchin                2,386          0.10
Sartuul                 1,540          0.06
Khamnigan                 565          0.02
Total*              2,365,269        100

Note:
* Mongolian citizens, including other ethnic groups. For ethnic groups
numbering fewer than 500 persons, see text.

Source:
2000 Population and Housing Census: the Main Results, Mongolian
Statistical Office, Ulaanbaatar, July 2001, Annex 1, Table 4.


Generally speaking, all Mongols of Mongolia what- 
ever their ethnicity can speak the national language, 
Khalkha, and read and write it in the Mongol Cyrillic 
script adopted in the 1940s (see section 5.9.1.1 on 
Mongolian scripts). Mongol names are rendered here 
in transliteration from the Mongol Cyrillic spelling.

The Khalkh (Halh) form the largest Mongol
ethnic group, whose mother tongue is the official
language and national literary language of Mongolia.
They are represented in substantial numbers everywhere
except Bayan-Ölgii aimag (only 393 in 1989).
The name Khalkh Tümen emerged in the fourteenth
and fifteenth centuries, and Khalkh Oron ‘Khalkh
country’, Khalkh Uls ‘Khalkh people’ and Khalkh
Ündesten ‘Khalkh nation’ from the sixteenth century.
Later they were separated into North (ar ‘back’) and
South (övör ‘breast’) Khalkh, reflecting the division of
Mongol territory into Outer (ar) and Inner (övör)
Mongolia under Qing (Manchu) rule. The eastern, 
central and western dialects are spoken in independent
Mongolia, while the southern dialect is spoken in
Chinese Inner Mongolia. The Eljigen are Khalkh
living in scattered communities in Uvs and Gobi- 
Altai aimags. 

The Oirad (Oird, Oirats) are a western Mongol 
tribal group whose name means literally ‘forest 
people’. Sources differ over the original composition 
of the so-called Dörvön Tümen ‘Four Oirad’. Some
say the Dörvöd yastan ‘Dörvöd nation’, Khoid ovog
‘Khoid clan’, Khoshid aimag ‘Khoshid tribe’ and Tsors
(Tsoros, Choros) ovog were the original members,
but others include the Barga, Buryat and Tümd
(Tumet), an aimag from Inner Mongolia. After 1487
the Oirad comprised the Ööld, Khoid, Batuud, Barga,
Buryat, Tsoros, Khoshuud and Torguud. The Oirats’
Jungarian (Züüngar) khanate was established in
1674. The armies of the khanate invaded deep into 
Khalkha territory, but finally were destroyed by Qing 
(Manchu) troops in 1757 and the tribes scattered. 
Mongolia’s Oirad speak a western dialect of Mongol 
and used a modified form of the classical Mongol 
(Uighur) script called ‘clear script’, which the Oirads
in China still use. 

Mongolia’s Kazakhs, whose language is Turkic, 
arrived in western Mongolia from Xinjiang in the 
nineteenth century and settled in the Khovd border 
area. In 1912 the Öndör Gegeen agreed to the settlement
of 400 families from China. In the 1940s, territory
was set aside for their Bayan-Ölgii aimag, where
they form the majority population, though this is not 
a ‘national’ province administratively. There are large 
communities of Kazakhs in Ulaanbaatar (Nalaikha) 
and other mining areas of central Mongolia. In the 
early 1990s, thousands of Mongolian Kazakhs were 
allowed to migrate to work in Kazakhstan, but many 
were unable to adjust or find jobs and later returned 
to Bayan-Ölgii. One reason was that their archaic and
Mongolized Kazakh dialect was not understood.
Mongolia’s Kazakhs adopted the modified Cyrillic
alphabet devised for use in Kazakhstan. In 1989 there
were 82,750 Kazakhs in Bayan-Ölgii aimag, 12,814
in Khovd aimag, 3,229 in Selenge aimag and 3,116 in 
Darkhan. 

The Dörvöd (Derbet), a western Mongol ethnic
group, are one of the ‘four Oirad’ and speak the
northern dialect. They now inhabit areas west of Lake
Uvs in Uvs aimag (33,941 in 1989) and the Dörgön
area of Khovd aimag (3,609). ‘Dörvöd aimag’ was a
former name of Uvs aimag.

The Bayad (Bait) are Oirad who speak the 
northern dialect, and apparently nomadized from 
Hulun Buir to the Selenge and Tuul valleys in the 
twelfth century. They are now concentrated in the 
Züüngobi area of Uvs aimag (29,672 there in 1989).

Mongolia’s Buryats (Buriad, Buryaad) inhabit 
the areas of northern and north-eastern Mongolia 
along the border with Russia’s Buryat Republic. Two 
dialects are spoken, eastern — Khori, Aga and Barga 
in Dornod aimag (16,654 in 1989), and Khudir in 
Khentii aimag (7,044); and northern — Tunkh in 
Khövsgöl aimag, Sanaga in Bulgan aimag and
Tsongool in Selenge aimag (992 in 1989). The Tunkh 
and Sanaga Buryats are said to have arrived in 
Mongolia in the early twentieth century, escaping 
recruitment into the Tsarist army. 

Buryat advisers, brought into Mongolia by the 
Soviets at the time of the Mongolian revolution of 
1921, played an important role in the revolutionary 
administration, and later in scholarly research. 
More Buryat herdsmen arrived in the 1930s, fleeing 
Stalinist collectivization. Buryat intellectuals suffered 
persecution during the Mongolian purges of 1937-9. 

The Dariganga are an eastern Mongol ethnic 
group in Sükhbaatar aimag (24,521 in 1989) who
speak a dialect of Khalkh. They are descendants of
Khalkh, Tsakhar and Ööld tribesmen who were
settled in the Dariganga territory (named after Mt 
Dari and Lake Ganga) in 1697 to look after the herds 
of horses and camels of the Qing (Manchu) emperor. 

The Zakhchin are Torguud who were sent by the 
Qing (Manchu) emperor to guard the western border 
(zakh) with the Jungarian khanate (Züün gar khaany
uls). They now live mostly in southern Khovd aimag
(17,228 in 1989), though there were 857 (1989) in
Selenge aimag. The Zakhchin speak a southern dialect
of Oirad (western Mongol). 

The Uriankhai (Altai Uriankhai) are a Mongol 
ethnic group living in Bayan-Ölgii aimag (5,100 in
1989) and Khovd aimag (5,403), which speaks a 
southern dialect of Oirad. There are smaller com- 
munities in Selenge aimag (1,768) and Khentii aimag 
(1,304). They may have come originally from 
Liaotong in north-eastern China. 

The Darkhad (Darhat) are a Mongolized Turkic 
people who formed a tribal league in the thirteenth 
century. In the nineteenth and twentieth centuries 
Khar Darkhad and Ukhaa Darkhad were to be found 
scattered in Töv and Övörkhangai and mostly in
Khövsgöl aimag; the Darkhadyn Khotgor region lies
to the west of Lake Khövsgöl. There are also some
Darkhad (Khar and Ukhaa Darkhad, as well as Jal
Darkhad) in Gobi-Altai aimag, sometimes called
Darkhchuul. The Darkhad were recruited to herd
livestock on the various estates of the Öndör Gegeen,
Mongolia’s Buddhist leader. There were 12,991
Darkhad in Khövsgöl aimag in 1989.

The Ööld (Olyot, Eleuth), a western Mongol
ethnic group, now speaking the northern Oirad dia- 
lect, seem to have lived originally in central Mongolia, 
in Arkhangai aimag (1,427 still there in 1989), but 
mostly moved several hundred years ago to western 
Mongolia (Khovd aimag, 5,622 in 1989). The Ööld in 
Gobi-Altai aimag are returnees from Jungaria. 

The Torguud (Torgut) were part of the Oirad or 
western Mongols. Some nomadized to the Volga in 
the seventeenth century, and some returned in the 
eighteenth century, to Khovd aimag (6,703 in 1989) 
and to China. 

The Khoton are Uighurs from the Ili valley taken 
to Mongolia by the Oirad in the eighteenth century. 
They are concentrated in central areas of Uvs aimag 
(5,007 in 1989) and speak Oirad (western Mongol). 

The Khotgoid (Hotogoyt) are Khalkha-ized 
Oirad from the Lake Uvs area who now inhabit 
southern Khövsgöl aimag and north-eastern Zavkhan 
aimag. 

The Myangad (Mingat) are a western Mongolian 
(Oirad) group dwelling mostly north of Lake Khar Us 
in Myangad district of Khovd aimag (3,517 in 1989). 

The Tannu Uriankhai (Malchin Tuva) form a 
Turkic-speaking ethnic group, related to the Tuva 
(Tyva) in Russia, which may originally have been 
Uighur. They are now concentrated mostly in 
northern Khövsgöl aimag. The Tsaatan are an isolated 
and specialized group, reindeer herdsmen restricted 
to the north-western mountainous and forested 
(taiga) areas of Khövsgöl aimag. They have been 
known variously as Forest Uriankhai or Tagna 
Uriankhai (from Tannu Tuva), Uighur Uriankhai or 
Soyod (Soyots). Their number is limited to a few 
dozen families (303 individuals in 2000). Their 
language is Turkic and similar to Uriankhai Tuvan 
(see below). 

The Barga (Barguud, Bargachuud, Barag), who 
speak a dialect of Buryat, lived east of Lake Baikal and 
in the Barga area of Hulun Buir, Inner Mongolia. 
After the suppression of a revolt against the new 
Republic of China in 1912, many moved to the 
northern and western areas of Mongolia’s Dornod 
aimag, and there were 1,825 in 1989. Some returned 
to China in the 1940s after re-demarcation of the 
border. A couple of dozen living in Töv aimag are 
descendants of monastery serfs. 

The Üzemchin community now living in 
Dornod aimag (1,697 in 1989) and Sükhbaatar aimag 
(253) arrived in 1945 from China, where there are 
several banners. They speak an eastern Mongol 
dialect. 

The Sartuul people were brought from the Iraq–Iran 
region (once called Sartuul) by the Mongols in 
the thirteenth century. They now live in Zavkhan 
aimag and speak a dialect of Khalkha. 

The Khamnigan (Evenki) of Mongolia are a 
Manchu-Tungus people from Siberia, also called 
Zürchid, who became part of Genghis Khan’s empire 
in the twelfth century. They now live in the 
Onon valley of Khentii aimag, Dornod aimag, and 
particularly the Yöröö valley among the Selenge 
Buryats of Töv aimag. Mongolia’s Khamnigan have 
become Mongolized and speak a dialect of Buryat. 
Khamnigan Mongol is dominant and considered to 
be a language, not a dialect of Mongol, while Kham- 
nigan Evenki is a dialect of Evenki no longer spoken 
in Mongolia now. 

The Khoshuud (Hoshut), originally from the 
Hinggan mountains, made their way to Oirad territory 
in the fifteenth century, joining the Dörvöd and 
Torguud. When the Oirad were pressed by the Qing 
in the seventeenth century, some Khoshuud went to 
Kokonor (Qinghai) and others together with some 
Torguud entered Russia and reached the Volga (see 
Kalmyks). The descendants of the Khoshuud who 
eventually returned to Mongolia now live in Khovd 
aimag (183 of them, according to the 2002 census). 

The few Kharchin, Khorchin and Tsakhar 
(Chahar) Mongols living in Mongolia are members 
of the ethnic groups of the same name in China. 
Mongolia’s Uzbeks are related to those living in the 
Republic of Uzbekistan. They made their way into the 
Mongol and Qing empires as traders and caravan 
men. 

The number of Russian and Chinese long-term 
residents who are Mongolian citizens is now very 
small, 158 and 173 respectively. Most have retained 
their mother-tongue, but can communicate in 
Khalkha. 

The aimag population figures for ethnic minority 
people given above are from the 1989 census (see 
Bulag’s book on Mongol ethnicity). The Mongolian 
authorities had not been publishing regional figures, 
perhaps for internal political reasons, to do with the 
consolidation of a Mongol nation based on the 
Khalkh. 


5.9.1.1 Mongolian scripts 


The Mongols acquired their classical script hudam 
mongol bichig during the early period of expansion 
of their empire. After a battle in which the Naiman 
tribe was defeated in 1204, a Uighur scholar named 
Tatatungga surrendered to Temüjin, the future 
Genghis Khan. The Uighurs had adapted the Sogdian 
script, which was Aramaic in origin, and the Uighur 
script was already in use amongst the Naiman. 
Temüjin ordered Tatatungga to teach the young 
Mongol nobles to read and write. The Mongols there- 
fore sometimes call their script uigarjin. 

The script, written in vertical (top-down) 
columns from left to right, has an alphabet of 24 
letters, with variant initial, medial and final forms 
for most. Because initial consonants are modified by 
the following vowel, some scholars treat the script as 
syllabic. Although the classical script is no longer in 
daily use, there is an echo of it in the syllabic abbrevi- 
ation of names: e.g. khu nam from khuvisgalt nam 
‘revolutionary party’, Re bagsh from Renchin bagsh 
‘the scholar Renchin’. The script has five vowels, but 
does not distinguish between o and u or ö and ü. 
The consonants t and d and k and g are written simi- 
larly. Letters are formed from combinations of basic 
brush-strokes called stick, horn, tail, hook, bow, 
tooth, stomach, and so on. 

The language of this period is called Middle 
Mongol. Ancient Mongol was in use before writing 
was introduced, and has been partly reconstructed 
from loan words in languages like Jurchen. The oldest-known 
inscription in classical Mongol (c. AD 1225) 
is sometimes called ‘Genghis’s stone’. It was on a 
stone tablet found towards the end of the nineteenth 
century near Nerchinsk on the Shilka River, a tribu- 
tary of the Amur. The tablet, the original of which is 
kept in St Petersburg, records that Yesünge, who may 
have been a nephew of Genghis Khan, shot an arrow 
a distance of 335 spans (around 580 yards). 

The Secret History of the Mongols, which 
describes the mythical birth of the Mongol nation 
and the conquests of Genghis Khan, records that in 
the Year of the Tiger (AD 1206) Genghis Khan 
ordered his adopted brother Shigiken-qutuqu (a 
Tatar foundling) to judge the people and record his 
judgments on ‘white paper’ in a ‘blue book’. The 
oldest extant version of the Secret History (Mongqol-un 
niucha tobchaan, in modern transcription Nuuts 
tovchoo) is in Mongol phonetic transcription into 
Chinese characters, with interlinear notes, and was 
first identified in Beijing in the late nineteenth cen- 
tury by the Russian Orthodox priest Palladii. The 
Secret History records Genghis Khan’s death and the 
accession of his successor Ögödei (r. 1229-41), but its 
dating and authorship remain uncertain. During the 
twentieth century, Mongolists established that parts 
of the Secret History had been written into other later 
Mongol collections like the Altan Tobchi. 

A new state script was designed for Kublai Khan 
(1215-94) by Phags-pa (hP'ags-pa ‘reverend’) Lama 
Lodoijantsan (1234-79), the Tibetan national pre- 
ceptor in charge of Buddhist affairs under the Yuan. 
The use of the Phags-pa or square script was commanded 
in AD 1269, for writing not only Mongol 
but also Tibetan, Sanskrit, Chinese and the Turkic 
languages spoken in the Mongol Empire. It was based 
on Tibetan letters but written vertically. The un- 
modified consonants represent pa, ba, ra, ma, la, etc., 
and could be modified by writing other vowels 
underneath them. There were eight vowels, two semi- 
vowels and 32 consonants, with some variation. 
From AD 1278, Phags-pa script supplanted classical 
Mongol script (uigarjin) on gerege (paizi, gold and 
silver tablets, which acted as passports for govern- 
ment messengers), and it was used for edicts, inscrip- 
tions and other government and administrative 
purposes, including seals and paper currency. 

The third Dalai Lama, who visited what is now 
Inner Mongolia in 1587, lectured on Buddhist litera- 
ture in Sanskrit and instructed a monk named 
Ayuushi-Gushi to devise for Mongol readers an 
alphabet for the better phonological transcrip- 
tion (galig) of Tibetan and Sanskrit words (e.g., 
bodhisattva) in religious texts. 

The Mongolian classical script was adopted by 
the Manchus, founders of China’s Qing dynasty 
(1644-1911), who earlier had written their language 
with Chinese characters. From AD 1632 the script was 
modified with diacritical marks. 

Another modified version of the Mongolian 
classical script was devised in 1648 by the Oirat 
(western Mongol) Zaya Pandita Namkhaijamts 
(1599-1662). This script, also called the ‘clear script’ 
(tod üseg), became widespread only amongst the 
Oirats and is still used by Oirat tribes in Uvs and 
Khovd aimags (provinces) of Mongolia and in 
Xinjiang, though it was probably intended for use by 
all Mongols. It removed most ambiguities, dis- 
tinguishing n from a, s from sh, and i from y, and 
marked long vowels. The number of letters in the 
‘clear script’ alphabet is disputed: 26 basic letters, or 
46, counting those modified by diacriticals to dis- 
tinguish homographs. 

In AD 1686 another script called the soyombo 
script was created by Zanabazar (1635-1723), the first 
Javzandamba Khutukhtu (Bogd Gegeen, ‘Holy 
Enlightened One’, or Living Buddha), son of the 
hereditary Tüsheet Khan of Mongolia. This alphabet, 
also devised for use in transcribing religious texts 
from Tibetan and Sanskrit, was phonetic, but had a 
syllabic orthography of 41 consonants plus the other 
vowels. Each letter was topped by an inverted triangle 
and had a thick stem to the right side. Soyombo was 
usually written horizontally, but sometimes vertically. 
It is named after the soyombo, the alphabet’s opening 
symbol, which has been adopted as the emblem of 
Mongolian independence. 

A horizontal square script, also devised by 
Zanabazar, became known in Europe in the eighteenth 
century. A xylograph dating from AD 1729 
included the design for a seal. The script related to 
Tibetan and Sanskrit, and was written from left to 
right using 16 vowels, 45 consonants and 6 com- 
pound consonants. 

The so-called Vagindra script was designed in 
1905 by the Buryat Mongol scholar Agvaandorj 
(1854-1937), who left his native Buryatia in 1871 to 
become a lama at Drepung monastery in Tibet and 
later travelled in Europe before settling in Russia. 
Vagindra is the Sanskrit form of Agvaan, meaning 
‘master of eloquence’. His script had 8 vowels and 
26 consonants, and vowel length was marked. The 
alphabet was intended primarily for use by the 
Buryats, and several books were published in St 
Petersburg, but Vagindra script was overtaken by a 
Romanization of Buryat by the Buryat scholar Tseven 
Jamtsarano. 

The runic script, to which reference is sometimes 
made in histories of pre-Mongol Central Asia, was an 
alphabet of Old (Orkhon or Yenisei) Turkish. There 
are several famous stelae on Mongolian territory 
dating from the sixth to eighth centuries. The script 
resembles Scandinavian runic script, but there is no 
known connection. 

Only classical Mongolian written in the Uighur 
Mongolian ‘old’ script had the durability to outlast 
all its competitors. Although communist histories of 
the Mongolian revolution in 1921 (which led to the 
proclamation of the Mongolian People’s Republic in 
1924) have claimed that most people were illiterate 
at that time, the same histories indicate that the 
Mongolian revolutionaries published their own 
pamphlets and newspapers in the classical script. 
Under Manchu (Qing) rule, only those who could 
write as well as read were considered literate, a quali- 
fication which created a liability for clerical service. 
Lamasery schools taught Tibetan, and the Tibetan 
script was used for writing Mongolian. Even today, 
nearly all Buddhist liturgy in Mongolia is still in 
Tibetan. 

Along with the Soviet Buryats and Kalmyks, in 
the 1930s Mongolia’s people experimented briefly 
with Romanization. Postage stamps of the period 
show Mongols studying the new Latin alphabet and a 
man holding a flag inscribed Kubiskalta Zaluucuud 
‘Revolutionary Youth’. This Romanization closely 
followed a transliteration of the classical Mongol 
script. 

In March 1941, the ruling Mongolian People’s 
Revolutionary Party and the government, reflecting 
the change of mood in the USSR, resolved to replace 
the classical Mongol script with the Russian Cyrillic 
alphabet, using two extra letters for the front vowels 
ö and ü. However, it was not until after the Second 
World War, in 1946, that this Cyrillic alphabet was 
brought into use. Afterwards it underwent some 
modification, and long vowels initially marked with a 
bar were doubled. Literacy was claimed to have risen 
from 20.8 per cent in 1940 to 42.4 per cent in 1947 
(90 per cent adult literacy in 1963 and 100 per cent by 
1968). The importance of the new Mongol Cyrillic 
script was that it represented an enormous leap 
forward from the classical language towards a much 
better phonetic representation of the spoken language 
of the Khalkha majority, which was effectively, and 
became legally, the state language. 

Besides the Buryats who have lived for many 
generations along Russia’s border with Mongolia, 
some Buryats went to Mongolia from Russia after 
1911, especially during the civil war in Siberia. There 
was a small but influential number working as civil 
servants and skilled craftsmen. Even before 1921, they 
had helped to modernize the army, develop education 
and improve administration. There had been un- 
successful attempts to introduce the Cyrillic script 
amongst the Buryats in Russia in the nineteenth 
century, but they continued to write in Buryatized 
classical (literary) Mongol until the introduction of 
Jamtsarano’s Latin script (with one extra letter) in 
1929-31. This was displaced by a Cyrillic alphabet 
with three additional letters in 1938. 

In Chinese Inner Mongolia, a Cyrillic alphabet 
was also briefly introduced after the communists 
came to power, but was soon discontinued for fear of 
Mongol nationalism. In the early 1950s there was a 
plan to Romanize Mongol as well as Chinese but this 
was also abandoned and China’s Mongols continued 
to read and write in classical Mongolian. 

In the Mongolian People’s Republic (MPR), 
several generations of Mongols grew up after 1946 
with little or no knowledge of the classical script 
and the literary, historic and cultural heritage contained 
in it. Official policy on classical Mongolian 
changed after President Tsedenbal’s removal from 
power in 1984, and its teaching in senior schools was 
permitted. 

The momentous events of 1990, with the 
collapse of the MPR, the birth of democracy in 
Mongolia, the first multiparty elections and the 
formation of the country’s first coalition government, 
led to the wide popularization of classical Mongolian, 
with teaching in primary schools started and lessons 
broadcast on television. In 1991, the Mongolian 
government decided that classical Mongolian would 
be made the country’s official script from 1994. A 
standing committee of the State Little Khural (stand- 
ing legislature of that time) recommended that the 
classical script and Cyrillic script should be used in 
parallel until the end of the century. 

However, some Mongolian language specialists, 
while approving the decision to popularize the 
classical script, disagreed with its use for official 
purposes, seeing this as an anachronism in the com- 
puter age. Other opponents pointed to the huge cost 
of re-equipping Mongolia’s publishing houses with 
the necessary printing machinery. Most telling of 
all was the overwhelming rejection of classical 
Mongolian by the public, whose interest in the use of 
English and the Latin alphabet was growing, 
prompted by the rapid spread of computers and 
internet connections across Mongolia. 

Several scholars mooted their own different 
schemes for the Romanization of Mongolian 
Cyrillic, some proposing the use of diacritics, others 
avoiding them for easier e-mail communication in 
Mongolian. 

The Mongolian government instructed the State 
Committee for Standardization and Metrology to 
issue a standard for Romanization of the Mongolian 
Cyrillic script and in 2002 the Mongolian Great 
Khural (parliament) approved a law setting out a 
programme for Romanization: preparation 2002-4, 
and implementation 2005-6. The standard for 
Romanization, as revealed in the Mongolian press, was 
mostly a one-for-one transliteration of Cyrillic letters 
by Latin ones, but without diacritics there seemed 
to be no really satisfactory way of showing the differ- 
ence between front and back vowels, an essential 
element of the Mongolian language. Initially there 
was ambiguity about Romanizing — for instance, j as j 
or zh — and how to Romanize Russian words in Mon- 
golian with a soft or hard sign. The state committee 
opted to use o and u for the back vowels and o' and u' 
for the front vowels (ö and ü, as in this text), but 
using these once only in each word: e.g. o'noodor for 
önöödör ‘today’. The use of the ‘single quote’ was then 
extended to include c' for ch and s' for sh, though later 
this seems to have been dropped. By 2005, this 
Romanization standard had been abandoned. 
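
The apostrophe convention described above can be sketched in a few lines of Python. This is an illustration only, not the state committee's published standard: the rule is inferred from the single example given in the text (o'noodor for önöödör), the function name is hypothetical, and the input is assumed to be a lower-case transcription that already uses ö and ü.

```python
# Hypothetical helper illustrating the "mark once per word" convention.
# Assumption: only the first front vowel (ö or ü) in a word receives the
# apostrophe; later front vowels are written as plain o or u.
FRONT_TO_BASE = {"ö": "o", "ü": "u"}


def mark_front_vowel_once(word: str) -> str:
    """Replace ö/ü by o/u, adding an apostrophe after the first one only."""
    out = []
    marked = False
    for ch in word:
        base = FRONT_TO_BASE.get(ch)
        if base is None:
            out.append(ch)          # not a front vowel: copy unchanged
        elif not marked:
            out.append(base + "'")  # first front vowel carries the mark
            marked = True
        else:
            out.append(base)        # later front vowels are left unmarked
    return "".join(out)


print(mark_front_vowel_once("önöödör"))  # o'noodor
```

The sketch also makes the weakness noted above visible: once the mark is dropped from later syllables, a reader must rely on vowel harmony to recover the front vowels.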

Most transcriptions of classical Mongolian script 
now follow a Romanization by Nicholas Poppe (1991) 
that uses some accented and non-standard letters. A 
controversial system by Janhunen (2003) combines 
two Romanizations (letter and glyph) for segments 
with consonantal (j, v, w) as well as vocalic (i, a, e) 
values, with ‘significant consequences for the syllabic 
interpretation’ of Mongol script. For example, where 
initial vowel a is written with a double stroke, the first 
stroke is Romanized as v, so words beginning with a 
in the traditional transcription begin with va. Glyphic 
Romanization is being applied in compiling a Berlin 
library catalogue. 


5.9.2 Mongol in Russia 


The Buryat (buryaad) were conquered by Jochi, 
son of Genghis Khan, in 1207 and colonized by 
Russians from the seventeenth century. They live 
mostly in the Buryat Republic and the autonomous 
districts centred on Aginskoye (Chita region) and 
Ust'-Ordynskiy (Irkutsk region), which in early Soviet 
times were both part of the Buryat-Mongol Autonomous 
Republic. Generally there is a division between 
western and eastern dialects and cultures, the Buryat 
from west of Lake Baikal having been converted to 
Russian Orthodoxy, while those from east of the 
lake were converted from shamanism to Tibetan 
Buddhism and used the old Mongol script as the 
principal literary medium. The main ethnic groups 
are the Bulagat, Ekhirit and Khori Buryat. The 
Bulagat (Bulgad) settled on the Lena River and its 
tributaries (north-west of Lake Baikal); their name 
may mean ‘sable hunters’, or be a variant of Buryat. 
The Ekhirit (Ehrid) settled on the Barguzin River and 
tributaries (north-east of Lake Baikal); their name 
may mean ‘squirrel hunters’. They converted to 
Orthodoxy officially, as they had been converted 
earlier to Tibetan Buddhism, but remained shamanists. 
The Alar Buryats from western areas of the 
republic are semi-sedentary and live in hexagonal 
wooden gers (houses shaped like the traditional 
round felt tents). The Khori Buryat (Khori Tümed) 
from the Uda valley (south-east of Lake Baikal) 
nomadized into Mongolia in the mid-seventeenth 
century, moved to the Nerchinsk area east of Chita on 
the Shilka River some 25 years later, and finally settled 
in the Aginskiye steppes to the south in the eighteenth 
century. 


Table 5.12 Mongol speakers of Russia (2002 census) 

Ethnic group        Republic’s    Number in    Speakers of     Speakers of 
(republic)          population    group        own language    Russian 

Buryats             981,000       445,000      369,000         429,000 
Kalmyks             292,400       174,000      154,000         173,000 

Sub-groups by region: 
Buryats (Aga)       12,000        -            -               - 
(Ust'-Orda)         135,000       -            -               - 
Mongols             -             3,000        -               3,000 

Mongolized Turkic groups: 
Altais              202,900       67,000       66,000          63,000 
Tuvans/Tyva         305,500       244,000      243,000         207,000 

Source: Tabulated from Russian Federation Statistical Service’s website 


Buryat has numerous dialects (named after the 
area where the speakers live), and the language has 
preserved some ancient features (e.g. personal end- 
ings to verb forms, and Middle Mongol words of the 
twelfth to sixteenth centuries, found in the Secret 
History of the Mongols). The Tsongool dialect of 
Buryat, somewhere between Buryat and Khalkha, is 
spoken in the Noikhan area, at the confluence of the 
Selenga and Khilok rivers. The Aga Buryat dialect is 
spoken in Aginskoye (autonomous district), a former 
part of the Buryat Republic famous for the Aga 
datsan (lamasery), a district which was absorbed into 
Chita oblast' (region). Some Buryat are Buryatized 
Tuva or Tungus. 

The Buryat were shamanist rather than Buddhist, 
but were brought by Russian influence to Orthodox 
Christianity, especially those dwelling west of Lake 
Baikal. The Buryat national epic (since the 1940s) is 
the Geser, actually of Tibetan origin. After the 1917 
revolution the western Buryat tribes became part of 
Soviet Russia as the Buryat-Mongol republic, while 
the eastern Buryat came under the short-lived Far 
Eastern Republic as the Mongol-Buryat republic. 
Reunited in the USSR, the Buryat territory became 
the Buryat-Mongol Autonomous Soviet Socialist 
Republic (ASSR), but the word ‘Mongol’ was later 
removed amidst Soviet fears of Pan-Mongolism. 

Some Jungarian or Oirat Mongols settled in 
Buryat territory in the fifteenth century. More settled 
in the area of Nizhneudinsk (western Irkutsk region) 
and the former Zungarbukhotsky district in the seven- 
teenth and eighteenth centuries, after the destruction 
of the Jungarian state. 

From the 1920s to the 1970s the republic was 
divided into 18 aimag or ‘leagues’, named mostly 
after towns or geographical features: Barguzin, 
Bauntovsky, Bichura, Dzhida, Kabansk, Khorinsk, 
Kizhinga, Kyakhta, Mukhorshibir, Oka, Pribaikalsky, 
Selenga, Severo-Baikalski, Tunka, Ulan-Ude, Yeravnoe, 
Zaigraevo and Zakamensk. The territory of the 
Buryat Republic is now divided administratively into 
raion or ‘districts’: Bauntovsky Evenki, Dzhida, Muya, 
Oka, Pribaikalsky, Selenga, Tunka and Yeravnoe. The 
capital of the Buryat Republic is Ulan-Ude. 

The Kalmuck or Kalmyk (khalimag) are Jung- 
arian Oirad Mongols whose ancestors were the 
Khoshuud and Torguud, Ööld and Dörvöd, some 
300,000 of whom settled on the Volga in Russia in 
1616 to escape the Qing (Manchu). It is suggested 
that the name khalimag means ‘those who stayed 
behind’, though 125,000 of them returned to Jungaria 
in 1771. Those who remained in Russia now live 
mostly in the Kalmyk Republic (Khal'mg Tangch), the 
former Kalmyk ASSR, in the Caspian steppes. After 
Stalin’s abolition of the Kalmyk ASSR in December 
1943 for alleged wartime collaboration with the 
Germans, they were settled in other areas of the Volga 
valley (Astrakhan', Rostov, Volgograd and Stavropol') 
and parts of Siberia (Orenburg) and Central Asia. 

The Kalmyk republic was restored in 1957, but as 
a result of the scattering of the population only about 
10 per cent still spoke their mother tongue. Since the 
collapse of the USSR, the Kalmyk have been encouraged 
to restore their culture and language, which contains 
many Russian loans. The Kalmyk national epic is the 
Jangar, which appeared in the eighteenth century and 
contains Buddhist elements, though the Kalmyk were 
originally shamanists. The Kalmyks used the modi- 
fied classical Mongol script (tod bichig) until the 
introduction of a modified Cyrillic script, followed 
by Romanization (1931-8), and then a new Cyrillic 
script. The territory of the Kalmyk Republic is 
divided into districts: Iki-Burul, Chernozemelsky, 
Ketchenery, Lagan, Malye Derbety, Gorodovikovsk, 
Priyutnoe, Sarpa, Tselinny, Yashalta, Yashkul, 
Oktyabrski and Yusta (centre Tsagan Aman, on the 
Volga). The capital of the Kalmyk Republic is Elista. 

The Khalkh (Halh) Mongols living in Russia do 
not have an administrative territorial unit of their 
own, having mostly gone to the USSR (1924-90) to 
study or work there and then stayed on. 

The Oirot (Oirad) or Altai-Kizhi of the southern 
Altai Republic are partly Mongolized Turks. Their 
language is Turkic, but the vocabulary of livestock 
herding, for example, and the life-style of the herds- 
men are basically Mongol, under the influence of the 
successive rule of the Mongol Empire and the Jung- 
arian khanate. Their territory was called the Autono- 
mous Oirat Oblast’ (region) from 1922, and renamed 
the Gornyy (mountainous) Altai in 1948, the Russian 
name of the people, altaytsy, being derived from this. 

The administrative districts of the Altai Republic 
are Chemal, Kosh-Agach, Maima, Choya, Ongudai, 
Shebalino, Turochak, Ust-Kan, Ust-Koksa and Ust- 
Ulagan. The capital is Gorno-Altaisk. The Altai 
Republic is part of the Altay krai (capital Barnaul). 

The Tuva (Uriankhai) of the Republic of Tuva 
(Tyva, former Tuvan ASSR) are Mongolized Turks. 
The traditional lifestyle and culture is similar to that 
of the Khalkha Mongols. Their language is Turkic, but 
until the twentieth century it had no written form, 
and literate Tuva read and wrote the classical Mongol 
(Uighur) script. After experimenting with a Latin 
script, the Tuva were given a Cyrillic script. 

Like Mongolia, the Uriankhai territory was part 
of the Qing Empire, whose governor was resident in 
Uliastai, but Russian and Soviet expansion led to the 
creation of a quasi-independent Tannu Tuva People’s 
Republic in 1924. This was annexed to the Soviet 
Union in 1944. The capital of Tuva is Kyzyl. 

The administrative kojuun ‘banners’ of Tuva are 
Bai-Taiga, Barun-Khemchik, Bii-Khem, Chaa-Khol, 
Chedi-Khol, Dzun-Khemchik, Kaa-Khem, Kyzyl, 
Mongun-Taiga, Ovyursky, Sut-Khol, Tandinsky, Tes- 
Khem, Todzha and Ulug-Khem. Names in khem are 
tributaries of the Yenisei, names in khol (khöl) are 
lakes. 


5.9.3 Mongolian languages of China 


Six languages of the Mongolic stock are spoken in 
China: Mongolian (Inner Mongolian, Oirad and 
Bargu Buriat), Dongxiang (Sharaigol Mongol or 
Santa), Monguor (Tsagaan or ‘White’ Mongol, Tu), 
Daur (Dagur), Bao'an and Eastern Yugur. Generally 
speaking, the more assimilated, isolated or remote (in 
time and place) the Mongolian minorities are from 
mainstream Mongolian speakers, the greater the dif- 
ficulty of mutual comprehension. Monguor and Daur 
in particular are hardly understood by other 
Mongolian speakers. 

Inner Mongolian (for Mongol dialects, see Table 
5.14) is closely related to Khalkha (also Romanized in 
pinyin as Halh), the predominant language of 
independent Mongolia (the former Outer Mongolia), 
but fricatives tend to be softer: in Inner Mongolian 
chag ‘hour’, zham ‘road’, Khalkh tsag and zam. Daur 
and Monguor preserve some ancient primary long 
vowels, while secondary long vowels resulting from 
the decay of intervocalic -g- exist in all modern 
Mongolic languages. There are differences in phonetic 
systems of Mongolic languages in China: for example, 
there are 23 vowel contrasts in ‘Blue Banner’ (Qahar 
Mongol), in length and tenseness, whereas there is no 
contrast in tenseness in the other five languages, and 
therefore fewer vowel contrasts. 

In Dongxiang Mongol there is no distinction of 
vowel length (but it has several vowel clusters): it has 
only seven vowel contrasts. Mongolian has only 19 
contrasting consonants, while there are many more in 
other languages — for example, Dongxiang has 28, 
and Eastern Yugur 29. The main contrasting vowels 
and consonants display regular sound correspond- 
ences between the languages. 

Assimilation of consonants is a common feature 
of Mongolic languages: e.g. the dative-locative case 
ending -d after l, m, n and ng becomes -t after r, g and 
s, and the converb in -j (dzh) becomes -ch after g, r and 
v. Only Daur does not have this assimilation. Daur 
has developed the final consonants b, d, g and s into 
-r or -l: bular ‘spring’, olor ‘nation’, Khalkh bulag and 
ulus. Monguor has preserved ancient Mongolic initial 
labial *p in the form of f or x-: fan ‘year’, xarvan 
‘ten’, Khalkh on and arvan. 
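
The suffix alternation just described can be expressed as a small selection rule. The Python sketch below is an illustration only: it covers exactly the environments named in the text, the function names are hypothetical, and the handling of any other stem-final sound (including stems in ng before the converb) is an assumption made to keep the functions total, not a claim about the languages.

```python
# Illustrative sketch of the consonant assimilation described in the text.
# Stems are plain Latin transcriptions; the sample stems are arbitrary.


def dative_locative_suffix(stem: str) -> str:
    """Return 'd' after l, m, n, ng and 't' after r, g, s."""
    if stem.endswith("ng"):
        return "d"              # ng is checked first so it is not read as g
    if stem.endswith(("r", "g", "s")):
        return "t"
    return "d"                  # l, m, n per the text; anything else is an assumption


def converb_suffix(stem: str) -> str:
    """Return 'ch' after g, r, v and 'j' elsewhere."""
    if stem.endswith("ng"):
        return "j"              # assumption: the text does not mention ng here
    if stem.endswith(("g", "r", "v")):
        return "ch"
    return "j"


for stem in ("gar", "nom", "ang", "yav"):
    print(stem, dative_locative_suffix(stem), converb_suffix(stem))
```

As the text notes, a rule of this kind would not apply to Daur, which lacks the assimilation.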

In terms of lexicon — despite their common 
heritage of Turkic roots, Tibetan and Sanskrit 
religious and Manchu and Chinese administrative 
terminology — Inner Mongolian and Khalkha have 
begun to diverge. While Khalkha was absorbing 
the Soviet political socio-economic vocabulary in 
Russian for the 70 years between 1921 and 1990, 
Inner Mongolian and other Mongolic languages 
spoken in China began to acquire the political 
and economic vocabulary of Communist China in 
Chinese. Historically, Dongxiang has the largest 
number of modern Chinese loan words, while 
Bao'an, Monguor and Eastern Yugur have Tibetan 
words. Words beginning with r are foreign loan 
words. 

All the languages have case markings; suffixes 
vary from five (Bao'an) to ten (Daur), with seven for 
Inner Mongolian/Khalkha, not counting vocative and 
reflexive possessive suffixes. Dongxiang makes its 
chief distinction one of aspect, not of tense. Bao'an, 
Monguor and Eastern Yugur distinguish affirmative 
and non-affirmative forms in the declarative mode: 
‘affirmative’ indicates the speaker’s own experience, 
volition or decision, whereas ‘non-affirmative’ declar- 
ations are not the speaker’s own experience, or they 
may indicate unexpected or involuntary behaviour. 

Many national minority people speak Chinese or 
another minority language as their mother tongue. 
The figures given in Table 5.13 are for the numbers of 
persons of the nationalities; therefore the actual 
number of speakers is smaller. For example, about 30 
per cent of Daur nationality people speak a language 
other than Daur. The Bao'an nationality totals 
10,000, of whom only about 6,000 are Bao'an 
speakers, plus about 4,000 Bao'an speakers of 
Monguor nationality in Tongren County, Qinghai. 
There are also about 2,000 people of Hui nationality 
there who speak Kangjia, which is also similar to 
Bao'an. 


Table 5.13 Mongolian nationalities in China (1982 census) 


Nationality              Number 
Mongolian                2,917,500 
  Inner Mongolian        2,713,000 
  Oirad                    139,000 
  Barga-Buryat              65,000 
Dongxiang                  270,000 
Monguor                     90,000 
  Huzhu                     60,000 
  Minhe                     30,000 
Daur                        90,000 
Bao'an                      10,000 
  Dahejia                    6,000 
  Tongren                    4,000 
Kangjia                      2,000 
Eastern Yugur                4,000 


5.9.3.1 Mongolian dialects 


The dialects of Inner Mongolian, spoken in scattered 
communities over the greater part of the Inner Mon- 
golia Autonomous region and neighbouring areas, are 
mutually intelligible. Besides those listed in Table 
5.14, the speakers include the Ujemchin, Urat, Tumet 
and Abaganar. However, Oirad, which is spoken in 
Russia as well as China and western Mongolia, has 
preserved some archaic forms, including the front 
vowels ö and ü where other Mongolian languages 
now have o and u, and non-initial oo and öö have 
become aa and ää; the consonant dz becomes z and k 
appears where other Mongolian languages have x (kh): 
kem ‘who’, Khalkh xen. Meanwhile, in Buryat, also 
spoken in northern Mongolia and Russia, ö and ü 
have become o and u (or u only), changing the vowel 
harmony; s becomes h before a vowel and d at the end 
of a syllable; ch becomes sh or s, while dzh becomes 
zh, z or even y: yargal ‘happiness’, Khalkh jargal. 


Table 5.14 Speakers of Mongol dialects in China (est.)


Dialect              Number of speakers
Inner Mongolian               2,713,000
  Khorchin                    1,347,000
  Kharachin                     593,000
  Bairin                        317,000
  Chakhar                       299,000
  Ordos                         123,000
  Ejine                          34,000
Oirad                           139,000
  Torgut                        106,000
  Kok Nur                        33,000
Barga-Buryat                     65,500
  New Barga                      47,000
  Old Barga                      14,000
  Buryat                          4,500


It is estimated that around 86 per cent of China’s 
Mongolian nationality people speak Mongolian. 
Others speak mainly Chinese varieties. Chinese- 
speaking Mongols usually live in areas between Han 
(Chinese) and Mongolian peoples, where there is a 
mixed population of both nationalities. The Mongols 
of Tonghai County in Yunnan are Kazhuo (TB Ngwi)
speakers, while Tibetan-speaking Mongols are found 
in Qinghai and Tibet. 

According to the 2000 census results, there were
altogether 5,813,947 Mongols in China. In addition,
there were 513,805 Dongxiang, 241,198 Tu, 132,394 
Daur, 13,719 Yugur and 16,505 Bao'an. The Mongol 
population of China at the time of the 1990 census 
was 4,802,407, of which 3,000,000 were in Inner 
Mongolia and 100,000 in the Tian Shan and Tar- 
bagatai Mts and Ili valley, Xinjiang. 


5.9.4 Mongol leagues, banners and dialects 


The Republic of China government replaced the 
Mongol/Manchu administrative banner/league struc- 
ture of Inner Mongolia with provinces, including 
Suiyuan. The part of Inner Mongolia taken over by 
Manchukuo (1932-45) was called Jehe (Jehol) or 
Haluun Gol and included Barga, Daur, Ju Od and 
Jirem, with parts of Hinggan and Zost leagues. In 
January 1946 the assembly of people’s delegates from 
eastern Inner Mongolia set up the Eastern Mongolian 
Autonomous Region with Hulun Buir, Jirem, Zost 
and Ju Od aimag ‘leagues’, plus Nun River, which 
became part of Hulun Buir in April 1949, when Jehe 
became part of Ju Od League. A preparatory com- 
mittee for Inner Mongolia was created at a meeting in 
Alxa by delegates from Ordos, Alxa, Eznii Gol and
Ulanqab. Their Inner Mongolia comprised Alxa, Eznii
Gol, the Tümed special banners, Ulanqab, Ih Ju,
Shilin Gol, Qahar, Zost, Ju Od, Jirem and Hulun
Buir. 

When the Inner Mongolia Autonomous Region 
(IMAR) was set up in October 1949, it had six 
leagues: Qahar, Hinggan, Hulun Buir, Jirem, Ju Od
and Shilin Gol. Ulanqab League joined in 1954 from 
Suiyuan province, and Bayannur in 1956 from Gansu. 
At some point, as the territories of the cities grew, 
Zost (Josotu) League was displaced. 

During the Cultural Revolution in the 1960s,
the IMAR was truncated, leaving only four leagues: 
Bayannur, Ih Ju, Ulanqab and Shilingol. Large 
numbers of banners were placed under the adminis- 
tration of neighbouring provinces. The former Qahar 
aimag joined Xilin Gol, with one county to Ulaanjab. 
Hinggan (Ulaan Hot), formerly part of Hulun Buir,
was ceded to Jilin province, while Alashan (Alxa,
centre Bayan Hot, Alxa Left Banner), formerly part of 
Bayannur, was divided between Gansu and Ningxia. 
The IMAR was restored in December 1979, with 
some modification of the original boundaries. In the 
1990s Ju Od League was redesignated Chifeng City 
under its Mongol name Ulanhad. 

Under the most recent administrative changes, Ih 
Ju League has been redesignated Ordos City, Jirem 
League has become Tongliao City and the former 
leagues of Bayannur (centre Linhe), Ulanqab (Jining) 
and Hulun Buir (Hailar) have been redesignated 
prefecture-level cities. 

On the basis of territorial divisions (leagues), five 
main dialects of Inner Mongolia are distinguished by 
Janhunen: 


1 Jerim: Horqin, Jasagtu, Jarud, Jalaid and Gorlos; 

2 Ju Ud: northern sub-dialects Ar Horqin and 
Bairin, southern Ongniud, Naiman and Aohan; 

3 Zost (Josotu): Harqin and Tumd;

4 Ulanqab: Qahar, Urad, Darhan, Muumingan, 
Dörvön Hüühed (Siziwang) and Hexigten; and

5 Shilingol: Ujimqin, Huuchid, Abaga, Abaga nar 
and Sonid. 


Ordos (Urdus) speakers constitute a southern main 
dialect, but are usually not distinguished from other 
Mongols in Chinese material. 

Dialects are traditionally identified on a tribal 
basis. Note that administrative territorial divisions 
(khoshuu, aimag, Chinese shi ‘city’) have political 
rather than ethnic or linguistic boundaries and have 
been subject to arbitrary change. In 2006, the 
administrative units of Inner Mongolia were: 


Abag (Avga) Banner of Xilingol League 

Abagnar (Avga nar) Banner of Xilingol League 

Aohan Banner in Chifeng City (former Ju Ud 
League) 

Bairin (Baarin) Left and Right Banners of 
Chifeng City (former Ju Ud League) 
Barag (Barga) New Right, New Left and Old
(Chipchin) Banners of Hulun Buir League 

Dalad Banner of Ordos City (former Ih Ju 
League) 

Darhan in Darhan Muu Minggan Joint Banner 
of Baotou City (formerly in Ulanqab League)

Daur in Morin Dawa Autonomous Banner and 
Butha Banner (Zalantun) of Hulun Buir 
League 

Dorbod (Dörvöd) in Siziwang (Dörvön
Hüühed) Banner of Ulanqab League

Ejin Banner of Alxa League 

Hanggin Banner of Ordos City (former Ih Ju 
League) and Hanggin Rear Banner of 
Bayannur 

Harqin Banner of Chifeng (former Ju Ud 
League) 

Hexigten Banner of Chifeng (former Ju Ud 
League) 

Horqin Right Wing Front and Right Wing 
Centre banners of Hinggan League; Horqin 
district, and Left Wing Centre and Left Wing 
Rear banners of Tongliao City (former Jirem 
League), and Ar (Rear) Horqin Banner of 
Chifeng City (former Ju Ud League) 

Jalaid Banner of Hinggan League 

Jarud Banner of Tongliao City (former Jirem 
League) 

Minggan (Myangad) in Darhan Muu Minggan
Joint Banner of Baotou City (formerly in
Ulanqab); a Darhan Banner was part of
Jirem League in 1929.

Naiman Banner of Tongliao City (former Jirem 
League) 

Ongniud Banner of Chifeng City (former Ju Ud 
League) 

Otog and Otog Front banners of Ordos City 
(former Ih Ju League) 

Qahar (Tsahar) Right Wing Centre, Right Wing 
Front and Right Wing Rear banners of 
Ulanqab, Zhengxiangbai [Plain and
Bordered White, Xulun Hobot Qagan] 
Banner (centre Qagan Nur), Xianghuang 
[Bordered Yellow, Hobot Xar] Banner and 
Zhenglan [Plain Blue, Xulun Hoh] Banner 
(centre Dund Hot) of Xilingol League 

Sonid Left and Right banners of Xilingol League 

Tumd (Tümed) Left Banner of Huhhot City and
Right Banner of Baotou City (Bugat) 

Ujimqin East and West banners of Xilingol
League 

Urad Front Banner and Centre and Rear joint 
Banner of Bayannur City (ex League) 


In the seventeenth century, there were two Khalkha 
(Halh) banners, Khalkha Left Wing in Ju Od League 
and Khalkha Right Wing in Ulanqab League. The 
Qahar, Tümed and Barga banners were originally not
part of Inner Mongolia. The eight banners of Qahar 
were closer to the Great Wall and Kalgan (Chuulalt 
Khaalga). There are reported to be some Buryat living 
in Hulun Buir. 

In addition to the administrative units with 
Mongol population in Inner Mongolia listed above, 
there are the following outside it: 


Dorbod: (Dörvöd) Mongol Autonomous
County, Heilongjiang province, and Subei 
Mongol Autonomous Banner, Gansu 

Gorlos: Front Gorlos Mongol Autonomous 
County, Jilin province — they are said to be 
related to the Bayat, who originated in 
Hulun Buir 

Harqin: Harqin Left Wing Mongol Autonomous
County, Liaoning province 

Khoshuud: Haixi Mongol-Zang-Kazak 
Autonomous Prefecture in Qinghai, Hoxud 
County, Bayangol Autonomous Prefecture, 
Xinjiang 

Oold (‘Tsaidam Mongols’): Haixi Mongol-Zang 
[Tibetan]-Kazak Autonomous Prefecture, 
Qinghai 

Qahar: ‘Ili Qahar’ in Bortala [Bor Tal] Mongol 
Autonomous Prefecture, Xinjiang 


Qapqal (Tsavchaal): Qapqal Xibe Autonomous 
County, Ili Kazak Autonomous Prefecture 

Torguud: Bayangol (Bayingolin) Mongol 
Autonomous Prefecture (Bayit dialect) and 
Hoboksar [Hovogsair] Mongol 
Autonomous County, Xinjiang 

Tsoros: Henan Mongol Autonomous County, 
Qinghai; Tibetanized Mongols 

Tumd (Tümed): Fuxin Mongol Autonomous
County, Liaoning 


There are reported to be some other small Mon- 
gol communities elsewhere: in Weicheng Mongol- 
Manchu Autonomous County in Hebei, and Datong 
Mongol and Khoton Autonomous County in Qinghai 
(these former administrative units seem to have been 
abolished). The Tsastyn (Snow) Mongols are Oirad
(Dörvöd and Khoshuud) living in the high mountains
on Gansu’s southern border. There are small Mongol 
communities in Henan and Sichuan provinces. There 
are three communities in Yunnan province, claiming 
descent from the Mongols who invaded Song China 
in the thirteenth century. They speak Kazhuo, a TB 
language, but no Mongol. There are also said to be 
some 20,000 people claiming Mongol ancestors in 
Sichuan, 2,500 in Guizhou, and 2,000 in Hubei. Most 
of the people of Mongol nationality in Sichuan are
speakers of Moso, a TB language. References to the 
Hor Mongols of south-western Tibet and Sog Mon- 
gols of the Lhasa area may be mistaken, since Hor is 
the Tibetan word for Monguor, and Sog the word for 
Mongol. 


5.9.5 Other Mongol nationalities of China 


Some Mongolic minorities of China are listed as 
separate nationalities. According to the 2000 census, 
there were 513,805 Dongxiang, 241,198 Tu, 132,394 
Daur, 13,719 Yugur and 16,505 Bao'an people in 
China. 

The Dongxiang (Dongxiang Mongol, also known 
as Santa) take their name from the place name 
Dongxiang. They live south of the Yellow River, in 
Dongxiang Autonomous County, Linxia Hui 
Autonomous Prefecture, Gansu Province, but there 
are scattered groups in Qinghai, Ningxia and Xinjiang 
(Ili valley), where 50,000 relocated in the 1960s. Their 
language is a form of Mongol. They may be the des- 
cendants of a one-time garrison of Mongol troops. 
They are Muslim, and there is evidence of a written 
form of their language in Arabic script. 

The Tu (Tukun, Tuhujia) are also called 
Monguor. They are to be found mostly in Huzhu Tu 
Autonomous County, Minhe County and Datong
County, Qinghai, with scattered groups in forested 
mountain areas of Tongren and Dule, and Menyuan, 
Tianzhu Tibetan Autonomous County, Gansu. 

The Minhe Monguor are known as Mangghuer 
or White Mongols, descendants of a Mongol Empire 
garrison (or later Mongol invaders) who intermarried 
with local herdsmen. They number perhaps 25,000. 
Their form of Mongol has a considerable number of 
Sinitic loans, and there are two distinct dialects, 
Huzhu and Minhe (San ch'uan). 

The Huzhu Monguor are known as Mongghul, 
and number perhaps 50,000. Both Mangghuer and 
Mongghul are Yellow Hat Tibetan Buddhists. Their 
languages have no written form, and Chinese or 
Tibetan is used. 

The Daur (Dahur, Dagur), possibly descendants 
of the Khitans (Liao), settled along the Nenjiang river 
and now live mainly in Morin Dawa Daur Autono- 
mous Banner and Ewenki Autonomous Banner of 
Inner Mongolia and Qiqihar, but some were settled in
Tacheng (Xinjiang) as Qing border guards in the 
eighteenth century. Their language is archaic, preserv- 
ing Middle Mongol elements (twelfth to sixteenth 
centuries), and has the Nonni (Butha, Qiqihar and 
Morin Davaa), Hailar (Hulun Buir), Amur (Heihe) 
and Xinjiang dialects. They are shamanists. There 
have been various attempts to write Daur in Manchu, 
Cyrillic and Romanized scripts. 

The Yugur (Yaohu'er, Weiwu, Weiwuerh,
Weiguerh) mainly live in Sunan Yugur Autonomous
County and Huangnibao Yugur township, Jiuquan 
City, Gansu. They are descendants of Huigu from the 
Ergun basin, some of whom in the ninth century 
moved to Dunhuang (Gansu), Zhangye and Wuwei in 
the Hexi Corridor, Gansu, and also Hami (Xinjiang). 
Their Ganzhou khanate was conquered by the Xixia 
in the eleventh century, and they came under Mongol 
rule in 1227. The nationality includes speakers of 
Mongolic and Turkic languages, Chinese and Tibetan. 

The Western Yugur speak a Turkic language, 
while the Eastern Yugur, mostly from Sunan, speak 
Enger (Engle), a form of Mongolian. They are also 
known as Shira Yughur, and mistakenly as ‘Yellow 
Uighurs’. Their ancient religion was Tengerism (wor- 
ship of the Heavenly Khan) with liturgy in Raohul, 
but they converted to Lamaism. Neither language has 
a written form. 

The Bao'an (Baonan, Bonan) speak a Mongolian 
close to Mangghuer (Tu) and Santa (Dongxiang). 
There is no script. They live along the Yellow River, 
mostly in Jishishan Bonan-Dongxiang-Salar Autono- 
mous County, Gansu. There are some in Linxia Hui 
Autonomous Prefecture, Gansu, and Xunhua County, 
Qinghai. 

The Bao'an from Tongren County, Qinghai 
Province, in the late nineteenth century fled to 
Xunhua, then to Dahejia, in Dongxiang Autonomous 
County. They are Muslim. The Buddhist Bao'an in 
the Bonan area are registered as of Monguor (Tu) 
nationality. 

The term ‘Shirongol’ or ‘Sharaigol Mongols’ 
seems to have applied to all the Mongols of the 
Qinghai-Gansu area except the Shira Yughur and 
Qinghai Oirad. 

There are also some Mongol speakers amongst the
Ewenki (Evenki, also called Tungus and even ‘Yakut’), 
of whom 30,505 (2000) live in the Greater Hinggan 
mountains, mainly in Ewenki Autonomous Banner of 
Hulun Buir meng (League) of Inner Mongolia, the 
rest in Chen Barag, Ergun Left, Morin Dawa and 
Arun banners, Zhalantun City and Nehe County, all 
in Heilongjiang Province. The Ewenki of Ergun Left 
Banner are reindeer herders. Their language (Hui- 
Yimin, Mergel and Alruguya dialects) is Manchu- 
Tungusic, but the herders in pastoral areas speak 
Mongol. The Ewenki were brought from the Nenjiang
river basin in 1732 for guard duties in Hulun Buir. 
For more details of their languages, see section 5.8 
above. 


5.9.6 Mongol communities outside 
Mongolia, Russia and China 


The Oirad dialect is spoken by Sart Kalmuck, descendants
of Ööld and Torguud Mongols who in 1880
emigrated from Jungaria to Chelpek and Börü Bashi,
east of Lake Issyk Köl, in modern Kyrgyzstan.

Moghol was spoken by the Mogholi of Afghani- 
stan, descendants of abandoned garrisons of Mon- 
gol troops from the thirteenth-century invasion of 
Khwarazmia. There are a few Moghol documents in 
Arabic script. They lived in Kandahar, Ghor and 
Herat (3,000 persons in the 1960s and 1970s), but 
very few spoke Mongol, their language having been 
strongly influenced by Dari. Little research has been 
done into their language and culture. The where- 
abouts of Moghol speakers has not been ascertained 
since the series of wars and occupations in Afghanistan,
and Moghol may be extinct.

The Hazara tribe claims descent from the 
Mongols, on the basis that their name means ‘one 
thousand’ in Persian, the name of a Mongol army 
unit, but there seems to be little other evidence. Their 
language is not Mongolic. 


5.10 Indo-European 


There are two Indo-European languages spoken by 
local citizens in the area: Russian and Tajik in western 
China. Russian is a Slavic language and the national 
language of Russia; Tajik is an Iranian language and
the national language of Tajikistan. There are also two 
Portuguese creoles, Patua and Kristang, spoken in 
former Portuguese port colonies in China and 
Malaysia. 

Russian is an officially recognized national 
minority of China, with 15,609 people who were 
citizens of China in 2000; in addition, there are many 
more Russians temporarily working or trading in 
China and Mongolia. Of the Russian national 
minority of China, nearly 60 per cent are in Xinjiang, 
including over 20 per cent in Tacheng (Qoqek)
Prefecture in the north-west (with 14 per cent in 
Tacheng town alone), 16.5 per cent in Urumqi and 
7.5 per cent in Yining. Forty per cent of the Russian 
nationality of China are very widely scattered; they 
are mainly in urban centres, and have excellent know- 
ledge of Chinese. Russian was a major foreign lan- 
guage in Chinese education in the early 1950s, but 
since 1958 this has declined dramatically. 

Tajik is also an officially recognized nationality of
China; most of the Tajik in China speak varieties 
of Wakhi or Sarikoli, which are among the cluster of 
Iranian languages spoken in eastern Tajikistan and 
north-eastern Afghanistan and closely related to 
standard Tajik. In 2000 there were 41,028 Tajik in 
China, nearly two-thirds of whom live in extreme 
south-western Xinjiang in the Taxkorgan Tajik 
Autonomous County bordering on Pakistan, Afghanistan
and Tajikistan, and most of the rest in surrounding
counties: just under 10 per cent in Zepu
(Poskam), another 6 per cent in Shache (Yarkand), 
and so on. 

The two Portuguese creoles of this area, Patua 
and Kristang, are remnants of Portuguese colonial 
outposts. Melaka (formerly known as Malacca) is a 
port on the west coast of peninsular Malaysia, and 
was a major Portuguese outpost from 1511 to 1641. It
later went through other hands, Dutch (1641-1795
and 1818-25), British (1795-1818, 1825-1942, 1945-57)
and Japanese (1942-5), before becoming a part
of independent Malaysia. Melaka was settled from
Portuguese Goa, and its indigenous creole, Kristang, 
contains a substantial Dravidian and Indic lexical 
element. During its centuries of contact with Malay 
in Melaka, it also took over large numbers of Malay 
words. What may have kept the Kristang community 
distinct was its Catholic religion and a continuing 
presence of Portuguese-speaking priests; indeed 
the name Kristang means ‘Christian’. In modern 
Malaysia, there is increasing language shift to English 
and Malay, with the creole in decline. Baxter (1988) 
provides a full description of this creole, and Baxter 
(1996) briefly summarizes its history and structure. 
There are about 10,000 people in Melaka who identify 
as Kristang, but only about 2,150 speakers, mainly 
over 50 but with some young adults and a few chil- 
dren. In order to qualify for bumiputra (indigenous) 
status in Malaysia, some younger people are learning 
Kristang, now regarded as an indigenous language of 
Malaysia. 

The Portuguese took over Macao as a trading 
port on the coast of China in 1555, and there was 
obviously substantial contact with Melaka and Goa, 
as the indigenous creole of Macao, known as Patua or 
Papia Kristam, also contains a large Malay and sub- 
stantial Dravidian and Indic lexical elements. In add- 
ition, it has a large Cantonese Chinese lexical com- 
ponent (Fernandes and Baxter 2001). Patua also 
spread to Hong Kong during British rule after 1841, 
but is now gone from there and in serious decline in 
Macao. Patua, unlike Kristang, underwent a long- 
term decreolization due to continuing Portuguese 
rule and the resulting influx of Portuguese-speaking 
administrators and priests. The return of Macao to 
China in 1999 led to a major exodus of the creole 
population, all of whom had Portuguese citizenship. 
It is estimated that about half the creole popula- 
tion of 10,000 moved to Portugal before the end 
of 1999, leaving about 5,000 behind. Among the 
group remaining in Macao, Patua is still spoken by 
about fifty people, mostly over 70 years old. However,
a minor folkloristic revival is under way, as
evidenced by publications such as Fernandes and
Baxter (2001).


5.11 Mixed languages 


There are four mixed languages spoken in western 
China. All are the result of contact along the Silk 
Road between China and Central Asia. They are 
Aynu, Hezhou Hui, Tangwang and Wutun. 

Aynu (Eynu, Äynu) is a mixed language of
south-western Xinjiang with Iranian basic lexicon but 
Uighur structure. It is spoken in a string of oasis
towns in south-western Xinjiang, including Lop,
Hotan (Khotan), Moyu (Karakax), Shache (Yarkand),
Yengisar and Shule, by small
groups of descendants of Persians. The Aynu speakers
are classified as part of the Uighur nationality, and so 
no exact statistics are available, but they are estimated 
to number several thousands (Ladstätter and Tietze
1994). Another name for them is Abdal Uighur, but 
this is pejorative. Traditions suggest that they arrived 
in two waves, first as Shiite missionaries up to 1,000 
years ago, and then as merchants who settled down 
about 400 years ago when trade along this part of the 
Silk Road ceased. Because there has been contact 
between speakers of Iranian and Turkic languages for 
so long, Aynu has been heavily influenced by Uighur, 
and most scholars view it as a mixed language. All 
Aynu also speak Uighur. Lee-Smith (1996a) presents 
a brief summary of the structural characteristics of 
Aynu, showing that its phonology and morphosyntax 
are quite similar to those of Uighur, but much of 
its basic lexicon is Iranian. This makes it a mixed 
language with special lexicon (Matras and Bakker 
2003: 9). 

Some of the Hui (Chinese Muslims) who live in 
Linxia Hui Autonomous Prefecture of south-western 
Gansu Province, at the eastern end of the former Silk 
Road, speak Chinese with verb-final syntax; they are 
sometimes known as Hezhou. The verb-final struc- 
ture may be connected to the fact that all other lan- 
guages in the area are verb-final: Turkic, Mongolic, 
Tibetan and so on. Verb-final Hezhou appears to be 
a contact variety which arose when Muslim speakers 
of a Turkic language assimilated into the Chinese- 
speaking population. Salar is the Turkic language 
spoken in the same area, but there are fewer Salar 
speakers than Hezhou speakers. The total population 
of Hui in China was 9,816,805 in 2000, but the vast 
majority of these speak local Chinese varieties, mainly 
Mandarin. About 4,000 Hui in Hainan Province 
speak the Austronesian Chamic language Tsat. There 
are also about 2,000 Hui of Tongren County in 
Qinghai Province with the surname Kang who speak 
the Mongolic language Kangjia. Over 600,000 Hui live 
in Linxia Prefecture, but some of them speak normal 
verb-medial Chinese. Speakers of Hezhou are also 
found among the Han Chinese population of Linxia 
Prefecture, and among the Hui and Han Chinese 
of some nearby areas of north-eastern Qinghai 
Province, such as Hainan and Huangnan Tibetan 
autonomous prefectures and Xunhua Salar Autono- 
mous County, where there are altogether about 
50,000 Hui. As only some of the Hui in all these areas 
speak Hezhou, their location is not indicated on Map 
45. For some details of the structure of Hezhou, see 
Lee-Smith (1996b). While much of the lexicon 
is Chinese, there are various postpositions which 
appear to be Turkic. However, unlike in Aynu, there is no
full Turkic morphological system. Phonologically, 
Hezhou is mainly Chinese, but has only three tones 
and some neutralization among these. The language 
appears to be a mixed variety of Chinese with some 
components of Turkic morphosyntax but fairly 
limited suffixation, not a structurally Turkic language 
relexified by Chinese as suggested by Lee-Smith 
(1996b: 872). 

Tangwang is a mixed language with Chinese and 
Dongxiang Mongolic elements, spoken in about a 
dozen villages in north-eastern Dongxiang Autonomous
County of south-western Gansu Province. The
largest two villages are Tangjia and Wangjia, whence
the name Tangwang (Lee-Smith 1996c: 875). There 
are about 20,000 speakers or about 10 per cent of the 
Dongxiang in this county; a few Tangwang speakers 
are members of the Hui nationality. The language is 
the outcome of contact between Mongolic Dongxiang 
and Chinese. It has a full Mandarin-like set of four 
tones, but with a merger of two tones in progress. Its 
morphology is sometimes derived from Chinese 
forms, but with Mongolic meaning and use; for 
example, the plural marker -m is from Chinese -men 
which is an optional restricted plural for pronouns 
and some human nouns, but in Tangwang this is a 
general and obligatory plural marker. The categories 
of nominal inflection follow a reduced Mongolic 
pattern and have some Dongxiang forms, such as 
the instrumental/comitative, Tangwang -la; compare
Dongxiang instrumental -qala and comitative -la. It is
verb-final, and its syntax is more or less parallel to 
that of Dongxiang. Overall, it can be seen as a mixed 
language with Mongolic structure but largely Chinese 
lexicon; the extent of the Chinese lexicon in Tang- 
wang is much greater than the Iranian lexicon found 
in Aynu. 

Wutun is the speech of three villages with about 
2,000 people, just north of the county seat of Tongren 
County in north-eastern Qinghai Province; the main 
village is called Wutun, whence the name of the lan- 
guage. The Wutun speakers are classified as Bao'an 
nationality, but follow Tibetan Buddhism and would 
prefer to be classified as Tibetan. Their language 
contains Chinese, Tibetan and some basic Bao'an 
Mongolic lexicon. Like Aynu, Hezhou and Tangwang, 
it is verb-final. Lee-Smith (1996d) summarizes some 
structural characteristics, showing that Wutun is 
morphosyntactically very mixed. Detailed documen- 
tation by Yeshe Acuo is under way. 


5.12 Language policies 


5.12.1 China 


China is officially a multi-ethnic country, with 56 
nationalities: the Han majority and 55 national 
minorities. Overall, the minorities represented over 
105 million people or nearly 8.5 per cent of the popu- 
lation in 2000. In principle, each nationality has a 
language, a culture and a territory, though some 
nationalities lack or are losing a distinct language and 
many nationalities include speakers of more than one 
language. 

The national language is the standard Mandarin 
variety of Chinese based on the speech of Beijing and 
known as putonghua ‘common speech’, written since
1958 with simplified characters and transliterated 
with the pinyin Romanization for some purposes. 
This is the official language, universally taught in 
schools and used in most media and government. 
Knowledge of this variety has spread very widely 
since 1950, but is still not universal among all 
of China’s minority nationalities. Speakers of other 
Chinese varieties are not supposed to use those 
varieties in official, educational or other public con- 
texts, according to the 2000 National Language Law, 
but there is nevertheless considerable local media and 
unofficial government use of non-Mandarin Chinese 
varieties, as well as continuing use in daily life. People 
whose mother tongue is a Chinese variety other than 
standard Mandarin often vary in the standardness of 
their speech and code-switch between it and the 
standard, depending on the situation and the inter- 
locutors. For more details of language policy for 
Chinese, see Section 5.1.1 above and Bradley (1991). 

Since 1950, the government has systematically 
created autonomous areas at the province level 
(autonomous regions, of which there are five: Tibet, 
Xinjiang, Ningxia, Guangxi and Inner Mongolia), the 
prefecture level, and the county level, each designated 
for one or more specific national minorities, such 
as Xinjiang Uighur Autonomous Region, Honghe 
Hani and Yi Autonomous Prefecture, Lancang Lahu 
Autonomous County and so on. In addition, local
authorities may designate townships within counties 
for specific national minorities or for national 
minorities in general. In some parts of northern 
China, the former Qing dynasty military terminology 
is used instead, so for example there are autonomous 
banners instead of autonomous counties. Within 
these autonomous areas, minority leaders have con- 
siderable control over education and cultural activ- 
ities, and can make decisions about how much to use 
their languages in education and public life; they may 
also decide not to use them. 

The policy for minority languages has followed 
political trends in China, with a period of develop- 
ment from 1950 to 1958, followed by nearly twenty 
years of neglect or even hostility. Since the mid-1970s, 
there has been a renewed effort to use and support 
the languages of the national minorities; in the last 
ten years, this effort has waned somewhat. It is very 
difficult to generalize about the implementation of 
language policy for national minority languages, even 
within one national minority, as local policies differ 
very greatly; for example, see the discussion of very 
diverse policies for the Yi nationality in Bradley 
(2001). In the 1980s and early 1990s, many people 
who had formerly been classified as Han Chinese 
were reclassified at their own request as members of 
various national minorities; such people had one (or 
more) parent or grandparent who was a member of 
the minority. This led to spectacular population 
increases for many nationalities between the 1982, 
1990 and 2000 censuses. Almost none of these reclas- 
sified minority members can speak the traditional 
language of their group. 

Language policy in Hong Kong emphasized 
English up to 1997, but this has changed since its 
return to China; for the current situation, see section 
5.1.1.2 above. Language policy in Taiwan is discussed 
in section 5.1.1.1 above; briefly, the national language 
is Mandarin Chinese, written in traditional characters 
and called Guoyu ‘national language’. There are vari- 
ous Romanizations in use, none entirely consistently. 


5.12.2 Japan and Korea 


Japan and Korea are relatively homogeneous societies 
with little ethnic diversity; while there are Koreans 
and Ainu in Japan, they are not part of the main- 
stream, and all can also speak Japanese. In both coun- 
tries the policy has been to promote literacy in the 
national language. In Japan, the number of kanji 
(Chinese characters) required in education and 
authorized for use in written Japanese was greatly 
reduced in 1946. Only 1,850 characters were still 
approved for use after 1946, though in 1981 this was 
expanded to 1,945, with 166 others available for use 
in names. The other major post-war reform was to 
eliminate the Sino-Japanese literary style and to write 
solely in modern spoken Japanese. 

During the Japanese occupation from 1910 to 
1945, many Koreans learned Japanese and some were 
moved to Japan to work in factories there. The main 
foreign languages of North Korea are Russian and 
Chinese. In South Korea, the use of Chinese char- 
acters or hanja is more extensive than in North Korea; 
there is also a national obsession with learning 
English. Both Japanese and Korean are inundated 
with English loanwords adapted to local phonology. 

Language policy in Mongolia is summarized by 
Sanders in Section 5.9 above. 


5.12.3 Mainland South-East Asia 


Apart from Singapore, which has four official lan- 
guages, each nation of mainland South-East Asia has 
one indigenous majority language that is used as the 
national language: Vietnamese in Vietnam, Lao in 
Laos, Khmer in Cambodia, Burmese in Burma, Malay 
in Malaysia, and Thai in Thailand. 

While the three nations of Indochina were under 
French colonial rule, Burma, Malaysia and Singapore 
were under British colonial rule; this has left a residue 
of knowledge of the ex-colonial language. There was
Japanese occupation in all these countries up to 1945,
from various dates between 1940 and 1942. Towards 
the end of this period, nominally independent states 
were set up in all the former colonies. After a brief 
French return to Indochina, this was followed by 
American influence and the spread of English up to 
1975 in all except North Vietnam. British colonial 
rule ended more harmoniously, apart from a mainly 
ethnic Chinese communist insurgency in Malaysia. 
Thailand was never colonized, but like many coun- 
tries has embraced the learning of English. 

Vietnam, Laos since 1975, Cambodia since 1979, 
and Burma have minority policies similar to those 
of China, which were in turn derived from those of 
the former Soviet Union. Often the actual minority 
policy is less liberal, and less fully implemented, in 
some of these countries than in China. Singapore, as a 
multi-ethnic state, has a pluralistic policy. Malaysia 
has used language policy as a means of unification, by 
expanding the status and domains of Malay, while 
still supporting the linguistic and other rights of 
indigenous minorities. Thailand has an ambivalent 
policy, recognizing and supporting the development 
of its minorities while also attempting to educate 
them in Thai and integrate them into national life. In 
every one of the nations of mainland South-East Asia, 
the outcome is that many small minority languages 
are endangered, and even the languages of many 
larger minorities are in decline, both in speaker 
numbers and in domains of use. 

After the departure of the French in 1954, Viet- 
nam was divided into the Democratic Republic of 
Vietnam (North Vietnam) and the Republic of Viet- 
nam (South Vietnam). Both countries used Vietnam- 
ese in its standard literary northern dialect as the 
national language and universal medium of educa- 
tion. Apart from residual knowledge of French, the 
main foreign language in North Vietnamese educa- 
tion was Russian, while the main foreign language in 
South Vietnam was English. North Vietnam followed 
a Soviet-style nationalities policy with recognition 
and nominal support for minority languages and 
cultures, while South Vietnam followed the French 
policy of recognizing the montagnard (French for 
‘mountain people’) groups but not assisting with 
their languages. Since reunification in the Socialist 
Republic of Vietnam in mid-1976, the Soviet-style 
policy has been extended across the entire country. 
This has resulted in a large increase in the inventory 
of ethnic groups, from 36 to 54 (including the 
majority Viet or Kinh who are just over 80 per cent of 
the total population), and one minor change: the Van 
Kieu nationality of former North Vietnam was 
known as Bruu in South Vietnam, so this group is 
now known as Bruu-Van Kieu. When compared to 
the classification of national minorities in China, the 
classification in Vietnam is more finely grained; some 
groups that are classified as one nationality in China 
are classified as separate ethnic groups in Vietnam. 

After full independence in 1953, Laos was also 
divided by civil war. The Mekhong valley was con- 
trolled by the Royal Lao government, while the east of 
the country was controlled by the Pathet Lao (‘Lao 
nation’) or communist party, with support from 
Vietnam and China. The Pathet Lao area was mainly 
inhabited by ethnic minorities, while the Royal Lao 
area was mainly inhabited by ethnic Lao, who are just 
over half the total population. There was also a major 
effort by the Americans to create and maintain a Meo 
army in north-eastern Laos to resist the Pathet Lao. 
Both Royal and Pathet Lao governments agreed that 
Lao was the national language. After reunification 
under the Pathet Lao in the Lao People’s Democratic 
Republic in late 1975, the traditional Lao script was 
slightly reformed to eliminate consonant clusters that 
are not pronounced, and education in Lao became 
more systematic in remote areas of the country 
inhabited by ethnic minorities. Russian and Chinese 
influence increased, but a residual knowledge of 
French and English persisted. The 1985 classification 
of the ethnic groups of Laos listed 47 groups, includ- 
ing the majority Lao; a revised 2000 list reduced this 
to 45, and a new list approved in 2002 has 49. Unlike
China and Vietnam, but like Cambodia and Burma,
Laos has no policy of supporting the languages of 
ethnic minorities; all education, government and 
media are in Lao. 

Cambodia was also an independent kingdom 
from 1953; it remained outside the conflicts in its 
neighbours until the late 1960s, but then became 
deeply involved. The kingdom was replaced by a 
republic in 1970, and the Khmer Rouge took over in 
1975, with a major effect on language policy: Khmer 
was greatly simplified, with honorific forms and 
much else eliminated, and all education was solely 
political. When the Vietnamese replaced the Khmer 
Rouge in early 1979, the new government reintro- 
duced formal education and reversed many of the 
Khmer Rouge linguistic reforms. The minorities of 
Cambodia are mainly in the east of the country, an 
area gravely affected by the conflict up to 1975. The 
Cham were a particular target of the Khmer Rouge; 
the ethnic Chinese in the towns and the ethnic Viet- 
namese in the south-east also suffered badly, but for- 
tunately most of the smaller and remoter groups were 
largely ignored from 1975 to 1978. Since minorities 
other than the Vietnamese and the ethnic Chinese are 
a small proportion of the population in remote areas 
of Cambodia, language policy has not given them 
much attention. Up to 1975, French persisted more 
in Cambodia than in some of its neighbours; Viet- 
namese influence was particularly strong after 1979, 
and English is also spreading. 

Burma has 135 recognized national groups, 
including the Burman majority (known as Bamar 
inside Burma). There are seven ethnic-based states 
around the borders, for the seven largest minority 
national groups: Arakanese (known as Rakhine inside 
Burma), Chin, Kachin, Shan, Kayah, Karen (known 
as Kayin inside Burma) and Mon. The policy is to 
spread the use of Burmese (known as Myanmar 
inside Burma), and to this end all education, govern- 
ment and other public life is entirely in Burmese. In 
1990 the preferred English forms of all place and 
national group names were changed by the SLORC 
military government to reflect their pronunciation in 
modern literary Burmese. For example, they prefer 
the country to be known as Myanmar (the literary 
form) instead of Burma (from the spoken form). 
Some place names with English origins were replaced: 
for example, Maymyo became Pyin-oo-lwin. The 
National League for Democracy government elected 
in 1990, which has never been allowed to take office, 
prefers the former names, including Burma. Many 
minority ethnic groups were deeply involved in rebel- 
lions against the central government from the late 
1940s onward, and their languages are tolerated at 
best. During the British colonial period up to 1948, 
knowledge of English became quite widespread; 
but this has declined radically over the long years 
of military government and isolation since 1962. 
Chinese influence has grown rapidly in Burma over 
the last ten years, and Yunnanese Mandarin Chinese 
is now widely spoken by traders and others in many 
areas of north-eastern Burma. 

Malaya was formed as a federation in 1948 and 
achieved full independence in 1957. It included most 
of the former British Straits Settlements colonies — 
Pinang (formerly Penang), Melaka (formerly Malacca) 
and Labuan — and nine protectorates of peninsular 
Malaya (previously governed indirectly through 
Malay sultans). In 1963 Malaysia was formed from 
Malaya and the East Malaysian states of Sarawak and 
Sabah (formerly British North Borneo), also includ- 
ing Singapore from 1963 to 1965. This text covers 
only peninsular or West Malaysia; East Malaysia is 
discussed in the Pacific and Australasia section (4.3.9 
and 4.4.1). The national language is Malaysian, which 
is a form of Malay. Since independence, the policy 
has been to promote the Malay language at the 
expense of the former colonial language, English, and 
the languages of non-indigenous groups, mainly 
ethnic Chinese but also large numbers of speakers of 
various Dravidian languages and smaller numbers 
of speakers of Indic languages from South Asia, who 
arrived during the British colonial period. Peninsular 
or West Malaysia has a majority of ethnic Malays.
There are also various Mon-Khmer (MK) Aslian languages and some
groups speaking languages related to Malay. Indi- 
genous groups, including the Malays and the Orang 
Asli ‘original people’, are known as bumiputra. While 
the Orang Asli have some special rights, their Aslian 
languages are not taught in schools and continue to 
decline. During the colonial period, a very high level 
of ability in English was reached by much of the 
population. In 1970, Malaysian started to be used as 
the medium of education, and standards of English 
have since declined. So-called ‘national-type’ schools 
are permitted to continue to teach other languages, 
including Mandarin Chinese. Taiwan was eager to 
assist Malaysian Chinese schools, and traditional 
characters were used in most such schools until the 
early 1980s; but since then, simplified characters have 
been taught. 

The policy of Singapore is summarized in Section 
5.1.1.3 above; briefly, there are four official lan- 
guages: English, Mandarin Chinese, Malay and Tamil. 
English has a distinctive Singaporean variety, one of 
the New Englishes. The government has been vigor- 
ously promoting Mandarin, written with simplified 
characters, in place of other Chinese varieties since 
1979. 

Thailand is usually said to be a relatively homo- 
geneous society in which everyone is Thai and speaks 
Thai, but in fact there is great diversity within the 
Thai population. Most people speak a South-western 
Thai variety as their first language, but major regional 
varieties are quite different from Central Thai. 
Central Thai is the national language, used exclusively 
in education, the media and public life. Apart from 
the indigenous population, Thai society has been very 
receptive to migration over many centuries. There is a 
large Sino-Thai minority in every city and town, most 
of whom arrived over the last two centuries, and 
smaller numbers of various groups from India and 
further west. The ethnic diversity of Bangkok is dis- 
cussed in Section 5.3.1.4.1 above. Another reason 
for the widespread distribution of small pockets 
of minority groups from surrounding countries is 
the pre-twentieth-century policy of bringing entire 
villages of war captives from Laos, Cambodia and 
Burma and resettling them in the central plain, so 
that they could pay taxes and work for the Thai state. 
Many such villages have now fully assimilated and 
speak only Thai, but some also maintain their trad- 
itional languages. 

A third type of minority comprises groups that are
more numerous in Burma and Laos but also live in remote
mountainous areas of Thailand. While Thailand has a
long-term policy of helping these so-called hill tribes, 
the help consists of providing schools to teach Thai, 
building roads to facilitate the influx of Thai settlers 
and even foreign tourists, and economic develop- 
ment, all of which lead to assimilation. The number 
of hill tribes recognized by the government has grad- 
ually risen from nine in the 1970s to over twenty 
today, but no assistance with hill tribe language main- 
tenance has ever been provided by the Thai govern- 
ment. There are, however, some Thai universities, such 
as Mahidol, which are making considerable efforts in 
this area. 
Southern Asia: from Iran to Bangladesh


6.1 Introduction 


From west to east the countries covered in this section 
are Iran, Afghanistan, Pakistan, India, Nepal, Sri 
Lanka, Bhutan and Bangladesh, along with a number 
of island states or dependencies in the Indian Ocean — 
Maldives, Seychelles, Comoros, Reunion, Mauritius 
and Cocos Islands. Madagascar, in spite of its geo- 
graphical location, is dealt with in Section 4 because 
of the linguistic affinities of Malagasy. 

The language distribution of the area is complex 
and heterogeneous, and the same is true of most of 
the individual countries. Language boundaries rarely 
correspond to national boundaries, and by no means 
uniformly correspond to internal administrative 
boundaries. Individual and regional bilingualism or 
multilingualism are commonplace. The status of a 
given language variety — whether it is more appro- 
priately regarded as a language or as a dialect — is 
not always clear. Available statistics vary in quality, 
reliability and date. The number of languages is 
large; Gordon (2005) records the following totals for 
living languages: Iran 75, Afghanistan 47, Pakistan 72, 
India 415 (the Census of India 1961 lists indigenous 
languages returned as being in excess of 1,500, a 
point that will be returned to later), Nepal 123, Sri 
Lanka 7, Bhutan 24, Bangladesh 39, Maldives 1, 
Seychelles 3, Comoros 7, Reunion 3, Mauritius 6. 
For such reasons absolute precision in mapping 
individual languages is not always possible, but a 
satisfactory indication of the major locations of most 
languages can be given. 

Several major language families are represented 
over this wide area. They are Semitic (to a limited 
extent only and mainly in border regions in the west), 
Turkic, the Indo-Iranian branch of Indo-European 
(with the major sub-branches of Iranian and Indo- 
Aryan), Dravidian, Austro-Asiatic, Tibeto-Burman, 
Kam-Tai and Andamanese. There is one language — 
Burushaski — with no known affiliation and another 
whose affiliation is disputed — Nihali, treated in this 
work as an isolate. Not surprisingly, typological dif- 
ferences are to be found between, and to some extent 
within, these groups, though areal factors may have 
diminished some of the distinctions (see, for instance, 
Masica 1986). In what follows, the basic presentation 
is by country, though there are inevitable overlaps 
because of the fact that, for example, the main lin- 
guistic groups in both India and Sri Lanka are 
speakers of Indo-Aryan or Dravidian. 


6.2 Countries of the area 


6.2.1 Iran 


The population of Iran was estimated as 69,018,924 
in 2004. The languages spoken by this population 
belong to four different language families, Indo- 
European (specifically the Iranian sub-branch of 
the Indo-Iranian branch), Altaic (Turkic branch), 
Dravidian and Semitic. The dominant one of these is 
Iranian, with some 58 per cent of the population 
speaking a language belonging to this group. Among 
Iranian languages, Farsi (Persian), the national language,
is spoken by more than a third of the population.
Total numbers of speakers for the principal
languages spoken in the country (taken mainly from 
Gordon 2005) are given in Table 6.1. The dates of the 
ultimate sources for these figures range from 1979 to 
2002, and all are therefore approximate. It is not pos- 
sible to assign the scattered Domari (Middle Eastern 
Romani) people to a particular area. Speakers of 
Domari also use the language of the region in which 
they live, e.g. Farsi, Luri, Bakhtiari or Qashqai. 


Table 6.1 Principal languages of Iran 


Language Speakers 
Azerbaijani 23,500,000 
Farsi (Persian) 23,000,000 
Kurdish 6,600,000 
Gilaki 3,265,000 
Mazanderani 3,265,000 
Luri 2,375,000 
Turkmen 2,000,000 
Qashqai 1,500,000 
Arabic 1,400,000 
Domari 1,338,271 
Bakhtiari 1,000,000 
Baluchi, Western 451,000 
Baluchi, Southern 405,000 
Khorasani Turkish 400,000 
Dari 350,000 
Hazaragi 283,000 
Takestani 220,000 
Armenian 170,800 
Kurmanji 200,000 
Pashto 113,000 
Taleshi 112,000 
Khalaj 42,107 
Harzani (Taleshi) 28,132 
Hawrami (Gurani) 22,948 
Semnani 21,099 
Khunsari (Kumzari) 21,099 
Karingani (Taleshi) 17,583 
Aimaq 17,000 
Brahui 10,000 
Dari 10,000 
Bashkardi 7,033 
Kazakh 3,000 


6.2.2 Afghanistan 


The population of Afghanistan was estimated as 
28,513,677 in 2004. Because of the lack of recent cen- 
suses, no such precise figure is available for the num- 
ber of speakers of any of Afghanistan’s 40+ languages, 
which represent four families, the Indo-Iranian and 
Nuristani branches of Indo-European, the Turkic 
branch of Altaic, Dravidian and Semitic. The last of 
these comprises only Arabic, and that essentially only 
because of its place in the practice of Islam. The one 
Dravidian language is Brahui. Pashto has been pro- 
moted as the national language but has met with 
some competition from Dari (Persian). The numbers 
of speakers for the principal languages spoken in the 
country (taken mainly from Gordon 2005) are given 
in Table 6.2. The dates of the ultimate sources for 
these figures range from 1979 to 2000, and all are 
therefore approximate. 


Table 6.2 Principal languages of Afghanistan 


Language Speakers 
Pashto 8,000,000 
Dari 5,600,000 
Hazaragi 1,770,000 
Uzbek 1,403,000 
Turkmen 500,000 
Aimaq 480,000 
Pashayi 217,000 
Baluchi 200,000 
Brahui 200,000 
Shugni 20,000 
Kati 15,000 
Wakhi 9,566 
Gawar-Bati 8,000 
Tajiki 5,000 
Kamviri 4,000 
Munji 3,768 
Savi 3,000 
Uyghur 3,000 
Malakhel 2,860 
Gujari 2,000 
Karakalpak 2,000 
Kazakh 2,000 
Wotapuri 2,000 
Waigali 1,500 
Ashkun 1,200 
Prasuni 1,000 
Sanglechi-Ishkashimi 1,000 
Shumashti 1,000 
Tregami 1,000 
Kirghiz 750 
Tirahi 100 
Ormuri 50 


6.2.3 Pakistan 


The most widely spoken languages of Pakistan, in 
descending order of numbers of speakers, are Pun- 
jabi, Pashto, Sindhi, Siraiki (Saraiki), Urdu and Bal- 
uchi (Balochi), with Punjabi (Panjabi) speakers com- 
prising almost half the total population of 
132,352,279 in 1998. In 1981 the next largest totals 
were for Hindko and Brahui, but separate figures 
were not provided for these in 1998. These languages 
fall into three groups: Indo-Aryan (Punjabi, Siraiki, 
Sindhi, Urdu, Hindko), Iranian (Pashto, Baluchi) and 
Dravidian (Brahui). Other languages spoken include 
Bashgali, Chitrali, Dameli, Gujarati, Gujuri, Rajas- 
thani, Kalasha, Kashmiri, Khowar, Kohistani, Koli, 
Marwari, Shina, Torwali (Indo-Aryan), Hazaragi, 
Munji, Yidgha, Persian, Tajiki, Wakhi (Iranian), 
Arabic (Semitic), Balti (Tibeto-Burman) and Burushaski
(no known affinities). Detailed recent statistics
for these languages are not available.

Different languages dominate in different prov- 
inces: Punjabi in Punjab, Sindhi in Sind, Pashto in the 
North-West Frontier Province (NWFP) and in Feder- 
ally Administered Tribal Areas (FATA). In Balu- 
chistan, Baluchi, Brahui and Pashto all have an 
important place. Much of this is apparent from Table 
6.3, which is based on the 1998 census. The percent- 
age figures provided by the census give the following 
approximate numbers of speakers: Urdu 10.0m, Pun- 
jabi 58.3m, Sindhi 18.7m, Pashto 20.4m, Baluchi 
4.7m, Siraiki 13.9m. A separate figure for Brahui was
not provided in the 1998 census, but it is clear that
there has been a major reduction: 20.7 per cent was
recorded for Brahui speakers in Baluchistan in 1981,
whereas the inclusive figure for ‘others’ in 1998 was
only 4.11 per cent.


Table 6.3 Principal languages of Pakistan: speakers as percentage of population, 1998


Language   Pakistan    NWFP    FATA   Punjab   Sindh   Baluchistan   Islamabad
Punjabi       44.15    0.97    0.23    75.23    6.99          2.52       71.66
Pashto        15.42   73.90   99.10     1.16    4.19         29.64        9.52
Sindhi        14.10    0.04    0.01     0.13   59.73          5.58        0.56
Siraiki       10.53    3.86       -    17.36    1.00          2.42        1.11
Urdu           7.57    0.78    0.18     4.51   21.05          0.97       10.11
Baluchi        3.57    0.01    0.04     0.66    2.11         54.76        0.06
Others         4.66   20.43    0.45     0.95    4.93          4.11        6.98
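
The conversion from census percentages to approximate numbers of speakers described above is simple arithmetic. The following is a minimal Python sketch, assuming the 1998 population total of 132,352,279 and the all-Pakistan percentages quoted for 1998; the names `POPULATION_1998`, `SHARES` and `approx_speakers_millions` are illustrative only, and the results may differ in the last digit from the rounded figures given in the text.

```python
# Assumption: total population of Pakistan in the 1998 census, as quoted in the text.
POPULATION_1998 = 132_352_279

# All-Pakistan percentages of speakers, 1998 (from the census figures quoted in the text).
SHARES = {
    "Punjabi": 44.15,
    "Pashto": 15.42,
    "Sindhi": 14.10,
    "Siraiki": 10.53,
    "Urdu": 7.57,
    "Baluchi": 3.57,
    "Others": 4.66,
}


def approx_speakers_millions(share_percent, population=POPULATION_1998):
    """Convert a percentage share into an approximate speaker count in millions."""
    return round(population * share_percent / 100 / 1_000_000, 1)


if __name__ == "__main__":
    for language, share in SHARES.items():
        print(f"{language}: about {approx_speakers_millions(share)}m speakers")
```

Because the census publishes percentages rounded to two decimal places, figures derived in this way are approximations only.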

The proportions of people speaking the various 
languages in the capital city, Islamabad, differ con- 
siderably from those for the country as a whole, as 
Figures 6.1 and 6.2 show; in the case of Islamabad, the
percentage figures for Sindhi (0.56), Baluchi (0.06) 
and Siraiki (1.11) are too low to show in such a 
diagram. 


Figure 6.1 Principal languages of Pakistan: speakers as
percentage of population, 1998


Figure 6.2 Principal languages of Islamabad: speakers
as percentage of population, 1998


Given these figures, it is perhaps rather remark- 
able that Urdu is the national language of Pakistan, 
and therefore the language of the central government, 
as well as being the main language of literacy and 
broadcasting. That it has a substantial presence at all 
in Pakistan is the result of Muslim refugees’ settling in 
Karachi and other cities in the region after the parti- 
tion of British India in 1947. The status of English as 
an official language means that educated people in 
the cities of Pakistan tend to be trilingual in Urdu, 
English and their mother tongue. In rural areas 
monolingualism is the norm, though Urdu is widely 
understood through its place in broadcasting and 
films, and its use as a lingua franca is widespread. 


6.2.4 India 


The linguistic situation in India is extremely complex. 
As will have been apparent from the widely differing 
figures given in the introduction to this section, it is 
impossible to be dogmatic about the number of lan- 
guages spoken. The reasons for this are varied but 
familiar enough. Available statistics are necessarily to 
some extent unreliable. Perhaps more importantly, 
the distribution of the different language groups 
forms continua, with normally no distinct line of 
demarcation between different speech varieties. For 
many varieties there is no written standard. A prob- 
lem arises from the fact that, by whatever criterion a 
language is defined and its area of use demarcated,
the majority of the languages have never been
reduced to writing — except, that is, by linguists for 
descriptive purposes, but not for the purpose of use 
by native speakers. In other cases, because of relatively 
high levels of illiteracy, speakers may not associate 
their speech with a related form used by literates for 
reading and writing. 

One result of such factors is that speakers of two 
varieties that are sufficiently related to be regarded 
by a descriptive linguist as the same language may 
well name their language differently when ques- 
tioned by a census enumerator, a situation which 
goes a long way to explaining why several languages 
appear in census returns as having only one speaker 
(47 languages in 1961) or very small numbers of 
speakers. It then becomes the task of linguists 
employed by the census office to determine which 
sets of returns can properly be grouped together as 
representing one language. The views of other lin- 
guists as to what constitutes a separate language also 
come into play. As far as India is concerned, the 
views expressed by Grierson in The Linguistic Survey 
of India (1903-28) have been very influential, but 
have been accepted to differing degrees in the succes- 
sive censuses that followed its publication. A good 
discussion of this complex issue can be found in 
Masica (1991: 23-7). 

The number of languages for which statistics are 
given in census publications varies considerably. The 
variations are due to a range of factors, including the 
method chosen for the presentation of the data 
collected, the lower limit chosen in respect of the 
number of speakers and the nature of decisions about 
classification. Thus Rajasthani is separately listed in 
1971, whereas in 1981 and later it is included in the 
figures for Hindi. The 1981 census focused its atten- 
tion on 145 languages. That of 1971 focused on 175 — 
out of a total of 1,652 language names submitted by 
enumerators. Of these, 292 are explicitly stated to be, 
with greater or lesser degrees of certainty (‘possibly’, 
‘apparently’, ‘obviously’), spurious. If these are 
excluded, along with 52 non-Indian languages, there 
still remain a possible 1,302 Indian languages, though 
a further 259 of these are recorded as having fewer 
than ten speakers. 

In 1991, as Table 6.4 will show, the number 
of languages for which detailed information has 
been made available was 114, namely only those for 
which 10,000 or more speakers were recorded (as had 
happened in 1981 also). In this respect, the presenta- 
tion of the results of the census of 1961 was more 
thorough. Out of the 1,652 mother-tongue names 
identified by enumerators (not all of them, of course, 
indicating an actual language), 1,018 languages from 
the four major language families represented in India 
were abstracted: 574 Indo-Iranian languages with a 
total of 321,720,700 speakers, 226 Tibeto-Burman 
languages with a total of 3,183,801 speakers, 153 
Dravidian languages with a total of 107,410,820 
speakers, and 65 Austro-Asiatic languages with a total 
of 6,192,495 speakers. Similar information was no 
doubt gathered in the 1991 census, but it does not 
appear to have been published: in the Rajya Sabha 
(the upper house of the Indian parliament) on 
13 December 2004, the Culture Minister, Jaipal 
Reddy, is reported as stating that, according to the 
1991 census, there are 3,372 languages in India — 
1,576 classified and 1,796 unclassified. 
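
The 1961 breakdown by family quoted above can be checked by simple addition. The following is a minimal Python sketch that uses only the figures given in this paragraph; the names `CENSUS_1961`, `totals` and `speaker_shares` are illustrative and do not come from the census publications.

```python
# Figures quoted in the text for the 1961 census:
# family -> (number of languages, total number of speakers).
CENSUS_1961 = {
    "Indo-Iranian": (574, 321_720_700),
    "Tibeto-Burman": (226, 3_183_801),
    "Dravidian": (153, 107_410_820),
    "Austro-Asiatic": (65, 6_192_495),
}


def totals(families):
    """Return (total languages, total speakers) across all families."""
    languages = sum(count for count, _ in families.values())
    speakers = sum(spoken for _, spoken in families.values())
    return languages, speakers


def speaker_shares(families):
    """Return each family's share of the combined speaker total, in per cent."""
    _, all_speakers = totals(families)
    return {name: 100 * spoken / all_speakers
            for name, (_, spoken) in families.items()}


if __name__ == "__main__":
    n_languages, n_speakers = totals(CENSUS_1961)
    print(f"{n_languages} languages, {n_speakers:,} speakers")
    for family, share in speaker_shares(CENSUS_1961).items():
        print(f"{family}: {share:.2f} per cent of speakers")
```

The four language counts add up to the 1,018 languages mentioned in the text. The shares cover only speakers of languages in these four families, not the whole population.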

That the published statistics from the 1991 
census thus give a somewhat different picture from 
those of 1961 is just one of the problems involved in
the mapping of India’s languages. Other problems
relate to the fact that not every language name in the 
published census can be equated with an individual 
language recognized by a geolinguist. The name 
Kisan, for example, regularly appears in a census but 
is classified under two different families, Dravidian 
and Indo-Aryan; that is to say that some of the sub- 
jects who returned it speak a variety of Kurux and 
others a dialect of Maithili. It would, in fact, appear to 
be the case that membership of a tribal group is 
sometimes confused with language spoken. Among 
other census names that do not appear on our maps 
are Jatapu (which is probably to be equated with the 
Dravidian language Kuvi) and Pochury (which seems 
to refer to a cluster of Tibeto-Burman languages in 
Nagaland, including Ntenyi and Sema). 

Apart from family affiliations, the languages of 
India can usefully be divided into two groups, a small 
group of languages — most of them with very large 
numbers of speakers — that are recognized by the 
Constitution, and a much larger number that are not. 
Most of the former have their own writing system; 
most of the latter have no written form. Members of 
the first group are referred to as ‘scheduled languages’,
because of their appearance in Schedule VIII
of the Constitution. Originally these totalled 14, but
their number has been progressively increased to 22. 
These are Assamese, Bengali, Bodo, Dogri, Gujarati, 
Hindi, Kannada (referred to in many earlier publica- 
tions as K/Canarese), Kashmiri, Konkani, Maithili, 
Malayalam, Manipuri, Marathi, Nepali, Oriya, Pun- 
jabi, Sanskrit, Santali, Sindhi, Tamil, Telugu and Urdu. 
The numbers of speakers given in the 1991 census for 
languages of India that have more than 10,000 
speakers are shown in Table 6.4 (figures for the 2001 
census not yet being available). Because conditions 
did not allow the enumeration to take place in 
Jammu and Kashmir, the figures for a few languages 
are taken from the 1981 census. Given that the total 
population of the country rose from
838,567,936 to 1,027,015,247 between 1991 and 2001, 
the numbers of speakers for almost all these lan- 
guages will now be significantly higher. One of the 
now scheduled languages, Maithili, does not appear 
in this list. In 1991, it was not one of the 18 then 
scheduled, and the total of its speakers was among 
those of the 43 languages that were included in the 
figure for Hindi. However, the total number of those 
claiming to speak Maithili at the time of the census 
was recorded: 7,766,597. 

The figures that emerge for language families 
from these statistics are shown in Table 6.5. From this 
it is evident that the largest number of individuals 
speak an Indo-Aryan language, while the largest 
number of languages spoken in the country are 
Tibeto-Burman. This contrast is even greater if languages
with fewer than 10,000 speakers (not listed in
censuses since 1961, but accounting for a total of 
565,949 persons in 1991) are taken into account. 
Many of these languages are seriously endangered, as 
are some of the non-literary Dravidian languages, 
such as Toda and Kota (765 and 862 speakers respect- 
ively in 1961). 

In 1991 approximately 96.5 per cent of the population spoke one
or another of the then-18 Schedule VIII languages; 
now, with the increase to 22, the percentage can be 
assumed to be about 98. Most of these languages are 
the official language of a state (or, in the case of 
Hindi, of more than one state) and, since the reorgan- 
ization of the country into linguistic states in 1956, 
major language boundaries follow administrative 
boundaries fairly closely. 

As has been indicated above, statistics on the lan- 
guages of India are highly dependent on the analysis 
made of information provided by the latest census. 
The number of language names generated is always 
very large, but the idiosyncratic nature of many of 
these is evident from the fact that unitary figures — 
sometimes only one — are often attached. The census 
linguist has the task of reducing these to some sort of 
order, and the same solution is not necessarily 
adopted at successive censuses. The 1961 census 
report, as noted above, lists several languages that do 
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Table 6.4 Languages of India with more than 10,000 speakers (1991) 


Language (family) Speakers State(s) where majority live Language (family) Speakers State(s) where majority live 

Hindi (I-A) 337,272,114 Uttar Pradesh/Madhya Pradesh/Bihar/ Tibetan (T-B) 69,416 Sikkim/Karnataka/Himachal Pradesh/Aru- 
Jharkhand/Rajasthan/Chhattisgarh nachal Pradesh /West Bengal 

Bengali (I-A) 69,595,738 West Bengal Kabui (T-B) 68,925 Manipur 

Telugu (Dr) 66,017,615 Andhra Pradesh Phom (T-B) 65,350 Nagaland 

Marathi (I-A) 62,481,681 Maharashtra Hmar (T-B) 65,204 Manipur/Assam 

Tamil (Dr) 53,006,368 Tamil Nadu Kinnauri (I-A) 61,794 Himachal Pradesh 

Urdu (I-A) 43,406,932 Uttar Pradesh/Bihar/Maharashtra/Andhra Bishnupuriya (I-A) 59,233 Assam/Tripura 
Pradesh/Karnataka Kuki (T-B) 58,253 Manipur/Assam/Nagaland/Meghalaya 

Gujarati (I-A) 40,673,814 Gujarat Bhotia (T-B) 55,483 Sikkim/Arunachal Pradesh/Uttar Pradesh/ 

Kannada (Dr) 32,753,676 Karnataka West Bengal 

Malayalam (Dr) 30,377,176 Kerala Sanskrit (I-A) 49,736 Uttar Pradesh 

Oriya (I-A) 28,061,313 Orissa Paite (T-B) 49,237 Manipur/Mizoram 

Punjabi (I-A) 23,378,744 Punjab Chakru/Chokri (T-B) 48,207 Nagaland 

Assamese (I-A) 13,079,696 Assam Balti (T-B) 47,718 Jammu and Kashmir (1981) 

Bhili/Bhilodi (I-A) 5,572,308 Madhya Pradesh/Rajasthan/Maharashtra Sangtam (T-B) 47,461 Nagaland 

Santali (A-A) 5,216,325 Bihar/West Bengal/Orissa Yimchungre (T-B) 47,221 Nagaland 

Kashmiri (I-A) 3,174,684 Jammu and Kashmir (1981) Bhumij (A-A) 45,302 Orissa 

Gondi (Dr) 2,124,852 Madhya Pradesh/Maharashtra Parji (Dr) 44,001 Madhya Pradesh 

Sindhi (I-A) 2,122,848 Gujarat Monpa (T-B) 43,226 Arunachal Pradesh 

Gorkhali/Nepali (I-A) 2,076,645 West Bengal/Assam/Sikkim Lahnda (I-A) 41,183 Jammu and Kashmir/Rajasthan/Madhya 

Konkani (I-A) 1,760,607 Karnataka/Goa/Dadra and Nagar Haveli Pradesh (1981) 

Tulu (Dr) 1,552,259 Karnataka Lepcha (T-B) 39,342 Sikkim/West Bengal 

Dogri (I-A) 1,520,889 Jammu and Kashmir (1981) Wancho (T-B) 38,600 Arunachal Pradesh 

Kurukh/Oraon (Dr) 1,426,618 Bihar/Madhya Pradesh/West Bengal Rengma (T-B) 37,521 Nagaland/Assam 

Manipuri/Meithei (T-B) 1,270,216 Manipur Zeliang (T-B) 35,079 Nagaland 

Bodo/Boro (T-B) 1,221,881 West Bengal Lalung (T-B) 33,746 Assam 

Khandeshi (I-A) 973,709 Maharashtra/Gujarat Chang (T-B) 32,478 Nagaland 

Ho (A-A) 949,216 Bihar/Orissa Chakhesang (T-B) 30,985 Nagaland 

Khasi (A-A) 912,283 Meghalaya Nocte (T-B) 30,441 Arunachal Pradesh 

Mundari (A-A) 861,378 Bihar/Orissa/West Bengal/Assam Halam (T-B) 29,322 Tripura/Assam 

Tripuri (T-B) 694,940 Tripura/Mizoram/Assam Mishmi (T-B) 29,000 Meghalaya 

Garo (T-B) 675,642 Meghalaya /Assam Koda/Kora (A-A) 28,200 West Bengal 

Kui (Dr) 641,662 Orissa Limbu (T-B) 28,174 Sikkim 

Halabi (I-A) 534,313 Madhya Pradesh/Maharashtra Gadaba (A-A) 28,158 Andhra Pradesh/Orissa 

Lushai/Mizo (T-B) 538,842 Mizoram Mogh (T-B) 28,135 Tripura 

Korku (A-A) 466,073 Madhya Pradesh/Maharashtra Tangsa (T-B) 28,121 Arunachal Pradesh 

Munda (A-A) 413,894 Orissa/Bihar/West Bengal/Assam Korwa (A-A) 27,485 Madhya Pradesh 

Miri/Mishing (T-B) 390,583 Assam/Arunachal Pradesh Liangmei (T-B) 27,478 Manipur 

Karbi/Mikir (T-B) 366,229 Assam Nicobarese (A-A) 26,261 Nicobar Islands 

Savara (A-A) 273,168 Orissa/Andhra Pradesh Vaiphei (T-B) 26,185 Manipur 

Koya (Dr) 270,994 Andhra Pradesh/Orissa Koch (T-B) 26,179 Meghalaya 

Kharia (A-A) 225,556 Bihar/Orissa Jatapu (Dr) 25,730 Andhra Pradesh 

Khond/Kondh (Dr) 220,783 Orissa/Andhra Pradesh Khiemnungan (T-B) 23,544 Nagaland 

English (I-E) 178,598 Maharashtra/Tamil Nadu/etc. Lakher (T-B) 22,947 Mizoram 

Nissi/Dafla (T-B) 173,791 Arunachal Pradesh Zemi (T-B) 22,634 Assam/Manipur/Nagaland 

Ao (T-B) 172,449 Nagaland Lahauli (T-B) 22,027 Himachal Pradesh 

Sema (T-B) 166,157 Nagaland Arabic (Sem) 21,975 Uttar Pradesh/Bihar/Karnataka/etc. 

Kisan (Dr) 162,088 Orissa Deori (T-B) 17,901 Assam/Arunachal Pradesh 

Adi (T-B) 158,409 Arunachal Pradesh Konda (Dr) 17,864 Andhra Pradesh 

Rabha (T-B) 139,365 Meghalaya Juang (A-A) 16,858 Orissa 

Konyak (T-B) 137,722 Assam/Nagaland Sherpa (T-B) 16,105 Sikkim/West Bengal 

Malto (Dr) 108,148 Bihar Zou (T-B) 15,966 Manipur 

Thado (T-B) 107,992 Manipur/Assam Pawi (T-B) 15,346 Meghalaya 

Tangkhul (T-B) 101,841 Manipur Maring (T-B) 15,268 Manipur 

Kolami (Dr) 98,281 Maharashtra/Andhra Pradesh Shina (T-B) 15,032 Jammu and Kashmir (1981) 

Angami (T-B) 97,631 Nagaland Gangte (T-B) 13,695 Manipur 

Coorgi/Kodagu (Dr) 97,001 Karnataka Kom (T-B) 13,548 Manipur 

Dimasa (T-B) 88,543 Assam Khezha (T-B) 13,004 Nagaland/Manipur 

Lotha (T-B) 85,802 Nagaland Anal (T-B) 12,156 Manipur 

Mao (T-B) 77,810 Manipur Pochury (T-B) 11,231 Nagaland 

Ladakhi (T-B) 72,587 Jammu and Kashmir Maram (T-B) 10,144 Manipur 


Table 6.5 Principal language families of India, 1991 


Language family    No. of languages    Speakers 
Indo-Aryan         19                  701,915,596 
Dravidian          16                  188,644,132 
Austro-Asiatic     14                  9,461,999 
Tibeto-Burman      63                  7,195,194 


not occur thirty years later. A selection of 1961 figures 
is given in Table 6.6. In this, languages printed in 
capital letters indicate groupings of sets of languages 
(the figure in parentheses being the number of languages 
separately recognized but included within 
these superordinate language names). Some, though 
by no means all, of these language names appear in 
the census returns for 1991, but in the final report 
they are grouped under Hindi, the stated total for 
which includes 100 million persons returning one or 
another of 43 different language names. Where 
appropriate and where information is available, these 
are separately mapped. 


Table 6.6 Languages of India listed in the 1961 census, 
but subsumed under Hindi in 1981 and 1991 


Language*          Speakers      States where spoken 
BIHARI (34)        16,806,772    Bihar 
Bhojpuri           7,964,755     Bihar 
Bihari             19,040        Assam/Madhya Pradesh 
Maithili           4,984,811     Bihar 
RAJASTHANI (73)    14,933,016    Rajasthan 
Harauti            560,934       Rajasthan 
Malvi              1,142,478     Madhya Pradesh 
Marwari            6,242,449     Rajasthan 
Mewar              1,853,769     Rajasthan 
[row illegible] 
[illegible]        309,967       [illegible] 

* Note: Capitals indicate GROUPINGS (with number of languages) 


The maps in this section show that a number of 
areas of the country are bilingual or multilingual. 
Except by the use of maps of a much larger scale, 
however, the extent of this regional multilingualism 
cannot be shown in full detail and with total accuracy. 
In remote mountain areas, for instance, each of a 
group of geographically close villages may have its 
own language. Large conurbations often produce a 
different problem in that they may have a linguistic- 
ally mixed population, while on a map the only lan- 
guage indicated will be the dominant language of the 
region in which they are situated. Figures 6.3-6.8, 
based on statistics from the 1991 census, show the 
numerically important languages in six major cities 
or city areas of India. Even this sort of representation 
does not show the full extent of the linguistic mix, in 
that ten other languages in Delhi, for example, regis- 
ter speakers numbering in excess of 10,000. Fifteen 
years after the census, Bombay, Calcutta and Madras 
are more commonly referred to as Mumbai, Kolkata 
and Chennai. 
Figure 6.3 Principal languages of Delhi: speakers as 
percentage of population, 1991 


[Pie chart legend: Marathi, Hindi, Gujarati, Urdu, Tamil, Konkani, Kannada, others (one entry illegible); segment values as printed: 9.37%, 2.01%, 2.44%, 2.83%, 41.66%, 12.08%, 13.89%, 15.72%] 


Figure 6.4 Principal languages of Greater Bombay: 
speakers as percentage of population, 1991 
Figure 6.5 Principal languages of Calcutta: speakers as 
percentage of population, 1991 


6.2.5 Nepal 


Nepal’s 2001 population census provides more 
information on the language situation than earlier 
ones, listing a total of 92 languages spoken by the 
population of 22,736,934. There are in addition 
several ‘unknown’ languages, for the identification 
of which research is clearly needed. The 64 languages 
with more than 1,000 speakers are listed in Table 6.7. 
The others, each representing less than 0.01 per cent 
of the population, are (with their language family): 
>100 speakers — Mewahang (T-B), Sanskrit (I-A), 
Kaike (T-B), Raute (T-B), Kisan (Dr), Churauti 
(T-B), Baram/Maramu (T-B), Tilung (T-B), Jero/ 
Jerung (T-B), Dungmali (T-B), Oriya (I-A); >10 
speakers — Linghkim (T-B), Kusunda (T-B), Sindhi 
(I-A), Munda (A-A), Haryanvi (I-A), Magahi (I-A), 
Sam (T-B), Kurmali (I-A); and <10 — Kagate (T-B), 
Dzonkha (T-B), Kuki (T-B), Chhingtang (T-B), Mizo 
(T-B), Nagamese (I-A), Lhomi (T-B), Assamese 
(I-A), Sadhani (I-A). From the listing it will be 
apparent that four language families are repre- 
sented: Indo-European (mainly Indo-Aryan), Sino- 
Tibetan (mainly Tibeto-Burman), Austro-Asiatic and 
Dravidian. 

Though this most recent census avowedly does 
not succeed in recording all the languages spoken in 
Nepal, it is considerably more informative than earlier 
ones. In the 1991 census, for instance, only 32 lan- 
guages were listed. It seems that this may be partly 
ascribable to an increasing awareness of their separate 
identity among ethnic minorities (see Yadava 2003: 
140), but there are other reasons. Angika and Bajjika, 
for example, formerly taken as dialects of Maithili 
and Bhojpuri respectively, are now listed as separate 
languages. More significant in terms of the number 
of languages named was the decision in 2001 to list 
separately 25 East Himalayish languages that earlier 
censuses had grouped together as ‘Rai Kiranti’. 
Figure 6.6 Principal languages of Hyderabad District: 
speakers as percentage of population, 1991 
Figure 6.7 Principal languages of Madras: speakers as 
percentage of population, 1991 


[Pie chart legend: Kannada, Tamil, Telugu, Urdu, Malayalam, Hindi, Marathi, others; legible segment values: 38.38%, 16.66%, 21.38%] 


Figure 6.8 Principal languages of Bangalore: speakers 
as percentage of population, 1991 


Nepali, in terms of number of speakers and in 
other respects the major language of Nepal, is not the 
language with the longest history in the country. It 
came into the region with Rajput immigrants a mil- 
lennium or so ago. Over the centuries its speakers 
spread to the east, with the result that in some areas it 
is now the only language spoken and throughout 
most of the country it serves as a lingua franca. The 
partial exception to this is Hindi, which is used as a 
lingua franca among the groups living in the Terai 
region whose mother tongue is another central Indo- 
Aryan language (Avadhi, Bhojpuri, Maithili, Rajban- 
shi). As the rastrabhasa or ‘language of the nation’, 
Nepali is the only language of government and the 
law, and the official medium of education. Recogni- 
tion has been given to Nepal’s other languages as 
rastriya bhasa or ‘national languages’, but Nepali 
remains dominant. Some of the languages with small 
numbers of speakers appear to be moving towards 
extinction. 


6.2.6 Sri Lanka 


The main linguistic division in Sri Lanka is between 
Sinhalese (Sinhala) and Tamil, and the speakers of 
these languages make up more than 99 per cent of the 
population. Both are recognized as national lan- 
guages, but Sinhalese has the additional status of 
official language. English is widely used in govern- 
ment and administration. Other languages spoken are 
Indo-Portuguese, a Portuguese-based creole spoken 
at home by some Burghers, descendants of colonisers 
and Ceylonese (the number of speakers in 1971 was 
estimated at 2,250); Sri Lankan Creole 
Malay, spoken by most members of the Malay com- 
munity, though there is a decreasing fluency among 
the younger generation in some areas; Veddah (a 
member of the Sinhalese/Maldivian subgroup of 


Table 6.7 Languages of Nepal, 2001 


Language (family) Speakers Percentage 
Nepali (I-A) 11,053,255 48.98 
Maithili (I-A) 2,797,582 12.40 
Bhojpuri (I-A) 1,712,536 7.59 
Tharu (Dagaura/Rana) (I-A) 1,331,546 5.90 
Tamang (T-B) 1,179,145 5.22 
Newar (T-B) 825,458 3.66 
Magar (T-B) 770,116 3.41 
Awadhi (I-A) 560,744 2.48 
Bantawa (T-B) 371,056 1.64 
Gurung (T-B) 338,925 1.50 
Limbu (T-B) 333,633 1.48 
Bajjika (I-A) 237,947 1.05 
Urdu (I-A) 174,840 0.77 
Rajbansi (I-A) 129,883 0.58 
Sherpa (T-B) 129,771 0.58 
Hindi (I-A) 105,765 0.47 
Chamling (T-B) 44,093 0.20 
Santhali (A-A) 40,193 0.18 
Chepang (T-B) 36,807 0.16 
Danuwar (I-A) 31,849 0.14 
Dhangar/Jhangar (Dr) 28,615 0.13 
Sunuwar (T-B) 26,611 0.12 
Bangla (I-A) 23,602 0.10 
Marwari (Rajasthani) (I-A) 22,637 0.10 
Majhi (I-A) 21,841 0.10 
Thami (I-A) 18,991 0.08 
Kulung (T-B) 18,686 0.08 
Dhimal (T-B) 17,308 0.08 
Angika (I-A) 15,892 0.07 
Yakkha (T-B) 14,648 0.06 
Thulung (T-B) 14,034 0.06 
Sangpang (T-B) 10,810 0.05 
Bhujel/Khawas (T-B) 10,733 0.05 
Darai (I-A) 10,210 0.05 
Khaling (T-B) 9,228 0.04 
Kumal (I-A) 6,533 0.03 
Thakali (T-B) 6,441 0.03 
Chhantyal (T-B) 5,912 0.03 
Sanketic (Nepal Sign) 5,743 0.03 
Tibetan (S-T) 5,277 0.02 
Dumi (T-B) 5,271 0.02 
Jirel (T-B) 4,919 0.02 
Wambule/Umbule (T-B) 4,471 0.02 
Puma (T-B) 4,310 0.02 
Yholmo (T-B) 3,986 0.02 
Nachhiring (T-B) 3,553 0.02 
Dura (T-B) 3,397 0.02 
Meche (T-B) 3,301 0.01 
Pahari (I-A) 2,995 0.01 
Lepcha (T-B) 2,826 0.01 
Bote (T-B) 2,823 0.01 
Bahing (T-B) 2,765 0.01 
Koi/Koyi (T-B) 2,641 0.01 
Raji (T-B) 2,413 0.01 
Hayu (T-B) 1,743 0.01 
Byangshi (T-B) 1,734 0.01 
Yamphu/Yamphe (T-B) 1,722 0.01 
Ghale (T-B) 1,649 0.01 
Khadiya (I-A) 1,575 0.01 
Chhiling (T-B) 1,314 0.01 
Lohorong (T-B) 1,207 0.01 
Punjabi (I-A) 1,165 0.01 
Chinese (S-T) 1,101 0.01 
English (I-E) 1,037 0.01 
Unidentified 168,340 0.75 


Indo-Aryan), spoken by a small and diminishing 
number of aboriginal people (411 in the 1963 census) 
living in the mountainous areas of Badulla and Pol- 
onnaruwa Districts; and English, spoken by an esti- 
mated 100,000 among the Burgher, Eurasian and 
European populations. 

Official classification is by ethnic group, and this 
has varied slightly from one census to the next. Up 
until the 1971 census there was a separate listing of 
Indian Moors. Eurasians (grouped with Burghers in 
earlier censuses), Europeans and Veddahs are no 
longer treated as separate groups, on the grounds that 
they are no longer significant as distinct ethnic 
groups in Sri Lankan society. The most recent census 
was conducted in 2001. However, because of the 
ethnic/linguistic conflict a full enumeration was not 
possible in eastern and northern parts of the country, 
and no population figures are available for the Jaffna, 
Mannar, Vavuniya, Mullaitivu, Kilinochchi, Batticaloa 
and Trincomalee Districts. Table 6.8 gives the figures 
for the remainder of the country in the 2001 census 
alongside the more comprehensive set of figures for 
1981. The relative proportion of the total for the dif- 
ferent communities can be expected to be different 
from either; accurate figures for the districts listed 
above would increase the proportions both of Sri 
Lanka Tamils and of Sri Lanka Moors (Muslims) in 
2001; on the other hand there has without doubt 
been a reduction of the number of Sri Lanka Tamils 
through emigration and other causes during the 
period of the emergency, and the percentage of this 
group in the total can be assumed to be less than in 
1981. A resolution on the situation in Sri Lanka put 
before the European Parliament on 17 May 2006, for 
instance, stated that ‘the conflict ... has killed over 
60,000 people [and] displaced one million’. The total 
population of the country grew from 14,846,750 in 
1981 to an estimated 18,797,257 in 2001. 


Table 6.8 Communities of Sri Lanka: 1981 and 2001 
censuses 


Community 1981 2001 
Sinhalese 10,979,568 13,865,245 
Sri Lanka Tamil 1,886,864 631,949 
Indian Tamil 818,656 855,025 
Sri Lanka Moor 1,046,927 1,359,331 
Burgher 39,374 75,283 
Malay 46,963 54,784 
Sri Lanka Chetty — 10,807 
Bharata — 2,165 
Other 29,398 23,920 


The language of the first group in Table 6.8 is 
Sinhalese; that of the following three groups is Tamil. 
Sri Lanka Chetty and Bharata are new categories 
introduced in 2001. The former are descendants of 
Chetty merchants who travelled to Ceylon from 
Nagapatnam in India from the mid-seventeenth century 
onwards and are therefore mainly Tamil speakers. 
The Bharatas, who have a history of migration from 
India since the time of Portuguese rule, are also 
predominantly of Tamil origin. At the time of the 1981 
census, residents listed as not holding Sri Lankan citi- 
zenship totalled 635,150 out of 14,846,750, and of 
these, 613,927 are stated as holding Indian citizen- 
ship. Most of this group can be assumed to be Tamil 
speakers. 


6.2.7 Bhutan 


The most recent population census in Bhutan was 
conducted at the end of May 2005. Until the results 
become known, it is impossible to be sure of the likely 
size of the country’s population, and even less of the 
numbers of speakers of Bhutan’s languages. The cen- 
sus of 1969 put the total population at 930,614, that 
of 1991 at 1,375,400. Current external estimates go as 
high as 2.2 million, while others suggest a figure less 
than one million. The documentation of the language 
situation is little better. One can, however, confidently 
say that most of Bhutan’s languages are Tibeto- 
Burman. The principal exception is Nepali, widely 
spoken in the south of the country and used to some 
extent elsewhere as a lingua franca. The national lan- 
guage is Dzongkha; like all languages spoken there it 
is a minority language, but probably the one with the 
largest number of mother-tongue speakers. It is also 
the dominant lingua franca in most areas. Its regional 
location as far as first-language speakers are con- 
cerned is in the west. Its importance as a medium of 
instruction in schools is increasing. English plays an 
important part in education at the secondary school 
and higher levels. In areas where they are predomin- 
ant, Bumthang and Tshangla (Sharchop) are used as 
media of instruction in primary schools. Table 6.9, 
from the First Linguistic Survey of Bhutan (van 
Driem 1991 as reported in van Driem 2001: 871) 


gives estimates of numbers of speakers of Bhutan’s 
principal languages, all Bodish apart from Nepali. 


Table 6.9 Languages of Bhutan, 1991 


Language Speakers 
Dzongkha 160,000 
Nepali 156,000 
Tshangla 138,000 
Khengkha 40,000 
Bumthang 30,000 
Chocangacakha 20,000 
Dzalakha 15,000 
Kurtopkha 10,000 
Nyenkha 10,000 
Lakha 8,000 
Brokpa 5,000 
Lhokpu 2,500 
Gongduk 2,000 
Lepcha 2,000 
Dakpakha 1,000 
Bhokha (Tibetan) 1,000 


6.2.8 Bangladesh 


In most of the administrative units of Bangladesh, 
whether at the level of the six divisions or at that of 
the 64 zilas into which these divisions are subdivided, 
the vast majority (more than 99 per cent) of the 
country’s inhabitants are Bengali (Bangla) speakers, 
and Bengali is therefore, quite naturally, the national 
language. The exceptions to the regional dominance 
of Bengali are those units where there is an above- 
average concentration of members of tribal groups, 
for which the country-wide total in 1991 was 
1,205,978, out of a total population of 111,455,185. 
The total population of the country at the time of the 
census of February 2001, adjusted to take account of 
the estimated undercount, is given as 130,029,749. 
Though one of the declared objectives of the census 
was to obtain basic information on the tribal popula- 
tion, no details of this had been made available in 
mid-2006. The most recent data, therefore, are those 
provided by the 1991 census. On that occasion about 
half the population was recorded as tribal in three 
zilas, all of them in the hilly areas in the east of 
Chittagong Division (see Figure 6.9): Bandarban 
47.27 per cent; Khagrachhari 47.51 per cent; and 
Rangamati 53.22 per cent. Elsewhere, the percentage 
was less than 4, and generally less than 1. 

The statistics provided by the five censuses held 
between 1961 and 1991 vary rather disconcertingly. 
Moreover, since the numbers recorded relate to tribes, 
they are sometimes an imperfect guide to numbers of 
speakers of different languages. With that reservation, 
approximate numbers of speakers for the languages 
of Bangladesh are given in Table 6.10. Where possible, 
figures are abstracted from the 1991 census; others 
(with the applicable year indicated) are from Gordon 
(2005). 

The variety of Bengali used in the modern 
written language — in both Bangladesh and the Indian 
state of West Bengal — is based on the colloquial 
variety that had developed in Calcutta in the early 
part of the twentieth century, and this has become the 
standard spoken form too. There are, however, con- 
siderable dialect differences and some dialects of 
Bangladesh (e.g. those spoken in Sylhet and in 
Chittagong) are so far removed from the standard as 
to be barely comprehensible to speakers from other 
parts; that is to say that there is a case for considering 
them to be separate languages. Along with standard 
Bengali, English is widely used in administration and 
in higher levels of education. 


6.2.9 Maldives 


The Republic of the Maldives is essentially a mono- 
lingual state, the majority of the population (270,101 
in the census of 2000) being speakers of Maldivian 
(Divehi, Dhivehi), an Indo-Aryan language closely 
related to, but distinct from, Sinhalese, and the official 
Figure 6.9 Bangladesh: administrative divisions 


Table 6.10 Principal languages of Bangladesh, 1991 
census 


Language                Speakers       Zila(s) where spoken 
Bengali (I-A)           110,000,000 
Chittagonian (I-A)      14,000,000     Chittagong (1998) 
Sylheti (I-A)           5,000,000      Sylhet (2001) 
Arakanese (T-B)         200,000        Bhola/Chittagong/Bandarban (2001) 
Chakma (I-A)            252,858        Khagrachhari/Rangamati 
Saontal (A-A)           202,162        Dinajpur/Naogaon/Rajshahi/Rangpur 
Marma (T-B)             157,301        Bandarban/Rangamati 
Garo (T-B)              102,000        Sherpur/Mymensingh/Netrakona (1993) 
Bishnupriya (I-A)       40,000         Habiganj (2003) 
Manipuri (T-B)          24,882         Maulvibazar 
Tanchaynga (I-A)        21,639         Rangamati 
Koch (T-B)              16,567         Tangail 
Bum (Bawm) (T-B)        13,471         Bandarban/Joypurhat 
Khasia (A-A)            12,280         Maulvibazar 
Rajbongshi (I-A)        5,444          Joypurhat/Jessore 
Pangkhu (T-B)           3,227          Rangamati 
Khyang (T-B)            2,343          Bandarban 
Munda (A-A)             2,132          Khulna/Satkhira 
Chak (?)                2,127          Bandarban 
Tipra (Tippera) (I-A)   1,242          Comilla 
Khumi (T-B)             1,241          Bandarban 
Lushai (T-B)            662            Bandarban/Rangamati 
Mro (T-B)               126            Rangamati 


language of the country. Of the 1,201 islands that 
make up the Maldivian archipelago, only 202 are 
inhabited. The most populous (62,519) is Male, 
which lies approximately at the centre of the 
820-kilometre-long (510-mile) chain. In Male one small 
distinct community, descended from Indian traders 
who entered the Maldives in the 1860s, speaks a dia- 
lect of Hindi. 


6.2.10 Seychelles 


At the time of the 2002 census the Republic of the 
Seychelles, which comprises more than 100 islands in 
the western Indian Ocean, had a population of 
75,876. The vast majority (91.8 per cent) speak 
Seychelles Creole French. Since 1981 Creole has been 
regarded as the country’s first language, with English 
and French in second and third positions respectively. 
In 1992 Creole became the medium of instruction for 
the first five years of school, with English taking its 
place in later years. 


6.2.11 Comoros and Mayotte 


The Federal Islamic Republic of the Comoros (popu- 
lation of 590,151 at the time of the 2003 census) 
and the island of Mayotte (an overseas collectivité 
territoriale of France, but claimed by the Comoros; 
population 186,026 according to Gordon 2005) 
together make up the Comoro archipelago, which 
comprises four islands situated in the northern part 
of the channel lying between Mozambique and the 
island of Madagascar. The most widely spoken lan- 
guage in the Comoros is Comorian (228,896). This is 
related to the Swahili spoken on the mainland, but 
is not mutually intelligible with it. There are clear 
dialect differences between the varieties spoken on 
the four islands. Shingazidja, spoken on Nijazidja 
(known under French rule as Grande Comore) stands 
somewhat apart from the rest. The main dialects 
which have been identified in this remaining group 
are Shimwali, spoken on Mwali (formerly Mohéli), 
Shinzwani, spoken on Zwani (formerly Anjouan), 
and Shimaore, spoken on Mayotte. About half the 
inhabitants of Mayotte speak Maore, a variety of 
Comorian. There are an estimated 39,000 speakers of 
Bushi, a dialect of Malagasy. The official language in 
Mayotte is French, of which there are 2,500 native 
speakers. The Republic of the Comoros has two 
official languages, Arabic and French. 


6.2.12 Mauritius 


The country of Mauritius is made up of the island of 
Mauritius, the island of Rodrigues and a number of 
very small islands. The vast majority (almost 97 per 
cent) of the population of 1,143,069 (2000 census) 
live on the main island, with 3 per cent on Rodrigues 
and less than 100 individuals elsewhere. Known 
locally as the ‘rainbow state’ because of the varied 
origins of its inhabitants, Mauritius is home to a 
range of languages which reflects these origins. At 
the time of the census, 43.71 per cent claimed to have 
an Indian language as their ancestral tongue, with 
those claiming Creole origin coming next at 36.78 per 
cent. Among those of Indian origin, those from the 
Bhojpuri parts of India form the largest single group 
(31.60 per cent). The languages spoken at home, 
however, show a rather different pattern from that of 
ancestral languages, as is apparent from Table 6.11, 
where the percentage recorded as speaking Indian 
languages, for example, totals only 13.93. 


Table 6.11 Languages usually spoken at home by 
Mauritians, 2000 census 


Language Speakers Percentage 
Creole 791,465 69.24 
Bhojpuri 142,385 12.46 
French 39,827 3.44 
Chinese languages 8,736 0.76 
Hindi 7,245 0.63 
Tamil 3,622 0.32 
English 3,505 0.31 
Telugu 2,169 0.19 
Marathi 1,888 0.17 
Urdu 1,789 0.16 
others (or not stated) 140,438 12.29 


The figures in Table 6.11 indicate a process by 
which most ancestral languages are being replaced by 
Creole (see Rajah-Carrim 2005, from which statistics 
given here are taken), a process which is more rapid 
in urban than in rural areas (Stein 1981: 282). The 
effect is to make Creole the country’s most widely 
used lingua franca, even though English is the official 
language and is used as the medium of instruction. 


6.2.13 Reunion 


The island of Reunion, an Overseas Department of 
France, has a population (standing at 706,300 at the 
time of the 1999 census) of mixed origin — European, 
African and Chinese. The official language is French, 
but Reunion Creole French, the mother tongue of 
some 85 per cent of the population, is the most 
widely used language. Tamil is the mother tongue of 
10 per cent of the population and French of 5 per 
cent. 


6.2.14 Cocos Islands/Christmas Island 


Of the 27 islands that make up the Australian 
external Territory of the Cocos Islands, only two are 
inhabited. The majority of the 550 or so inhabitants 
speak Cocos Malay. This is also the language of 
about a quarter of the employees of the Phosphate 
Mining Company on Christmas Island, which has 
no permanent inhabitants. Rather more than half 
the workers there speak one or another Chinese 
dialect. 


6.2.15 Chagos Islands 


The Chagos Archipelago, formerly part of the British 
colony of Mauritius, became a separate overseas terri- 
tory of the UK — the British Indian Ocean Territory — 
in 1965, three years before Mauritius was granted independence. 
At that time the population of the inhabited islands 
(principally the island comprising more than two- 
thirds of the land surface of the group, namely Diego 
Garcia) consisted of some 2,000 people of mainly 
African and South Asian extraction, known collect- 
ively as Chagossians or Ilois, whose language was a 
French-based creole. In the 1970s, following an 
agreement to establish a joint US-UK military facility 
on the islands, all the indigenous inhabitants were 
expelled, to be resettled in Mauritius and the Sey- 
chelles. The language of the present population of 
Diego Garcia, consisting of some 4,000 military per- 
sonnel and civilian contractors (some of the latter 
from the Philippines and Mauritius), is English. 
Throughout the period of their exile, the Chagossians 
have maintained their right of return. In May 2006, 
two High Court judges sitting in London held that 
orders to prevent their return were irrational and 
unlawful. 


6.3 Language families spoken in the area 


6.3.1 Turkic 


Turkic languages (more widespread in the regions 
covered by Section 7 of this atlas) are spoken in the 
northern parts of Iran and Afghanistan. Eastern Tur- 
kic (e.g. Uzbek), Southern Turkic (e.g. Azerbaijani, 
Khalaj, Qashqai, Turkmen), Kipchak (Karakalpak, 
Kazakh) and Northern Turkic (e.g. Kirghiz) are repre- 
sented. Typologically, these languages are not far 
apart one from the other, sharing (among other char- 
acteristics) the phonological feature of vowel har- 
mony, the morphological feature of agglutination and 
the syntactic feature of verb-final sentence pattern. 


6.3.2 Indo-Iranian 


Indo-European is represented in southern Asia by 
one of its major branches — Indo-Iranian. There has 
been some disagreement about whether this branch is 
made up of two or of three sub-branches, that is to 
say whether the Nuristani languages form a third sub- 
branch along with Iranian and Indo-Aryan, or 
whether they may be satisfactorily grouped within 
Indo-Aryan. The view that they form a third sub- 
branch is, however, the more widely accepted one, 
and that is the position adopted here. Historically, the 
split between the two main branches of Indo-Iranian 
probably took place not later than 2000 BC. 


6.3.2.1 Iranian 


The earliest records of Iranian go back to Avestan, the 
language of the sacred texts of Zoroastrianism, which 
dates back to the seventh to fifth centuries BC, and to 
Old Persian, the language of the Achaemenid kingdoms 
of Persia (sixth to fourth centuries BC). There 
are reasons to believe that the geographical spread of 
Iranian languages was greater in ancient times than is 
the case now. Middle Iranian — approximately third 
century BC to ninth century AD — comprises four 
groups: South-West, represented by Middle Persian, 
the direct ancestor of modern Persian; North-West, 
represented by Parthian, which belongs to the region 
south-east of the Caspian Sea; North-East, repre- 
sented by Sogdian, spoken over a large area centred 
on Samarkand, and Khwarazmian, spoken in part of 
what is now Uzbekistan; and South-East, represented 
by Saka, spoken in Khotan in what is now the western 
Chinese province of Sinkiang, and Bactrian. 

The home of modern Iranian languages stretches 
from Turkey eastward to China. In the countries pre- 
sented in this section, they predominate in Iran and 
Afghanistan, and are widely spoken in Pakistan. They 
include members of the South-West Iranian (Persian 
(Farsi), Tati, Luri, Tajiki, Kumzari, Dari, Aimaq and 
Hazaragi), North-West Iranian (Kurdish, Kurmanji, 
Gilaki, Takestani, Galeshi, Mazanderani, Semnani, 
Gurani, Ashtiani, Vafsi, Bashkardi, Baluchi, Parachi, 
Malakhel, Ormuri) and South-East Iranian (Pashto, 
Waneci, Sistani, Shugni, Ishkashimi-Sanglechi, Munji, 
Wakhi, Yidgha) groups. Other representatives of these 
groups and of North-East Iranian are found in Tur- 
key, Russia, Georgia, Turkmenistan, Uzbekistan, 
Kyrgyzstan, Tajikistan, and Azerbaijan (see Section 5). 

Among the features that separate Iranian from 
Indo-Aryan is the absence from the former of the set 
of voiced aspirates (bh, dh, ḍh, jh, gh) in the plosive 
system, which are a characteristic of the latter. The 
impact of Arabic and Turkish loan words in Iranian 
languages has resulted in a number of phonological 
changes, which have moved them further from their 
Indo-Iranian base. Word order in sentences in Iranian 
is typically verb-final. Persian, Dari and Pashto are all 
written in the Arabic script. 


6.3.2.2 Nuristani 


Nuristani languages are spoken principally in the 
Nuristan region of Afghanistan, though there are 
some speakers in neighbouring areas of Pakistan. 
These languages — Kati, Bashgali, Prasuni, Waigali, 
Tregami (Gambiri) and Ashkun — were earlier known 
collectively as Kafiri. 


6.3.2.3 Indo-Aryan 


Indo-Aryan (or Indic) is by far the largest language 
group in this area, both in terms of the number of 
languages and the number of speakers — the latter 
approaching 20 per cent of the world’s population. 
They are the numerically dominant languages in 
Pakistan, India, Nepal, Sri Lanka, the Maldives and 
Bangladesh. 

The earliest records of Indo-Aryan are the Vedic 
texts, of which the most ancient, the Rigveda, dates 
from the middle of the second millennium BC. This is 
also the time when the Indo-Aryans are believed to 
have moved from the Afghanistan region into what is 
now Pakistan. Over the centuries they moved southward 
and eastward, and during the period 600 BC to 
AD 1000 — known as the age of Middle Indo-Aryan — 
dialects became differentiated into the various 
Prakrits that were the ancestors of the New Indo- 
Aryan languages. 

The classification of these New Indo-Aryan 
languages is a matter of some controversy (the vari- 
ous elements of which are succinctly presented by 
Masica 1991: 446-63). Grierson’s view that the 
Dardic languages (north-west India and north-east 
Pakistan) formed a separate branch of Indo-Iranian, 
midway between Iranian and Indo-Aryan, is no 
longer accepted. On the matter of the precise genetic 
relationship of the members of the Indo-Aryan group 
of languages, however, there is no clear consensus, for 
the criteria for setting up an unambiguous family tree 
are often in conflict. In part, this may be because the 
languages were spoken over a large area within which 
there were no major natural geographical barriers to 
prevent population movements; so different lan- 
guages were in contact and influenced each other’s 
structure. A further effect of this circumstance was 
the existence of dialect continua, which even now 
make it difficult to identify the places where one lan- 
guage ends and another begins. The labels used in the 
keys to the maps that follow — North-western, North- 
ern, Central, Eastern, South-western and Southern — 
are common to a number of analyses, but the precise 
membership of each will be found to vary from one 
to another. 

One distinguishing characteristic of Indo-Aryan 
in comparison with other Indo-Iranian languages is 
the set of retroflex consonants in the phonological 
inventory of the majority, a feature which is taken to 
result from the influence of contact with Dravidian 
languages. Common to many Indo-Aryan languages 
are ergative constructions, a feature not shared with 
Dravidian. Common to both families is SOV 
(subject–object–verb) word order. 

The major Indo-Aryan languages, namely those 
with a long literary tradition, each have their own 
individual writing system. With few exceptions, these 
have their origin in the Brahmi script used in the 
Asokan inscriptions. The main exception is Urdu, 
written in the Arabic script, which is also used for 
Sindhi and sometimes for Punjabi. The script used 
for Maldivian, which is unique, has Arabic elements 
mixed with others. Though they have the same origin 
and though they are self-evidently formed on the 
same principles, the Brahmi-based scripts are not 
necessarily intelligible to users of a related variety. 
The main north Indian scripts used for Indo-Aryan 
languages are Devanagari (used with minor vari- 
ations for Hindi, Marathi and Nepali, as well as for 
the classical Sanskrit), Gujarati, Gurmukhi (used for 
Punjabi), Bengali (also used for Assamese and 
Manipuri), Oriya and Sinhalese. 

One Indo-Aryan speech variety that falls out- 
side the general genetic structure of the family is 
Nagamese or Naga Pidgin. This pidgin, based on 
Assamese, is used by speakers of the various Naga 
languages to communicate with outsiders with whom 
they come into contact and has a history of perhaps 
two centuries. As Map 51 shows, it is spoken 
throughout Nagaland, and in parts of Manipur and 
Arunachal Pradesh. 


6.3.3 Dravidian 


Though the Dravidian languages are concentrated in 
the southern part of the Indian subcontinent, the 
presence of pockets of Dravidian speech (in Pakistan 
well to the north-west, in Nepal to the north, and in 
Bihar and Bengal to the north-east) is an indication 
that this language family, which has not been 
shown with absolute certainty to be related to any 
other family, once covered a much more extensive 
area. Interestingly, Dravidian languages share several 
typological features, both in their agglutinating 
morphology and in their SOV word order, with 
Turkic languages, Japanese, Korean and a number of 
other languages and language families, and this has 
sometimes been taken as evidence of a common 
origin. 

During the centuries of contact between Dravid- 
ian and Indo-Aryan languages there has been much 
mutual influence and words have been borrowed in 
both directions. In most Dravidian languages there is 
a considerable Sanskritic element. In the majority of 
cases this has had a profound influence on the phon- 
ologies of the languages concerned. In the earliest 
known form of Dravidian (and, it is assumed, in 
proto-Dravidian) there are no aspirated stop con- 
sonants, nor is voicing distinctive in stop consonants. 
Sanskrit loan words in three of the four major 
Dravidian languages (Kannada, Malayalam and 
Telugu) retain the four-way distinctions in plosives 
that are found in the source (e.g. for velars, k, kh, g, 
gh). Tamil, on the other hand, has shown a much 
greater tendency to assimilate loans to the native Dra- 
vidian phonological system. 

This fact is reflected in the four different scripts 
used for these languages. The first three have four 
plosive symbols for each of the five points of articula- 
tion; Tamil has only one. The Kannada script is also 
used for Tulu. Islamic texts in Tamil and Malayalam 
have sometimes been printed in a modified Arabic 
script. 

The early records of Dravidian are in Tamil and 
are of two types, both going back to the second century 
BC. These are inscriptions in Brahmi script in 
caves and on rocks, principally in what is now the 
Madurai District of the Indian state of Tamil Nadu, 
and poems of the so-called Sangam (Cankam) 
period. Since, some millennia ago, most of the sub- 
continent was Dravidian-speaking, it could be that 
there are much earlier records of a Dravidian lan- 
guage — in the inscriptions belonging to the ancient 
civilization of Harappa in the Indus valley. However, 
until such time as these inscriptions are convincingly 
and unambiguously deciphered, the question must 
remain open. 

On one matter in respect of Dravidian, there is 
widespread agreement, namely that there are three 
principal branches of the family — North (Kurux, 
Malto and Brahui), Central (Kolami, Naikri, Naiki, 
Parji, Ollari and Gadaba) and Southern, with the last 
of these being further divided into South (Tamil, 
Malayalam, Irula, Kodagu, Kurumba, Toda, Kota, 
Badaga, Kannada, Koraga and Tulu) and South- 
Central (Telugu, Gondi, Konda, Kui, Kuvi, Pengo and 
Manda). A recently discovered member of the South- 
Central group is Awe (or Indi); more information 
about its structure and the precise location of its 
speakers is awaited with interest. 


6.3.4 Austro-Asiatic 


The Austro-Asiatic family of languages, spoken over 
an area stretching from the eastern state of Orissa in 
India to Vietnam and the interior of peninsular 
Malaysia, comprises two branches — Munda and 
Mon-Khmer. The majority of Mon-Khmer languages 
are spoken in countries of South-East Asia. This sub- 
family is therefore discussed in detail in Section 5. 
Two groups of Mon-Khmer languages are spoken in 
parts of Southern Asia. Firstly, there is the northern 
group of Khasian (Khasi, War and Pnar) spoken in 
the Indian state of Meghalaya (see 5.2.9). Secondly, 
there is Nicobarese, a group of languages spoken in 
the Indian territory of the Nicobar Islands. With one 
exception — Shom Peng, spoken by a tribal group liv- 
ing in the central part of Great Nicobar — these form a 
dialect chain from north to south: Car (or Northern 
Nicobarese), Chowra (Chaura), Teressa-Bompoka, 
Nancowry (or Central Nicobarese), and Great (or 
Southern) Nicobarese (see 5.2.8). Numbers of 
speakers of these languages were diminished by the 
tragic event of the tsunami of 26 December 2004. 

The Munda branch of Austro-Asiatic is confined 
almost entirely to South Asian countries, and of these 
principally to India, though there are a few speakers 
in Bangladesh and Nepal. Within India the main con- 
centration of Munda languages is in the eastern states 
of West Bengal, Bihar, Jharkhand, Orissa and Andhra 
Pradesh, with Korku, spoken further west in the bor- 
der area of Madhya Pradesh and Maharashtra, being 
the main exception to this. 

Munda has two sub-branches, North and 
South. North Munda divides into Korku and Kher- 
wari, and South Munda into Koraput and Central 
(Kharia and Juang). Kherwari is further divided into 
Santali and Mundari-Ho (Mundari, Asuri, Bhumij, 
Birhor, Ho, Korwa and Kora). Within Koraput 
Munda there are two groups, consisting of Gutob, 
Rema and Geta on the one hand, and Savara and 
Parengi on the other. An alternative name for Gutob 
is Gadaba, a fact that has caused some confusion in 
census returns because of the use of the same name 
for the Central Dravidian language Kondekor 
(Konekor) Gadaba. 


6.3.5 Tibeto-Burman 


One of two branches of the Sino-Tibetan family, 
Tibeto-Burman, comprises some hundreds of lan- 
guages spoken over a wide area from Tibet southward 
and eastward as far as north-eastern Vietnam. In the 
region covered by this section, Tibeto-Burman lan- 
guages constitute the most numerous set of speech 
varieties in the countries of Nepal and Bhutan, and in 
the Indian states that lie to the north and east of 
Bangladesh. A full account of the whole of the Sino- 
Tibetan family, including Tibeto-Burman, is to be 
found in Section 5: see 5.1.2.1.1 (Bodish), 5.1.2.1.2 
(Himalayan), 5.1.2.2 (Sal — including Bodo-Garo, 
Jinghpaw, Kuki-Chin), 5.1.2.3 (Central Tibeto-Burman) 
and 5.1.2.5 (Burmese-Ngwi). 


6.3.6 Tai-Kadai 


The Thai subgroup of the Kam-Thai group, within 
the larger group Tai-Kadai, is represented in the east- 
ern part of Assam and in Arunachal Pradesh by 
Aiton, Khamti, Khamyang and Phake. Khamyang has 
no more than 50 speakers; of the other three, Aiton 
and Phake have about 2,000 speakers each, whereas 
Khamti has about 70,000 — 11,000 in India, 59,000 in 
Burma. These languages differ from most other Thai 
languages, in that they — particularly Khamti — fre- 
quently exhibit SOV sentences, possibly through the 
influence of the verb-final ordering that is typical of 
most other languages in the area. There is, however, 
much variation. Tai-Kadai — along with Austronesian 
and Miao-Yao — may be part of an even larger group- 
ing, for which the name Austro-Thai has been pro- 
posed. More detailed information on this may be 
found in Section 5. On the South-western Thai lan- 
guages, including Khamti, Phake and Khamyang, see 
section 5.3.1.4. 


6.3.7 Andamanese 


Though most of the present-day inhabitants of the 
Andaman Islands speak one or another of India’s 
‘scheduled’ languages, there are small communities 
speaking the indigenous languages of the islands. 
These languages are members of the Andamanese 
family, which has not been shown convincingly to be 
related to any other family. Andamanese is divided 
into two groups, Great Andamanese and South 
Andamanese, the former being further divided into 
Northern and Central subgroups. Of Great Andama- 
nese, only A-Pucikwar of the Central subgroup sur- 
vives. The three South Andamanese languages are 
Onge (spoken on the island of Little Andaman), 
Jarawa (South and Middle Andamans) and Sentine- 
lese (North Sentinel Island). Numbers of speakers 
reported 25 years ago (D. Bradley, Map 37 in Wurm 
and Hattori 1981) were: 


A-Pucikwar 24 
Onge 106 
Jarawa 250 
Sentinelese 50 


Numbers may well be lower now, particularly in the 
wake of the tsunami of December 2004. Figures for 
ten Great Andamanese languages (including A- 
Pucikwar) taken from the Census of India 1901, that 
is to say before most of them became extinct, are 
given in Radcliffe-Brown (1964: 16). 


6.3.8 Burushaski 


Though affinities with other language families, 
including Indo-European, have been suggested, Buru- 
shaski is generally thought to be a language isolate. 
Spoken in the extreme north of Pakistan, it has two 
quite distinct dialects, one spoken in the Hunza- 
Nager valleys and the other, known as Werchikwar, in 
the Yasin valley. Speakers of the Hunza and Nager 
varieties, between which the differences are relatively 
slight, total about 45,000 and those of Werchikwar 
approximately 15,000. Typologically, Burushaski is an 
SOV language. Burushaski is an unwritten language, 
but many of its speakers are bilingual and about 
20 per cent are literate in Pakistan’s national lan- 
guage, Urdu. 


7.1 Introduction 


The dynamics of the area covered in this section make 
the mapping of languages particularly difficult. 
Firstly, the area has undergone (and in some regions 
is still undergoing) major political change, including 
the break-up of the area’s two ethnically most diverse 
components, the former USSR and the former Yugo- 
slavia. The break-up of the USSR has led in particular 
to the exodus of many ethnic Russians from the new 
Central Asian countries. At the time of writing (early 
2006), there are regions that are de facto outside the 
control of the political entities of which they are de 
jure components: Abkhazia and South Ossetia with 
respect to Georgia; Kosovo with respect to Serbia and 
Montenegro; Transdnestria with respect to Moldova; 
Karabagh and adjacent areas with respect to Azerbai- 
jan. Secondly, the area has seen a number of recent 
(and continuing) population and language shifts that 
often result in a number of languages being spoken in 
the same area. This is especially true of Russia, where 
indigenous languages are often being gradually — or 
not so gradually — replaced by Russian, in part as Rus- 
sians have moved into new areas, in part as the 
indigenous inhabitants of such areas have shifted to 
Russian. In addition, there are several other areas 
where different languages co-exist without any clear 
geographical demarcation. As indicated below, a 
number of compromises have had to be adopted in 
mapping such areas. 

Statistical sources concerning numbers of 
speakers are in general good for this area, including 
the former USSR and the former Yugoslavia, and the 
development of more open attitudes to ethnic minor- 
ities has led to considerable improvement in the stat- 
istics available for much of central and eastern 
Europe. However, relevant census data for Turkey are 
now 40 years out of date, and post-Soviet data are 
absent for some parts of the former USSR, for Serbia 
and Montenegro, and for Bosnia-Herzegovina. In 
all cases, the source of statistical data is indicated. 
Gordon (2005) is the standard reference work for the 
world as a whole; even though its statistics are some- 
times out of date and sources are not always cited, the 
work is updated periodically. 

A basic source for the maps has been Bruk and 
Apencenko (1964). It should be borne in mind, how- 
ever, that theirs is an ethnic, rather than a linguistic, 
atlas and that certain changes have therefore had 
to be made to bring its information into line with 
linguistic criteria. My main aim in the maps in this 
section has been to indicate which language is the 
majority language in a particular area. Occasionally, it 
has been necessary to indicate that two languages are 
widely spoken in an area without clear preponder- 
ance of one or the other. Small areas with exceptional 
language distribution are referred to in greater detail 
in the text accompanying the maps. For several 
indigenous languages of Russia that are under con- 
siderable pressure from Russian, I have adopted a 
somewhat historical perspective, indicating the area 
occupied by the corresponding ethnic group and 
within which the language is spoken (though not 
everywhere by the majority of the population); such 
languages are marked with an asterisk in the key. 
Some languages are everywhere a minority language 
within the area mapped; they are not represented on 
the maps, but information on geographic distribution 
is included in the text. 

The language families well established within this 
area — that is, excluding languages that are essentially 
extensions of languages or language families spoken 
primarily outside the area covered — are classified here 
into five groupings: the Indo-European phylum, the 
Uralic phylum, the Altaic phylum, the Caucasian lan- 
guages, and the small families and isolates. In general, 
a conservative genetic classification has been followed: 
only those genetic units recognized by most special- 
ists in the language concerned have been allowed. 


7.1.1 Indo-European 


Languages of the Indo-European phylum are domin- 
ant in the western part of the area and, as a result of 
the spread of Russian, in Siberia. Within Indo- 
European, all Slavonic languages other than Upper 
and Lower Sorbian have their focus in this area, as do 
the Baltic languages, the single-language Indo- 
European branch Armenian, and a large part of the 
Iranian languages. The Baltic and Slavonic languages 
are often grouped together as Balto-Slavonic, while 
the Iranian languages are to be grouped together with 
the Indic (Indo-Aryan) languages as Indo-Iranian. 


7.1.2 Uralic 


Languages of the Uralic phylum are found mainly in 
European Russia and the northern Baltic, with some 
overspill into adjacent parts of Asia, though Hun- 
garian is found in central Europe and Finnish outside 
the area mapped. The genetic classification of the 
Finno-Ugric languages given in the key to the maps is 
somewhat simplified: the subgroups are listed in 
order of progressively greater genetic distance from 
Balto-Finnic. The Mordvin languages and Cheremis 
are traditionally counted as a single Volgaic group, 
but the unity of this is questionable; the unity of 
Ugric is also the subject of current controversy, with 
Hungarian being perhaps more distant from the Ob- 
Ugric languages Khanty and Mansi than the trad- 
itional classification implies. 


7.1.3 Altaic 


The genetic unity of the Altaic phylum is more con- 
troversial than that of the other phyla listed here, 
although the genetic unity of each group within 
Altaic (Turkic, Mongolic, Tungusic) is clear. The 
subclassification of Turkic languages is the subject of 
continuing debate. 


7.1.4 Caucasian 


The languages of the Caucasus that do not belong 
to any of the major families represented elsewhere 
(the so-called Caucasian languages) do not form 
a genetic unit, but fall into three well-defined 
such units: West Caucasian (North-west Caucasian, 
Abkhaz-Adyghe), East Caucasian (North-East Cau- 
casian, Nakh-Daghestanian) and Kartvelian (South 
Caucasian). While the hypothesis of a genetic relation 
between West and East Caucasian as a single North 
Caucasian phylum has some adherents, it is now 
generally acknowledged that Kartvelian is quite 
distinct. 


7.1.5 Isolates 


The small Chukotko-Kamchatkan family in north- 
eastern Siberia is not assigned to any higher-level 
grouping. Individual languages not assigned to any 
higher-level genetic unit are Yukaghir, Ket and 
Nivkh, although the view that Yukaghir is distantly 
genetically related to Uralic is widely held. Inciden- 
tally, the term Paleo-Siberian (Paleo-Asiatic), used to 
refer to languages of Siberia not belonging to Indo- 
European, Uralic or Altaic, is a negative characteriza- 
tion with no implications of overall genetic unity. For 
further details of genetic and typological information, 
reference may be made to Comrie (1981). 


7.2 Statistical information 


7.2.1 Poland, the Czech Republic and 
Slovakia 


Good census data are now available for all three 
countries. 

For Poland, the 2002 census gave the total popu- 
lation of the country as 38,230,080, the vast majority 
(over 98 per cent) being ethnic Poles with Polish as 
their native language. On the basis of the census stat- 
istics relating to ethnic groups, Moskal (2004) esti- 
mated the numbers of speakers of various minority 
languages in Poland; these estimates are reproduced 
in Table 7.1, omitting a couple of varieties that would 
probably be regarded as dialects rather than separate 
languages, but including Kashubian and Ruthenian. 
Of the largest minority languages, Kashubian is 
spoken primarily in the central-northern province of 
Pomerania, German in the south-western provinces 
of Opole and Lower Silesia, and Belarusian in the 
north-eastern province of Podlasie, while Ukrainian 
is more scattered. 

Table 7.2 provides statistics for the native lan- 
guages of the population of the Czech Republic 
according to the 2001 census. The data are taken from 
www.czso.cz/csu/edicniplan.nsf/t/C2003127AF/$File/ 
41140312.pdf, which provides data from the census 
prepared by the Český Statistický Úřad. (The total 
given for “Other”, an expected category missing from 
the official list, has been calculated as the difference 
between the total of the other categories and the 
number given as “Total”.) 
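
The residual calculation described in the parenthesis above can be checked with a minimal Python sketch. The figures are those printed in Table 7.2; the variable names are illustrative assumptions and not part of the census source.

```python
# Figures as printed in Table 7.2 (Czech Republic, 2001 census),
# excluding the calculated "other" row.
reported = {
    "Czech": 9_707_397,
    "Slovak": 208_723,
    "Romany": 23_211,
    "Polish": 50_738,
    "German": 41_328,
    "Russian": 18_746,
    "English": 3_791,
    "unknown": 76_868,
}
total = 10_230_060  # the number given as "Total"

# "Other" is the residual: Total minus the sum of the listed categories.
other = total - sum(reported.values())
print(other)  # 99258
```

The result agrees with the figure of 99,258 entered for "other" in the table.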

Table 7.3 provides comparable statistics for 
Slovakia according to its 2001 census. The data derive 
from http://www.statistics.sk/webdata/slov/scitanie/ 
namj.htm, which provides data from the census 


Table 7.1 Minority languages of Poland (Moskal 2004, 
based on the 2002 census) 


Language No. of speakers 
Kashubian 500,000 
German 300-500,000 
Belarusian 200-300,000 
Ukrainian 200-300,000 
Ruthenian 60-70,000 
Lithuanian 20-25,000 
Romany 15-30,000 
Russian 10-15,000 
Slovak 10-20,000 
Yiddish 8-10,000 
Armenian 5-8,000 
Czech 3,000 
Karaim 200 
Tatar unknown 


Table 7.2 Linguistic composition of the Czech 
Republic, 2001 


Language No. of speakers 
Czech 9,707,397 
Slovak 208,723 
Romany 23,211 
Polish 50,738 
German 41,328 
Russian 18,746 
English 3,791 
other 99,258 
unknown 76,868 
Total 10,230,060 


Table 7.3 Linguistic composition of Slovakia, 2001 


Language No. of speakers 
Slovak 4,512,217 
Hungarian 572,929 
Romany 99,448 
Ruthenian 54,907 
Czech 48,201 
Ukrainian 7,879 
German 6,343 
Polish 2,731 
Serbian/Croatian/Bosnian 1,399 
other 7,345 
unknown 66,056 
Total 5,379,455 


prepared by the Štatistický Úrad Slovenskej Republiky. 
Accompanying maps show some aspects of 
the geographical distribution of the corresponding 
ethnic groups: Hungarians are widespread in south- 
ern Slovakia, though especially in the extreme south- 
west, where they constitute over 50 per cent of the 
population; more Roma live in the east than in the 
west; Czechs are most frequent in the north-west; 
Ukrainians are most concentrated in the far north- 
east. 

Polish, Czech and Slovak, all written in the Latin 
alphabet, are the official languages of Poland, the 
Czech Republic and Slovakia, respectively; Hungarian 
also has some use as a written language in 
Hungarian-speaking parts of Slovakia. 


7.2.2 Lithuania, Latvia and Estonia 


Census data for the Baltic states are overall among the 
best for the area mapped, in particular for Estonia; in 
the case of Latvia and Estonia, a major factor has 
been the desire to document the distribution of the 
large ethnic and linguistic Russian populations. 

For Lithuania, where ethnic Russians constitute 
only about 6.3 per cent of the total population, the 
numbers of speakers of the most widely spoken lan- 
guages (according to the 2001 census) are available 
from the website of the Department of Statistics to 
the Government of the Republic of Lithuania at 
http://www.std.lt/en/pages/view/?id=1763, and relevant 
statistics are reproduced in Table 7.4. Other 
web pages accessible from http://www.std.lt/en/pages/ 
view/?id=1670 give more detailed information on 
the distribution of ethnic groups, showing in par- 
ticular that Russians are concentrated in urban 
areas, and Poles in Vilnius county, indeed constitut- 
ing a majority of the population in some districts 
thereof. 


Table 7.4 Linguistic composition of Lithuania, 2001 


Language No. of speakers 
Lithuanian 2,855,780 
Russian 277,318 
Polish 195,016 
Belarusian 16,028 
Ukrainian 8,185 
Romany 2,062 
Latvian 1,749 
German 963 
other 5,041 
unknown 121,830 
Total 3,483,972 


Comparable data for Latvia from the 2000 census 
provided by the Central Statistical Bureau of Latvia 
are available in Lat 2000 (2002) and reproduced in 
Table 7.5. This same electronic resource provides 
more detailed information on the geographical dis- 
tribution of ethnic groups for administrative districts 
and towns. Thus, Russians outnumber Latvians in 
the districts of Daugavpils and Krāslava, and in the 
following towns: Daugavpils, Rēzekne, Riga, Jūrmala 
and Liepāja. 

For Estonia, very detailed information from 
the 2000 census is available from the website of the 
Statistical Office of Estonia at http://pub.stat.ee/px- 
web.2001/I_Databas/Population_Census/Population_ 
Census.asp. This resource gives figures not only for 
the country as a whole, as reproduced in Table 7.6, 
but also for individual counties and cities (including 
districts within Tallinn). These data show that the 
Russian-speaking population of Estonia is over- 
whelmingly urban, that Russian speakers outnumber 
Estonian speakers in the north-eastern county Ida- 
Viru, including all cities within that county (but only 
rarely in rural areas), and that, while Estonian 
speakers outnumber Russian speakers in Tallinn, 
there are some districts in the city (Lasnamäe, 
Põhja-Tallinn) with the inverse proportion. 

Lithuanian, Latvian and Estonian, all written in 
the Latin alphabet, are the official languages of 
Lithuania, Latvia and Estonia, respectively; Russian 
also has some use as a written language in Russian- 
speaking parts of Latvia and Estonia. Limited regional 
use is also made of the Latgalian variety of Latvian in 
Latvia, and of South Estonian in Estonia. 


Table 7.5 Linguistic composition of Latvia, 2000 


Table 7.6 Linguistic composition of Estonia, 2000 


Language No. of speakers 
Estonian 921,817 
Russian 406,755 
Ukrainian 12,299 
Belarusian 5,197 
Finnish 4,932 
Latvian 1,389 
other 8,474 
unknown 9,189 
Total 1,370,052 


7.2.3 Slovenia, Croatia, Serbia and 
Montenegro, Bosnia-Herzegovina, 
Macedonia and Bulgaria 


Good census data are available for Slovenia, Croatia, 
Macedonia, and Bulgaria — in the case of Bulgaria 
reflecting a more open attitude than during the Cold 
War — but I have not been able to find detailed rele- 
vant data for Serbia and Montenegro or Bosnia- 
Herzegovina. As a substitute for data from the last 
two countries, Table 7.7 reproduces data for Yugosla- 
via from the 1981 census, though it should be borne 
in mind that this includes data from Slovenia and 
Macedonia. In all the tables, I have grouped together 
under the linguistic label “Serbian/Croatian/Bosnian” 
listings for Serbian, Croatian, Bosnian, Montenegrin, 
Serbo-Croat and Croato-Serbian varieties. 

Table 7.8 reproduces data from the Slovenian 
census of 2002 prepared by the Statistični Urad Republike 
Slovenije and available at 
http://www.stat.si/popis2002/en/rezultati_slovenija_prebivalstvo_dz.htm. The 
Slovenian data are unusual in that they list separately 
the languages spoken by monolinguals and the com- 
binations of languages spoken by bilinguals. 

Data from the Croatian census of 2001 are 
reproduced in Table 7.9, taken from www.dzs.hr/Hrv/ 
Popis%202001/popis20001.htm, part of the website 
of the Državni Zavod za Statistiku. In addition to the 
data for the country as a whole, there is also a break- 
down for individual counties, which shows the 
expected concentration of the Italian-speaking popu- 
lation in Istria, though Serbian/Croatian/Bosnian is 
the overwhelmingly dominant language not only of 
the country as a whole but also of each county. The 
published data make no mention of Istro-Rumanian. 
An article in the Zagreb newspaper Večernji List of 
13 January 2002, reproduced on http://www.istrianet. 
org/istria/linguistics/istrorumeno/news/02_0113bura- 
zi-eng.htm, suggests that there are several hundred 
speakers in Istria. 

The data from the Macedonian census of 
2002 reproduced in Table 7.10 were kindly provided 
by Mrs Mira Dojcinovska of the State Statistical 
Office. 

The figures in Table 7.11 from the 2001 census of 
Bulgaria are taken from the website of the National 


Ethnic origin   Population   Corresponding language   Latvian   Russian 
Latvian         1,370,703    1,311,093                          48,242 
Russian         703,243      664,743                  31,141 
Belarusian      97,150       18,265                   6,347     70,717 
Ukrainian       63,644       17,301                   2,309     43,159 
Polish          59,505       11,529                   11,727    34,340 
Lithuanian      33,430       13,187                   14,203    5,437 
Yiddish         10,385       825                      918       8,211 
Romany          8,205        5,637                    1,670     574 
German          3,465        541                      854       1,970 
Tatar           3,168        867                      70        2,162 
Estonian        2,652        720                      1,041     840 
Total           2,377,383    2,050,763 


Table 7.7 Linguistic composition of Yugoslavia (pre-1991 frontiers), 1981 


Language No. of speakers 
Serbian/Croatian/Bosnian 16,342,884 
Slovene 1,761,394 
Albanian 1,756,663 
Macedonian 1,373,930 
Hungarian 409,080 
Romany 140,619 
Arumanian 135,589 
Turkish 82,090 
Slovak 74,033 
Rumanian 59,870 
Bulgarian 37,268 
Ruthenian 19,413 
Italian 19,411 
Czech 16,197 
German 9,223 
[illegible] [illegible] 
[illegible] 3,730 
French [illegible] 
Polish [illegible] 
Greek [illegible] 
English [illegible] 
Danish 971 
Swedish 653 
Dutch 325 
Norwegian 321 
Yiddish 58 
other 4,970 
unknown 161,004 
Total 22,424,687 


Table 7.10 Linguistic composition of Macedonia, 2002 


Language No. of speakers 
Macedonian 1,344,815 
Albanian 507,989 
Turkish 71,757 
Romany 38,528 
Arumanian 6,884 
Serbian/Bosnian 33,333 
other 19,241 
Total 2,022,547 


Table 7.11 Linguistic composition of Bulgaria, 2001 


Language No. of speakers 
Bulgarian 6,697,158 
Turkish 762,516 
Romany 327,882 
other 71,084 
not stated 45,454 
unknown 24,807 
Total 7,928,901 


Statistical Institute at http://www.nsi.bg/Census_e/ 
Census_e.htm. This web page gives statistics not only 
for the country as a whole but also for each 
administrative district, from which one can see that in two 


Table 7.8 Linguistic composition of Slovenia, 2002 


Language No. of speakers 
Slovene 1,789,460 
Italian 1,488 
Hungarian 3,771 
Romany 1,630 
Albanian 2,296 
Serbian/Croatian/Bosnian 19,477 
Macedonian 780 
other monolingual 3,595 
Slovene and Italian 4,484 
Slovene and Hungarian 3,910 
Slovene and Serbian/Croatian/Bosnian 65,462 
Slovene and other 14,131 
other bilingual 243 
unknown 53,309 
Total 1,964,036 


Table 7.9 Linguistic composition of Croatia, 2001 


Language No. of speakers 
Serbian/Croatian/Bosnian 4,326,382 
Italian 20,521 
Albanian 14,621 
Hungarian 12,650 
Slovene 11,872 
Romany 7,860 
Czech 7,178 
Slovak 3,993 
Macedonian 3,534 
German 3,013 
Ruthenian 1,828 
Rumanian 1,205 
Russian 1,080 
Ukrainian 1,027 
Polish 536 
Turkish 347 
Bulgarian 265 
Yiddish 8 
Arumanian 7 
other 2,824 
unknown 16,709 
Total 4,437,460 


districts, Kurdzhali and Razgrad, Turkish is the 
majority language. 

Slovene, written in the Latin alphabet, is the 
official language of Slovenia. Macedonian and Bul- 
garian, both written in the Cyrillic alphabet, are the 
official languages of Macedonia and Bulgaria respect- 
ively. Of the varieties of Serbian/Croatian/Bosnian, 
Serbian (written in the Cyrillic alphabet) is official 
in Serbia and Montenegro, Croatian (written in the 
Latin alphabet) in Croatia, and both of these plus 
Bosnian (written in the Latin alphabet) in Bosnia- 
Herzegovina. Albanian, written in the Latin alphabet, 
is a de facto official language in Kosovo. 

See Section 8.2 for Albania, Kosovo/Kosova, 
Greece and Cyprus; see Section 8.3 for Romania; see 
Section 8.4 for Hungary. 


7.2.4 Turkey 


The last Turkish census to include a question on lan- 
guage (namely, the language usually used in the 
home) is that of 1965. The figures in Table 7.12 
(which includes all languages with over 4,000 
speakers) are cited from the official census returns 
(TGNS 1965). It should be noted that these same stat- 
istics show many of the languages other than Turkish 
also having large numbers of second-language 
speakers, that is, members of the given ethnic 
group who are Turkish-dominant bilinguals. The cur- 
rent population of Turkey (mid-2005 estimate) is 
69,660,559; given that the minority languages, espe- 
cially the smaller ones, have been subject to assimila- 
tory pressure from Turkish for some time, one would 
expect them to have decreased as a percentage of the 
total. Of the languages not mapped, Circassian is 
spoken in compact areas (alongside Turkish) scat- 
tered through much of the country; most speakers 
of Greek, Armenian and Judezmo live in Istanbul. 
Turkish, written using the Latin alphabet, is the sole 
official language, though many of the draconian laws 
against other languages (including banning their use 
in public) have been repealed in recent years. 


7.2.5 Russia 


The published results of the 2002 census of the 
Russian Federation (Itogi 2002, 2004) represent in 
many ways a refreshing break from the way language 
statistics were reported in the published volumes of 


Table 7.12 Linguistic composition of Turkey, 1965 


Language                    Number of speakers   Second-language speakers 
Turkish¹                    28,289,680           1,387,139 
Kurdish²                    2,370,233            449,998 
Arabic                      365,340              167,924 
Circassian³                 58,339               48,621 
Greek                       48,096               75,941 
Georgian                    34,330               44,934 
Armenian                    33,094               22,260 
English                     27,841 
Bulgarian⁴                  27,226               3,126 
Laz                         26,007               55,158 
Serbian/Croatian/Bosnian⁵   24,271               3,135 
Albanian                    12,832               39,613 
Judezmo⁶                    9,981                3,510 
German                      4,901 
Abaza                       4,563                7,556 
other                       54,693 
Total                       31,391,421 

Notes: 

1 Includes speakers of other Turkic languages, though their number is 
small relative to this total. 

2 This figure combines those given for Kurdish, Zaza, ‘kirmanca’ and 
‘kirdasga’. 

3 Listed as a single language, with no distinction of West Circassian 
(Adyghe) versus East Circassian (Kabardian). 

4 Combines the figures given for Pomak (i.e. Bulgarian-speaking 
Muslims) and Bulgarian. 

5 Combines the figures for Bosnian, Serbian and Croatian. 

6 May include speakers of Yiddish. 


the later Soviet censuses; see Table 7.13. In particular, 
language data are presented directly, rather than 
through the prism of ethnic data. 

However, there are also new problems intro- 
duced by the precise way in which the data are 
reported in the published version. First, respondents 
were allowed to name more than one language 
spoken, and the totals for each language seem to 
reflect not only first-language speakers but also at 
least some second-language speakers. (The total given 
in the last line of Table 7.13 is the total population of 
the Russian Federation, not the result of adding 
together the numbers of speakers of all the lan- 
guages.) This leads to a substantially higher figure 
than for first-language speakers — not only for 
Russian, but also for some other languages, perhaps 
most strikingly English. Gordon (2005) estimates the 
number of native speakers of Russian in Russia in 
2000 at 117,863,645. 

Secondly, the new freedom given to respondents 
has led to some strikingly inconsistent results. In particular,
the numbers for most Andic languages are
well under other estimates (and marked ⁴ in Table
7.13), with only Akhvakh and Karata seeming plausible,
and Andi somewhat over the expected total; this
probably reflects the desire of many speakers to iden- 
tify themselves with the local lingua franca, Avar. 
JaNSSSR IV (1966) gives the following numbers of
speakers for the individual Andic languages: Andi 
8,000-9,000, Botlikh 3,000, Godoberi 2,500, Karata 
5,000, Akhvakh 5,000, Bagvalal 4,000, Tindi 5,000, 
Chamalal 4,000. 

There are equally clear cases of over-reporting 
(marked ⁵ in Table 7.13), especially for some of the
smaller languages of northern Asia. Kerek and Yugh 
are known to have become extinct recently. More 
realistic figures for some other languages, following 
Gordon (2005), are: Enets 70, Kamchadal 60, Oroch
100-150, Udihe 100, Yukaghir 40-200. 

All of these factors need to be taken into account 
in interpreting Table 7.13. 

Russian is the sole official language of the 
Russian Federation as a whole. In the constituent 
republics of the Russian Federation, the local lan- 
guage is normally de facto or de jure a co-official 
language, written in the Cyrillic alphabet except 
where indicated otherwise. These republics are, with 
identification of corresponding languages: Adygeya 
(Adyghe), Altay Republic (Altai), Bashkortostan 
(Bashkir), Buryatia (Buryat), Chechnya (Chechen), 
Table 7.13 Linguistic composition of Russia, 2002


Language No. of speakers 
Russian 142,573,285 
English 6,955,315 
Tatar 5,347,706 
German 2,895,147 
Ukrainian 1,815,210 
Bashkir 1,379,727 
Chechen 1,331,844 
Chuvash 1,325,382 
Armenian 904,892 
Avar 784,840 
French 705,217 
Azerbaijani 669,757 
Mordvin¹ 614,260
Kabardian 587,547 
Kazakh 563,749 
Dargi 503,523 
Ossetian 493,610 
Mari² 487,855
Udmurt 463,837 
Kumyk 458,121 
Yakut 456,288 
Ingush 405,343 
Lezgian 397,310 
Buryat 368,807 
Belarusian 316,890 
Karachay-Balkar 302,748 
Georgian 286,285 
Tuvan 242,754 
Uzbek 238,831 
Komi 217,316 
Romany 166,514 
Turkish 161,319 
Kalmyk 153,602 
Lak 153,373 
Moldovan 147,035 
Tajik 131,530 
Adyghe 129,419 
Tabasaran 128,391 
Spanish 111,900 
Komi-Permyak 94,328 
Polish 94,038 
Nogai 90,020 
Altai 65,534 
Korean 60,088 
Chinese 59,235 
Greek 56,473 
Italian 54,172 
Karelian 52,880 
Khakas 52,217 
Finnish 51,891 
Arabic 50,140 
Lithuanian 49,020 
Kyrgyz 46,319 
Turkmen 38,533 
Abaza 38,247 
Kurdish 36,609 
Latvian 34,759 
Nenets 31,311 
Bulgarian 30,894 
Yiddish/Hebrew³ 30,019
Agul 29,399 
Rutul 29,383 
Estonian 26,645 
Vietnamese 26,197 
Japanese 24,787 
Andi 23,729 
Rumanian 22,663 
Tsez 15,356 
Khanty 13,568 
Czech 13,242 
Mongolian 11,498 
Tsakhur 9,771 
Hungarian 9,712 
Serbian/Croatian/Bosnian 9,674 
Persian 9,568 
Portuguese 9,531 
Abkhaz 9,447 
Pashto 8,580 
Aramaic 7,762 
Chukchi 7,742 
Gagauz 7,597 
Evenki 7,584 
Even 7,168 
Swedish 7,113 
Karata 6,574 
Bezhta 6,461 


Shor 6,210 
Hindi 5,853 
Akhvakh 5,793 
Veps 5,753 
Talishi 5,310 
Dolgan 4,865 
Nanay 3,886 
Albanian 3,220 
Koryak 3,019 
Tat 3,016 
Udi 2,960 
Mansi 2,746 
Dutch 2,599 
Mingrelian 2,590 
Chamalal⁴ 2,355
Slovak 2,169 
Uighur 1,932 
Teleut 1,892 
Khwarshi 1,872 
Hunzib 1,839 
Selkup 1,641 
Karakalpak 1,561 
Dunganese 1,088 
Crimean Tatar 1,069 
Kumandi 1,044 
Saami 787 
Votic 774 
Ulcha 732 
Bengali 696 
Nivkh 688 
Tindi⁴ 616
Yukaghir 604 
Hinuq 548 
Chelkan 539 
Archi 524 
Nganasan 505 
Ket 485 
Roshani 441 
Tubalar 436 
Eskimo 410 
Kamchadal⁵ 385
Tofa 378 
Ingrian 362 
Balochi 345 
Chulym Turkic 270 
Oroch⁵ 257
Udihe⁵ 227
Aleut 175 
Svan 153 
Negidal 147 
Yugh⁵ 131
Enets⁵ 119
Godoberi⁴ 103
Botlikh⁴ 90
Karaim 88 
Orok 64 
Laz 62 
Bagvalal⁴ 57
Astrakhan Nogai 55 
Shughni 52 
Alutor 40 
Krymchak 29 
Yurta Tatar 22 
Kerek⁵ 15
Alabugat Tatar 10 
Baraba Tatar 8 
other 107,715 
not given 1,420,544 
Total 145,166,731 
Notes: 


1 Mordvin includes both Erzya and Moksha; Erzya constitutes about 
60 per cent of the total, Moksha about 40 per cent. 

2 Mari combines figures for Hill Mari and Eastern Meadow Mari. 

3 The combination of figures for the genetically unrelated Yiddish and
Hebrew represents a bizarre intrusion of non-linguistic criteria.

4 These numbers seem unrealistically low. See discussion above 
(p. 233). 

5 These numbers seem unrealistically high. See discussion above 
(p. 233). 


Chuvashia (Chuvash), Daghestan (see below), 
Ingushetia (Ingush), Kalmykia (Kalmyk), Kabardino- 
Balkaria (Kabardian, Karachay-Balkar), Karachay- 
Cherkessia (Karachay-Balkar, Kabardian), Karelia (to 
some extent Finnish, using the Latin alphabet), Kha- 
kasia (Khakas), Komi Republic (Komi), Mari El (Mari 
— there are two varieties of the written language, 
Meadow-Eastern Mari and Hill Mari), Mordovia 
(Erzya and Moksha Mordvin), Sakha/Yakutia (Yakut), 
North Ossetia-Alania (Ossetian), Tatarstan (Tatar),
Tyva (Tuvan), Udmurtia (Udmurt). A lower level of
use of the local language is the norm in the autono- 
mous districts: Aginsk Buryat (Buryat), Chukchi 
(Chukchi), Evenki (Evenki), Jewish (Yiddish), 
Khanty-Mansi/Yugra (Khanty, Mansi), Komi-Permyak
(Komi-Permyak), Koryak (Koryak), Nenets
(Nenets), Taymyr/Dolgan-Nenets (Dolgan, Nenets), 
Ust' Ordy Buryat (Buryat), Yamal Nenets (Nenets); all 
the named languages use the Cyrillic alphabet, except 
for Yiddish, which is written using the Hebrew 
alphabet. 

According to the constitution of multi-ethnic 
and multilingual Daghestan, all indigenous languages 
are official, but in practice only the following have this status:
Avar, Dargi, Kumyk, Lak, Lezgian, Tabasaran. In recent 
years, limited use as a written language has been 
approved for some other languages, including Andi, 
Rutul, Tsakhur and Tsez. All use the Cyrillic alphabet. 

Nogai, though not associated with an ethnic 
administrative area, also has some currency as a 
written language in Russia. In addition, limited
use is made of a wide range of smaller languages,
for instance in teaching local lore in
schools. All such languages use the Cyrillic alphabet.


7.2.6 Ukraine, Belarus and Moldova 


Linguistic data from the 2001 census of Ukraine, 
prepared by the State Statistics Committee of Ukraine, 
are available on the website http://www.ukrcensus.gov.ua/
and are reproduced in Table 7.14. Ethnic data
are also available from the same source for individual 
regions, and show that Ukrainians are an ethnic 
majority in every region except for the Autonomous 
Republic of Crimea, where Russians predominate; 
otherwise, Russians are more concentrated in the east 
of the country than in the west. A striking change
vis-à-vis Soviet times is the return of large numbers of
Crimean Tatars to Crimea. 


Table 7.14 Linguistic composition of Ukraine, 2001 


Language No. of speakers 
Ukrainian 32,708,475 
Russian 14,343,272 
Crimean Tatar 228,344 
Moldovan 181,020 
Hungarian 149,396 
Rumanian 138,467 
Bulgarian 131,353 
Belarusian 54,608 
Armenian 50,350 
Tatar 25,802 
Azerbaijani 23,956 
Gagauz 22,809 
Romany 21,277 
Polish 18,589 
Georgian 12,551 
Greek 5,856 
German 4,063 
Yiddish 3,212 
Total 48,457,000 


For Belarus and Moldova, no up-to-date lan- 
guage statistics have been available to me, and Tables 
7.15 and 7.16 therefore reproduce data from the 1989 
census of the USSR (IVPN 1989, 1993: 154, 524).

Ukrainian, written in the Cyrillic alphabet, is the 
official language of Ukraine, with Russian also being 
widely used; Crimean Tatar also has some use as a 
written language, using the Latin alphabet. In Belarus, 
Belarusian and Russian, both written in the Cyrillic 
alphabet, are co-official languages. Moldovan, written 
in the Latin alphabet, is the official language of 
Moldova, with Russian, Ukrainian and Gagauz also 
having currency as written languages, all using the 
Cyrillic alphabet. 


7.2.7 The South Caucasus 


For the three independent states of the South 
Caucasus (also known, especially in earlier work, as 
Table 7.15 Linguistic composition of Belarus, 1989


Language No. of speakers 
Belarusian* 6,664,156 
Russian* 3,243,179 
Ukrainian 132,110 
Polish 55,727 
Romany 8,899 
Yiddish 8,530 
Lithuanian 4,394 
Tatar 3,212 
other 31,599 
Total 10,151,806 
Note: 


* For Belarusian and Russian, the number given includes all native 
speakers in Belarus. For other named languages, the number given 
includes all members of the corresponding ethnic group in Belarus 
with the given language as native language. 


Table 7.16 Linguistic composition of Moldova, 1989 


Language No. of speakers 
Moldovan* 2,687,793 
Russian* 1,003,563 
Ukrainian 369,999 
Gagauz 139,906 
Bulgarian 69,614 
Yiddish 17,016 
Romany 9,486 
Belarusian 8,397 
German 2,283 
Azerbaijani 1,706 
Armenian 1,436 
Tatar 1,430 
Rumanian 790 
Polish 458 
other 21,483 
Total 4,335,360 
Note: 


* For Moldovan and Russian, the number given includes all native 
speakers in Moldova. For other named languages, the number given 
includes all members of the corresponding ethnic group in Moldova 
with the given language as native language. 


Table 7.17 Linguistic composition of Armenia, 2001 


Language No. of speakers 
Armenian 3,139,152 
Kurdish 31,799 
Russian 29,563 
Ukrainian 818 
other 11,679 
Total 3,213,011 


Table 7.18 Linguistic composition of Georgia, 1989 


Language No. of speakers 
Georgian* 3,790,630 
Russian* 479,279 
Armenian 371,642 
Azerbaijani 300,474 
Ossetian 125,872 
Abkhaz 92,369 
Greek 57,230 
Ukrainian 28,018 
Kurdish 24,905 
Belarusian 4,660 
Avar 3,976 
Aramaic 3,335 
Tatar 2,429 
Moldovan 1,951 
Yiddish 1,369 
other 112,702 
Total 5,400,841 
Note: 


* For Georgian and Russian, the number given includes all native 
speakers in Georgia. For other named languages, the number given 
includes all members of the corresponding ethnic group in Georgia 
with the given language as native language. 


Transcaucasia), recent linguistic data are available to 
me only for Armenia, from the census of 2001; the 
data, prepared by the National Statistical Office of the 
Republic of Armenia, are available on-line at
http://docs.armstat.am/census/eng.php (unit 2, table 5.2)
and are reproduced in Table 7.17. The figure cited in 
Table 7.17 for Kurdish is that given in the original 
under the heading “Yezed”, and may therefore refer to 
only part of the overall Kurdish-speaking population. 

For Georgia and Azerbaijan, no recent relevant 
data have been available to me, and Tables 7.18 and 
7.19 therefore reproduce data from the 1989 census of 
the USSR (IVPN 1989, 1993: 444, 484). 

In Georgia, more detailed figures from IVPN 1989
for Abkhazia (p. 452) and South Ossetia (p. 466)
show that in 1989 nearly all speakers of Abkhaz lived
in Abkhazia, and that South Ossetia had a majority of


Table 7.19 Linguistic composition of Azerbaijan, 1989 


Language No. of speakers 
Azerbaijani* 5,780,787 
Russian* 528,762 
Armenian 328,681 
Lezgian 154,239 
Avar 42,398 
Ukrainian 21,117 
Tatar 20,020 
Talishi 19,203 
Turkish 16,539 
Tsakhur 12,744 
Georgian 12,453 
Tat 10,119 
Yiddish 9,494 
Kurdish 7,970 
Udi 5,491 
Belarusian 5,022 
Lak 1,519 
Moldovan 1,425 
other 43,195 
Total 7,021,178 
Note: 


* For Azerbaijani and Russian, the number given includes all native 
speakers in Azerbaijan. For other named languages, the number 
given includes all members of the corresponding ethnic group in 
Azerbaijan with the given language as native language. 


Ossetian speakers (64,185 out of a total population of 
98,527). Despite the large-scale population move- 
ments that accompanied the break-up of the USSR in 
these areas, at least these generalizations still remain 
true. The published figures from the 1989 census, like 
all those from later Soviet censuses, provide language 
statistics as a function of ethnic statistics, and there- 
fore do not include statistics for languages whose 
speakers are considered members of another ethnic 
group, in particular Svan, Mingrelian, Laz and Tsova 
Tush, whose speakers are included among ethnic 
Georgians. Gordon (2005) estimates the number of 
speakers in Georgia as follows: Svan 15,000, Mingrelian
500,000, Laz 2,000 and Tsova Tush 3,420.

For Azerbaijan, the more detailed figures given 
for Karabagh (p. 498) show an Armenian-speaking 
majority (143,323 out of a total population of 
189,085). Two languages spoken in Azerbaijan are not 
listed in the census statistics, since their speakers are 
considered ethnic Azerbaijanis: Gordon (2005) gives 
the number of speakers of Budukh as 1,000, of Kryz 
as 6,000. 

Armenian and Georgian, each written in its own 
distinct alphabet, are the official languages of Arme- 
nia and Georgia respectively. Azerbaijani, now written 
in the Latin alphabet, is the official language of 
Azerbaijan, though Russian is also widely used. In 
Abkhazia, Abkhaz and Russian are de facto official 
languages; the same applies to Ossetian and Russian 
in South Ossetia. 


7.2.8 Central Asia 


Of the five newly independent Central Asian states
that emerged from the break-up of the USSR
(Kazakhstan, Turkmenistan, Uzbekistan, Kyrgyzstan
and Tajikistan), I have been able to find detailed
post-Soviet census information only for Kyrgyzstan, and
this is restricted to information on the ethnic com- 
position of the population. Table 7.20 reproduces the 
ethnic information from the 1999 census from Nas 
Kyrg (2000), together with comparable linguistic 
information relating to the 1989 census of the USSR 
(IVPN 1989, 1993: 548). The substantial decline in the 
Russian (also the Ukrainian and German) population 
of Kyrgyzstan is obvious. 


Table 7.20 Ethnic and linguistic composition of Kyrgyzstan, 1999 and 1989


Language           1999 ethnic    1989 linguistic
Kyrgyz*            3,128,147      2,239,044
Uzbek              664,950        540,118
Russian*           603,201        1,090,667
Dunganese          51,766         35,471
Ukrainian          50,442         36,836
Uighur             46,944         26,186
Tatar              45,438         53,510
Kazakh             42,657         29,045
Tajik              42,636         29,965
Turkish            33,327         13,411
German             21,471         64,214
Korean             19,784         9,857
Azerbaijani                       13,873
Kurdish                           13,871
Karachay-Balkar                   3,889
Belarusian                        3,168
Armenian                          2,807
Crimean Tatar                     2,593
Bashkir                           2,401
Chechen                           2,399
Dargi                             2,363
Lezgian                           2,233
Mordvin                           1,711
Greek                             1,183
Chuvash                           947
Yiddish                           925
Kalmyk                            521
other              72,175         34,547
Total              4,822,938      4,257,755
Note:


* In the 1989 linguistic statistics, for Kyrgyz and Russian, the number given includes all native speakers in Kyrgyzstan. For other named languages,
the number given includes all members of the corresponding ethnic group in Kyrgyzstan with the given language as native language. Note that, for
many of the smaller speech communities named in the 1989 statistics, the corresponding ethnic group is not separately named in the ethnic
statistics available to me for 1999.

For the other states, Tables 7.21-24 give the
language statistics from the 1989 census of the 
USSR for Kazakhstan, Turkmenistan, Uzbekistan and 
Tajikistan, respectively (IVPN 1989, 1993: 296, 646, 
192, 592). Table 7.25 gives contemporary ethnic com- 
position statistics, taken from McGeveran (2006). 
Again, the sharp decline in the percentage of Russians 
in the population is everywhere evident. 


Table 7.21 Linguistic composition of Kazakhstan, 1989


Language No. of speakers 
Russian* 7,797,458
Kazakh* 6,467,683 
German 520,920 
Ukrainian 328,243 
Uzbek 317,319 
Tatar 226,098 
Uighur 176,157 
Azerbaijani 78,460 
Belarusian 62,967 
Korean 53,420 
Turkish 47,761 
Chechen 44,376 
Greek 32,633 
Dunganese 28,570 
Bashkir 28,472 
Kurdish 24,229 
Tajik 23,428 
Moldovan 17,280 
Ingush 17,108 
Mordvin 13,415 
Lezgian 12,444 
Armenian 11,871 
Kyrgyz 11,105 
Chuvash 11,072 
Udmurt 7,431 
Mari 7,306 
Polish 7,300 
Georgian 6,641 
Romany 6,249 
Lithuanian 5,335 
Bulgarian 3,822 
Yiddish 3,555 
other 64,336 
Total 16,464,464 
Note: 


* For Kazakh and Russian, the number given includes all native 
speakers in Kazakhstan. For other named languages, the number 
given includes all members of the corresponding ethnic group in 
Kazakhstan with the given language as native language. 


Table 7.22 Linguistic composition of Turkmenistan, 1989


Language No. of speakers 
Turkmen* 2,533,523 
Russian* 421,015 
Uzbek 302,189 
Kazakh 85,345 
Tatar 28,938 
Balochi 27,682 
Azerbaijani 26,634 
Armenian 20,989 
Ukrainian 18,779 
Lezgian 9,287 
Persian 6,011 
Belarusian 5,176 
Bashkir 3,349 
German 2,310 
Tajik 2,759 
Karakalpak 2,607 
Kurdish 1,724 
Korean 1,587 
Mordvin 1,259 
other 21,554 
Total 3,522,717 
Note: 


* For Turkmen and Russian, the number given includes all native 
speakers in Turkmenistan. For other named languages, the number 
given includes all members of the corresponding ethnic group in 
Turkmenistan with the given language as native language. 


Table 7.23 Linguistic composition of Uzbekistan, 1989


Language No. of speakers 
Uzbek* 14,128,089 
Russian* 2,151,634 
Tajik 896,771 
Kazakh 751,635 
Karakalpak 389,775 
Tatar 367,228 
Kyrgyz 140,134 
Turkmen 109,811 
Korean 102,175 
Turkish 101,383 
Ukrainian 75,396 
Azerbaijani 35,263 
Armenian 31,383 
Bashkir 23,684 
Uighur 21,986 
German 18,907 
Yiddish 17,348 
Belarusian 15,558 
Romany 6,748 
Mordvin 5,633 
Greek 5,595 
Chuvash 5,494 
Persian 3,739 
other 404,708 
Total 19,810,077 
Note: 


* For Uzbek and Russian, the number given includes all native
speakers in Uzbekistan. For other named languages, the number 
given includes all members of the corresponding ethnic group in 
Uzbekistan with the given language as native language; the figure for 
Tajik includes members of the ethnic group ‘Central Asian Jews’ with 
Tajik as native language. 


Table 7.24 Linguistic composition of Tajikistan, 1989 


Language No. of speakers 
Tajik* 3,151,754 
Uzbek 1,178,383 
Russian* 495,180 
Kyrgyz 62,553 
Tatar 55,433 
Ukrainian 20,807 
Turkmen 19,661 
German 19,459 
Kazakh 10,041 
Korean 7,342 
Crimean Tatar 6,794 
Ossetian 5,879 
Bashkir 4,566 
Belarusian 3,595 
Armenian 3,524 
Mordvin 2,675 
Azerbaijani 2,578 
Yiddish 1,804 
Chuvash 1,433 
Pashto 1,115 
other 38,027 
Total 5,092,603 
Note: 


* For Tajik and Russian, the number given includes all native speakers 
in Tajikistan. For other named languages, the number given includes 
all members of the corresponding ethnic group in Tajikistan with the 
given language as native language. 


The more detailed figures for Karakalpakstan, an 
autonomous republic within Uzbekistan, show that 
speakers of Karakalpak constitute just over one-third 
of the population, speakers of Uzbek just under one- 
third, speakers of Kazakh just under one-quarter 
(p. 204). Given that Soviet census statistics do not 
list figures for languages whose speakers are assigned 
to some other ethnic group, Table 7.24 contains 
no information on the Pamir languages and some 
other languages spoken in Tajikistan whose speakers 
are considered to be ethnic Tajiks. For the Pamir 
languages and Yaghnobi, JaNSSSR I (1966) gives 
the following figures: Roshani 8,000-8,500, Bartangi
3,000-4,000, Oroshori 1,500-2,000, Shughni 20,000,
Yazgulami 1,500-2,000, Ishkashmi 500, Wakhi
6,000-7,000, Yaghnobi 2,000. For the Indo-Aryan language
Parya, Gordon (2005) lists 1,195 speakers in
Tajikistan. 

The situation regarding official languages is 
somewhat fluid, more especially with respect to the 
status of Russian (which is widely used in all five 
countries) and the officially sanctioned alphabet. My 
best current information is that both Russian and, 
respectively, Kazakh and Kyrgyz are official languages 
in Kazakhstan and Kyrgyzstan, while only Turkmen, 
Uzbek and Tajik are official languages in Turkmeni- 
stan, Uzbekistan and Tajikistan, respectively. Kazakh 
and Kyrgyz are written in the Cyrillic alphabet. Tajik 
may be written using the Cyrillic or Arabic alphabet. 
Turkmen and Uzbek are now supposed to be written 
officially using the Latin alphabet. In Karakalpakstan, 
Karakalpak has some official status as a written lan- 
guage. In addition, Dunganese and Uighur both have 
some currency as written languages in the areas 
where they are spoken. 


7.3 Genetic, typological and 
sociolinguistic information 


7.3.1 Indo-European languages 


All the living branches of the Indo-European family 
are represented within this area, with the exception of 
Celtic. Slavonic, Baltic and Armenian have their prin- 
cipal foci within the area mapped. 


7.3.1.1 Slavonic languages 


All the living Slavonic languages, with the exception 
of Upper and Lower Sorbian (spoken in Germany), 
have their foci within the area mapped. 

East Slavonic includes Russian (in Russia, but 
also widespread in most other former Soviet repub- 
lics), Ukrainian (in Ukraine, with some spill-over 
into adjacent countries and a substantial émigré 
community in Canada) and Belarusian or Belorussian 
(in Belarus, with some spill-over into Poland). 

West Slavonic refers to Polish (in Poland, with 
some spill-over into the Czech Republic and Slovakia 
and a substantial minority in Lithuania and adjacent 
regions, in addition to émigré communities in North 
America and western Europe), as well as Czech (in 
the Czech Republic) and Slovak (in Slovakia, with 
some spill-over into Hungary and Serbia and Mon- 
tenegro). The variety referred to in the census stat- 
istics of some countries as Ruthenian is essentially a 
variety of eastern Slovak with some admixture of 
Ukrainian, spoken by a population group ethnically 
closest to the Ukrainians. Kashubian is sometimes 
considered a distinct language, sometimes a dialect of 
Polish. 

South Slavonic contains Serbian/Croatian/
Bosnian, formerly known as Serbo-Croatian (in
Croatia, Bosnia-Herzegovina, Serbia and Montenegro,
with some spill-over into other parts of the
former Yugoslavia), Slovene (in Slovenia, with some
spill-over into Italy and Austria), Bulgarian (in 
Bulgaria, with some spill-over into adjacent countries 
and Ukraine) and Macedonian (in Macedonia, with 
some spill-over into Bulgaria, where it is considered a 
dialect of Bulgarian, and Greece). 

Typological characteristics of Slavonic languages 
are phonemic palatalization (especially in East Sla- 
vonic and Polish), widespread morphophonemic 
alternations, a conservative morphological system 
(especially in nouns, although nominal declension 
has been lost in Bulgarian and Macedonian), a 
well-defined semantic opposition of perfective and 
imperfective aspect in verbs and ‘free’ (i.e., largely 
pragmatically determined) word order (though 
Subject–Verb–Object is pragmatically most neutral).


7.3.1.2 Baltic languages 


Both surviving Baltic languages, Lithuanian and 
Latvian, are spoken primarily in the respective Baltic 
Table 7.25 Ethnic composition of Kazakhstan, Turkmenistan, Uzbekistan and Tajikistan, mid-2005 (estimate, as percentage of total population)


Country 
Ethnic origin Kazakhstan Turkmenistan Uzbekistan Tajikistan 
Kazakh 53 2 3 
Turkmen 77 
Uzbek 3 9 80 25 
Tajik 5 65 
Karakalpak 3 
Uighur 1 
Tatar 2 
Russian 30 7 6 4 
Ukrainian 4 
German 2 
Total population 15,185,844 4,952,081 26,851,195 7,163,506 


states, though Lithuanian in particular has a large 
émigré population, especially in North America. 
Typologically, they are similar to Slavonic languages, 
though with an even more conservative morpho- 
logical system, especially in Lithuanian. 


7.3.1.3 Indo-Iranian 


Of the two main sub-branches of Indo-Iranian, 
namely Indo-Aryan and Iranian, the former is repre- 
sented only marginally in the area mapped, by 
Romany and Parya. Romany, in a number of dialects, 
is the language of the Roma (Gypsies) scattered 
throughout eastern Europe, though in many areas 
Roma have assimilated linguistically to surrounding 
languages; Romany is nowhere an official language, 
though limited use of written versions has developed 
in recent years. Because of the scattered nature 
of Romany speakers it has not been possible to 
plot them on the maps. Parya is spoken between 
Dushanbe (Tajikistan) and the border of Uzbekistan, 
in Central Asia; the speech community is too small to 
plot on the maps. 

By contrast, a substantial proportion of the 
Iranian languages are spoken in the area covered, in 
particular in the Caucasus, Central Asia and Turkey; 
the major foci of Iranian languages are Iran and 
Afghanistan, though substantial Iranian-speaking 
communities are also found in Pakistan and Iraq, to a 
lesser extent in Syria and the Gulf states, with some 
even in China. The sub-divisions of the Iranian 
branch and the individual languages spoken in the 
area are given below. 

North-west Iranian covers Kurdish (predomin- 
antly found in Turkey, Iran and Iraq; small percentage 
in the Caucasus and Turkmenistan) — note that some 
varieties of Kurdish are sufficiently distinctive to be 
considered as separate languages, e.g. Zaza, which was
listed as a separate language in Turkish census statistics
— and two other languages: Talishi (Azerbaijan and 
Iran); Balochi (predominantly in Pakistan and Iran, 
also in Afghanistan and the Gulf states, with a small 
percentage in Turkmenistan). 

South-west Iranian includes Persian (predominantly
in Iran and Afghanistan, in the latter as the
regional variant Dari; small percentage in
Uzbekistan and Turkmenistan), Tajik (Tajikistan,
parts of Uzbekistan), which is essentially a regional 
variant of Persian, and Tat (the latter has some use as 
a written language in the Caucasus). The term Tat is 
also widely used for a wider range of Iranian language 
communities in Azerbaijan and north-western Iran. 

North-east Iranian comprises Ossetian (northern 
Caucasus) and Yaghnobi (originally spoken in the 
upper valley of the Yagnob River, some 60 miles north 
of Dushanbe, but most speakers have now moved elsewhere
in Tajikistan because of the severe climate in
their original homeland). 

South-east Iranian consists of the closely knit
Shughni-Roshani subgroup, comprising Roshani (including
the Khufi dialect; Tajikistan, fewer in Afghanistan),
Bartangi (Tajikistan), Oroshori (Tajikistan) and
Shughni (Tajikistan, perhaps somewhat fewer in
Afghanistan), and three other languages: Yazgulami
(Tajikistan), Ishkashmi (Tajikistan, more in Afghanistan)
and Wakhi (Tajikistan, more in Afghanistan
with spill-over into Pakistan, China).

Iranian languages exhibit a fair amount of typo- 
logical variation. Although the usual basic order is 
Subject–Object–Verb, other constructions are not
head-final, and in Persian and Tajik the noun phrase 
is largely head-initial. Phonological systems are gen- 
erally relatively simple, with greater complexity in the 
Eastern languages. Nominal morphology ranges from 
the complete loss of case in Persian and Tajik to more 
complex fusional (inflectional) systems, as in Kurdish. 
Most languages (though not Persian and Tajik) have 
some morphological ergativity; that is, direct objects 
and intransitive subjects have the same form, with 
a distinct form for transitive subjects, though 
the ongoing loss of ergativity found throughout 
the branch has given rise to some highly idiosyn- 
cratic combinations of morphological ergativity and 
morphological accusativity. 


7.3.1.4 Armenian 


The Armenian branch consists of the single language 
Armenian, though there are two rather different 
standard modern Armenian languages, Eastern 
Armenian and Western Armenian. Western Armenian 
is spoken in Turkey (mainly Istanbul) and by most 
Armenians in the diaspora. Eastern Armenian is 
spoken predominantly in the Caucasus, including 
not only Armenia but also substantial minorities in 
Georgia and Azerbaijan (especially the Karabagh 
region); Armenian communities in or from Iran also 
speak Eastern Armenian. Unlike Classical Armenian, 
Modern Eastern Armenian has a predominantly
agglutinative morphology, especially for nouns (e.g.
hay-er-i-n ‘of the Armenians’, literally ‘Armenian-PL-GEN-DEF’).
The word order is basically Subject–Object–Verb,
and the noun phrase is also basically
head-final, except that finite relative clauses follow. 
Subordination in Armenian is primarily by means of 
finite clauses. 


7.3.1.5 Albanian 


Another single language comprising a branch of 
Indo-European, Albanian has its main focus in 
Albania, but an important secondary focus in the 
Kosovo region of Serbia and Montenegro and 
adjacent areas, especially Macedonia, with about one- 
third of the total number of Albanian speakers; there 
are also a few Albanian speakers in south-eastern 
Ukraine. 


7.3.1.6 Romance 


The major Romance languages in the area mapped 
are varieties of Rumanian. The most widely spoken 
by far is Daco-Rumanian, usually just called Ruma- 
nian, which is the official language of Romania and, 
under the name Moldovan, of Moldova. Of the other, 
more distinct varieties of Rumanian, Arumanian
(Macedo-Rumanian) is spoken by compact populations
in eastern Serbia, while the few speakers of
Istro-Rumanian live on the Istrian peninsula in 
Croatia. In the last-named region there are also some 
speakers of Italian. Finally, Judezmo (Ladino, Judeo- 
Spanish) is spoken by rapidly diminishing com- 
munities of Jews in the former Yugoslavia, Bulgaria 
and Turkey; there is a traditional written language 
using the Hebrew alphabet. Istro-Rumanian, Italian 
and Judezmo are not plotted on the maps. 


7.3.1.7 Germanic 


The major German-speaking population in the area 
is now in Central Asia and southern Siberia. Before 
the Second World War they lived as a compact group 
on the Volga, opposite Saratov, but during the war 
they were moved, mainly to Soviet Central Asia 
(especially Kazakhstan). German has some use here as 
a written language, in essentially the same form, using 
the Latin alphabet, as in German-speaking countries. 
Elsewhere in the area mapped, the largest
German-speaking population is in the Czech Republic,
though there are also speakers in Poland. The
other Germanic language in the area mapped is Yid- 
dish (Judeo-German), the language of Ashkenazic 
Jewish communities, which is now spoken, within the 
area mapped, essentially only in the former USSR, 
with a small number in Poland; Yiddish speakers are 
nowhere a majority, and Yiddish is not plotted on 
the maps. Stepped-up emigration has substantially 
reduced the German- and Yiddish-speaking com- 
munities, especially since the break-up of the USSR. 


7.3.1.8 Hellenic (Greek) 


The one language of the Hellenic branch, Greek, has 
its main focus in Greece, but there are small Greek- 
speaking communities in the area mapped, especially 
in Istanbul, in south-eastern Ukraine and near Tbilisi 
in Georgia. 


7.3.2 Altaic languages 


The unity of the Altaic phylum, comprising the 
Turkic, Mongolic and Tungusic families, is widely, 
though not universally, accepted; more controversial 
is the widening of the family to include yet further 
languages, such as Korean, which has its focus in 
Korea but also has some speakers in Central Asia 
and in the Russian Far East. Typologically, the Altaic 
languages are remarkably similar. Most have vowel 
harmony, for instance the alternation of the plural 
suffix -lar/-ler in Turkish, with adam-lar ‘men’, but 
ev-ler ‘houses’. Word order is usually rather strictly 
verb-final, with postpositions and noun-final order 
in the noun phrase too. Widespread use is made of 
non-finite constructions (participles, nominalizations, 
converbs), corresponding in function to English finite 
subordinate clauses. 


7.3.2.1 Turkic languages 


Turkic languages predominate in Turkey, Central Asia 
and parts of the Caucasus (especially Azerbaijan), in 
addition to spilling over into other Balkan countries 
(especially Bulgaria) and having outliers in various 
parts of Russia (e.g. the Volga, southern Siberia, east- 
central Siberia). The precise number of languages to 
be distinguished and their internal classification 
are controversial; the classification adopted here — 
with branches for Southern, Western, Eastern and 
Northern Turkic, and a fifth branch containing just 
one language, Chuvash — makes minimal claims 
about the grouping of low-level units into larger 
units. 

Chuvash is a Turkic language spoken in east- 
central Russia, around the middle Volga. 

Southern Turkic (South-western Turkic, Oghuz) 
comprises Turkish (predominantly in Turkey, also 
Bulgaria, Cyprus, substantial recent émigré popula- 
tion in Germany), Azerbaijani or Azeri (Iran and 
Azerbaijan, with perhaps rather more speakers in Iran 
than in Azerbaijan), Gagauz (in Moldova, with some
spill-over into Ukraine; in the Balkans, Gagauz is
usually included under Turkish), Crimean Tatar
(Crimea; including the Krymchak dialect) and Turkmen
(Turkmenistan, with perhaps half as many speakers
again scattered throughout the Middle East).

Western Turkic (Kipchak) can be divided into 
three sub-groups. The Ponto-Caspian (Kipchak- 
Cuman) subgroup consists of Karaim (spoken near 
Trakai in Lithuania, though being rapidly assimilated 
to Russian; there is a traditional written language 
using Hebrew script, now restricted to liturgical 
purposes), Kumyk (northern Caucasus) and 
Karachay-Balkar (northern Caucasus); the Uralian 
(Kipchak-Bulgar) subgroup comprises Tatar (around 
the middle Volga, but also with considerable scatter- 
ing well into Siberia), some Siberian varieties of 
which are sufficiently distinctive to be considered 
separate languages — for example Baraba Tatar and 
especially Chulym Tatar or Chulym Turkic, the latter 
transitional to Northern Turkic — and Bashkir 
(middle Volga); lastly, the Central Turkic (Kipchak- 
Noghai, Aralo-Caspian) subgroup contains Nogai, 
including varieties separately listed in the 2002 
Russian census as Astrakhan Nogai, Yurt Tatar and 
Alabugat Tatar (Caucasus), Karakalpak (western 
Uzbekistan), Kazakh (Central Asia (Kazakhstan), 
with a smaller number in China) and Kyrgyz 
(Kirghiz) (Central Asia (Kyrgyzstan, some spill-over 
into Tajikistan), with smaller numbers in China, 
Afghanistan). 

Eastern Turkic (Karluk) refers to Uzbek 
(Uzbekistan, parts of Tajikistan; substantial group 
also in Afghanistan, small group in China) and 
Uighur (mainly China; some speakers in Central 
Asia). 

Northern Turkic (Eastern Hunnic) includes Tuvan 
(southern Siberia, some also in Mongolia), Tofalar 
(also known as Tofa, Karagas; southern Siberia), 
Yakut (or Sakha; found in east-central Siberia), 
Khakas (Abakan Tatar, Yenisei Tatar) (southern 
Siberia), Shor (southern Siberia) and Altai (Oirot) 
(southern Siberia). Dolgan is now often considered a 
distinct language, rather than a variety of Yakut. 
Chelkan, Kumandi, Teleut and Tubalar are likewise 
often considered languages distinct from Altai, rather 
than varieties thereof. 


7.3.2.2 Mongolic languages 


The main focus of the Mongolic language family is 
Mongolia and northern China. Two Mongolic lan- 
guages are languages of Russia. Buryat is spoken in 
southern Siberia, with a smaller number of speakers 
in China. Kalmyk, properly a variety of Oirat, is 
spoken to the north of the Caucasus. Other Oirat 
speakers are found in China, where both Buryat and 
Kalmyk are considered dialects of Mongolian. For 
more detail on the Mongolic languages, see Section 
7.4 on the Mongol speakers of Russia, below. 


7.3.2.3 Tungusic (Manchu-Tungusic) 
languages 


The Tungusic languages are spoken in eastern Siberia, 
including Sakhalin island, and north-eastern China, 
with some spill-over into Mongolia. The precise 
number of languages and their subgrouping are con- 
troversial; the following are the distinct languages 
considered to be spoken in Russia. 

Northern (North-western, Siberian, Evenki) Tun- 
gusic comprises Evenki (Tungus) (central Siberia, 
with some spill-over into Mongolia), Even (Lamut) 
(eastern Siberia) and Negidal (eastern Siberia). 

Southern (Amur) Tungusic consists of two sub-
groups: the Nanai subgroup, which comprises Nanay
(Gold) (eastern Siberia; some use as a written lan-
guage, using the Cyrillic alphabet), Ulcha (Olcha)
(near the mouth of the Amur River in eastern Siberia) and
Orok (Ulta) (Sakhalin island); and the Udihe
subgroup, which comprises Udihe (Udehe) (south-eastern
Siberia) and Oroch (south-eastern Siberia).


7.3.3 Uralic languages 


Most Uralic languages are spoken exclusively or 
predominantly in the area mapped, though the two 
languages with the largest number of speakers, 
Hungarian and Finnish, are spoken primarily outside 
the area mapped. Typologically, the more eastern 
Uralic languages are very similar to Altaic languages, 
while the more western languages make more use 
of Subject-Verb-Object word order and of finite
subordinate clauses. Vowel harmony, however, is 
scattered throughout the family. 

The family tree of the Uralic languages reflects a 
successive splitting off of the various branches, first 
Samoyedic, then Ugric (though the unity of Ugric, 
especially the relation between Hungarian and Ob- 
Ugric, is the subject of debate), then Permic, then 
Volgaic (though the unity of Volgaic is unclear, 
with Mordvin perhaps closer to Balto-Finnic than to 
Cheremis), then Saami (Lappish), leaving finally 
Balto-Finnic. The composition of the individual 
branches is given below. 


7.3.3.1 Samoyedic 


The Northern Samoyedic subgroup consists of 
Nenets (Yurak Samoyed) (in northern Russia; some 
use as a written language, using the Cyrillic alphabet), 
Enets (Yenisey Samoyed) (northern Siberia) and 
Nganasan (Tavgi Samoyed) (northern Siberia); the 
Southern Samoyedic subgroup has just one language, 
Selkup (Ostyak Samoyed) (western Siberia). 


7.3.3.2 Ugric 


Hungarian is the main language here (predominantly 
in Hungary, but sizeable minorities in Romania,
Slovakia, Serbia, some speakers in western Ukraine); 
the Ob-Ugric subgroup contains Khanty (Ostyak) 
(western Siberia) and Mansi (Vogul) (western 
Siberia). 


7.3.3.3 Permic 


There are three languages in this branch: Komi 
(Zyryan) (north-eastern European Russia), Komi- 
Permyak (north-eastern European Russia) and Udmurt 
(Votyak) (north-eastern European Russia). 


7.3.3.4 Volgaic 


This includes Mordvin (Volga; two distinct lan- 
guages, Erzya and Moksha, are recognized) and Mari 
(Cheremis) (Volga). 


7.3.3.5 Saami 


Also called Lappish, this is properly a number of dis- 
tinct, mutually unintelligible languages, spoken in 
northern Scandinavia and the Kola peninsula of 
Russia; Russian Saami speakers speak Eastern Saami, 
most of them the Kola dialect. 


7.3.3.6 Balto-Finnic 


This branch consists of Finnish (Suomi), found 
predominantly in Finland, with a small number of 
speakers in Russia and Estonia (Karelian and Ingrian 
are linguistically close to Finnish but are usually con- 
sidered distinct languages), Karelian, spoken in 
the far north-west of European Russia (the varieties 
Olonets — in Russian: livvikovskij — and Liidian are 
sometimes considered separate languages), Ingrian 
(Baltic coast to the west of St Petersburg), Veps 
(north-west European Russia, in areas to the east of 
Lake Ladoga and south of Lake Onega), Votic 
(between St Petersburg and the Estonian border) and 
Estonian (predominantly in Estonia, with small 
émigré communities especially in Sweden and North 
America). South Estonian, spoken in south-eastern 
Estonia, is perhaps sufficiently distinctive to be con-
sidered a separate language. Livonian, formerly
spoken in the Courland peninsula in Latvia, recently
became extinct. 


7.3.4 Caucasian languages 


The languages of the Caucasus that do not form part 
of any of the major families represented elsewhere 
(e.g. Indo-European, Turkic) fall into three groups: 
Kartvelian (South Caucasian), West Caucasian and 
East Caucasian. Whether West and East Caucasian 
together form a North Caucasian phylum is contro- 
versial; Kartvelian is no longer widely thought to 
be related to the northern languages. Virtually all 
Caucasian languages are spoken exclusively or pre- 
dominantly in southernmost Russia, Georgia and 
Azerbaijan; the only exceptions are Ubykh, whose 
speakers emigrated to Turkey and are now completely 
assimilated linguistically by Turkish, and Laz, nearly 
all of whose speakers are to be found in Turkey; emi- 
gration has taken some speakers of other languages, 
especially Circassian, to Turkey and the Middle East. 
The composition of the three groups is detailed 
below. 

Typologically, the Caucasian languages are far 
from uniform, and the following list of features 
widespread in the area should not be taken as 
characteristic of each individual language or group. 
Consonant inventories tend to be large, and one of
the West Caucasian languages, Ubykh, ranks close to
the world record, though conversely West Caucasian
languages approach world records for the smallness
of their vowel inventories; the Kartvelian languages,
by contrast, have smaller consonant inventories, but 
tolerate extensive consonant clustering. Nominal case 
systems usually include an ergative case, ergative 
morphology being widespread among Caucasian lan- 
guages. Noun class systems are more or less restricted 
to East Caucasian, but are widespread there. East 
Caucasian languages tend to have rich case inven- 
tories and restricted indexing of noun phrases in the 
verb, while West Caucasian represents the opposite 
type with few case distinctions and rich possibilities 
for indexing noun phrases in the verb by means 
of pronominal affixes, as in Abkhaz s-b-oyt' ‘you 
(FEMALE) see me’, where s- encodes the first person 
singular direct object, b- the second person singular 
feminine transitive subject. Widespread use is made 
of non-finite verb forms, except in Kartvelian. Word 
order is usually basically verb-final, with postposi- 
tions and with genitives preceding their head noun; 
adjectives usually precede their head noun, except in 
West Caucasian, where they usually follow. 


7.3.4.1 Kartvelian (South Caucasian) 


Four languages are usually recognized, though Min- 
grelian and Laz are sometimes treated as dialects of a 
single language, Zan. Speakers of Svan and Min- 
grelian consider themselves ethnic Georgians, and 
use Georgian as their written language. Georgian 
is spoken predominantly in Georgia, with smaller 
communities in Turkey and Iran. Svan and Min- 
grelian (Megrelian) are spoken in north-western 
Georgia. Laz (Chan) is spoken primarily in northern 
Turkey, with some spill-over into Georgia. 


7.3.4.2 West Caucasian (North-west 
Caucasian, Abkhaz-Adyghe) 


Abkhaz and Abaza form one subgroup; indeed, on 
structural grounds, Abaza could be regarded as a dia- 
lect of Abkhaz. In Russia, Adyghe (West Circassian) 
and Kabardian (East Circassian) are considered dis- 
tinct languages, while in Turkey they are grouped 
together as a single Circassian language. Circassian 
also has some use as a written language outside 
Russia, usually using the Latin alphabet. Abkhaz is 
spoken in Abkhazia; Abaza is spoken in the northern 
Caucasus; Adyghe (West Circassian) and Kabardian 
(Kabardo-Cherkessian, East Circassian) are spoken in 
the northern Caucasus, and also in Turkey; Ubykh, 
recently extinct, was spoken in Turkey, in the village 
of Hacı Osman Köyü, near the Sea of Marmara.


7.3.4.3 East Caucasian (North-East Caucasian,
Nakh-Daghestanian) 


The subgrouping presented here is provisional; in 
particular, the inclusion of Tsezic with Avar-Andic 
in a single group has been questioned, as has the 
unity of Lak-Dargi and the inclusion of Khinalugh in 
Lezgic. The non-Nakh languages are usually grouped 
together as Daghestanian. 

In the Nakh subgroup, Chechen and Ingush are
both written languages of the northern Caucasus, 
using the Cyrillic alphabet, and are co-official lan- 
guages of Chechnya and Ingushetia respectively. Bats 
(Tsova Tush) is spoken in northern Georgia. 

Avar-Andic-Tsezic consists of Avar (north- 
western Daghestan; also used as a lingua franca by 
some other smaller Daghestanian speech com- 
munities), the Andic languages (to the west of Avar) — 
namely Andi, Botlikh, Godoberi, Karata, Akhvakh, 
Bagval, Tindi and Chamalal, of which Tindi and 
Bagval are very close, likewise Botlikh and Godoberi — 
and the Tsezic (Didoic) languages (to the south of the 
Andic languages): Tsez (Dido), Khwarshi, Hinuq,
Bezhta (Kapuchi), Hunzib. 

Lak-Dargi (central Daghestan) includes just 
Lak and Dargi (Dargwa), the latter including some 
varieties sufficiently divergent to be considered dis- 
tinct languages. 

Finally, the Lezgic languages (south-eastern Dagh- 
estan, spilling over into northern Azerbaijan) are Archi, 
Tabasaran, Agul, Rutul, Tsakhur, Budukh, Khinalug, 
Udi (northern Azerbaijan, eastern Georgia), Lezgian 
and Kryz (Qriz, Dzhek). 


7.3.5 Paleo-Siberian and other languages 


This section deals with languages spoken in the area 
mapped, but not belonging to any of the families dis- 
cussed above. The term Paleo-Siberian (Paleo-Asiatic) 
refers to languages spoken in Siberia and not belong- 
ing to any of the other language families or phyla 
(Indo-European, Altaic, Uralic); no claim of genetic 
unity is implied. Recognized language families within 
Paleo-Siberian are: Chukotko-Kamchatkan, Ket (now 
an isolate, the sole survivor of the Yeniseian family, 
following the recent extinction of Yugh), Nivkh 
(isolate) and Yukaghir (isolate); further sometimes 
included within Paleo-Siberian is Eskimo-Aleut. 
Other language families encroaching on the area 
mapped are Semitic (a branch of Afro-Asiatic) and 
Sino-Tibetan. 


7.3.5.1 Chukotko-Kamchatkan 


Five distinct languages are usually recognized. Kam- 
chadal is the most divergent member of the family; 
earlier, Kamchadal subsumed at least three different 
languages or highly divergent dialects, of which 
only Western Kamchadal survives. Characteristic 
typological features include a dominant-recessive 
vowel harmony system, ergative morphology and 
widespread use of incorporation, though in 
Kamchadal these features are apparent as vestiges 
at best. Both vowel harmony and incorporation 
are illustrated by alternative Chukchi renderings 
of ‘they set the net’: without incorporation of ‘net’ 
as kupren nantovatg-an, with incorporation as 
koprantovatg-at. 

The Chukotko-Kamchatkan languages are all 
spoken in north-eastern Siberia: Chukchi
(Chukchee), Koryak, Alutor, Kerek and Kamchadal 
(Itelmen). 


7.3.5.2 Ket 


A member of the Yeniseian family, Ket is now a lan- 
guage isolate, following the extinction of the other 
languages of the family. Ket is spoken on the middle 
Yenisey river, in central Siberia. It is typologically 
quite divergent from other neighbouring languages, 
having phonemic tone, a gender system and a com- 
plex verbal morphology. 


7.3.5.3 Nivkh (Gilyak) 


Nivkh is a language isolate spoken at the mouth of 
the Amur River and on Sakhalin island, in eastern 
Siberia. The Amur and Sakhalin varieties are quite 
divergent from one another. Nivkh is characterized 
phonologically by a rich system of morphophonemic 
alternations of initial consonants. Morphologically 
and syntactically, it is similar to Altaic languages. 


7.3.5.4 Yukaghir 


Yukaghir, spoken in north-eastern Siberia, is a lan- 
guage isolate, though the possibility of a genetic link 
to the Uralic family has gained considerable support. 
The two main varieties of the language, Tundra 
Yukaghir and Kolyma Yukaghir, differ enough to be 
perhaps considered distinct languages. The grammat- 
ical typology of Yukaghir is quite similar to the east- 
ern Uralic languages (or the Altaic languages). One 
interesting feature is the marking of focus in the
nominal and verbal morphology, e.g. met-ek uu-l
‘I went’ (focus on ‘I’), but met mer-uu-jen ‘I went’ (focus on ‘went’).


7.3.5.5 Eskimo-Aleut 


The Eskimo-Aleut family has two branches, Eskimo 
and Aleut. Eskimo is properly a number of languages 
rather than dialects, with a major division between 
Yupik and Inuit (Inupiaq). It is spoken predomin- 
antly in Alaska, northern Canada and Greenland, but 
also on the eastern tip of Russia. Eskimo speakers in 
Russia are all speakers of Siberian Yupik — also spoken 
on St. Lawrence Island, Alaska — following the death 
of the last speaker of the highly divergent Sireniki 
Eskimo in 1997. Most speakers of Aleut live in the 
Aleutian islands (Alaska), though there are some 
speakers on the Commander Islands off the coast 
of north-eastern Siberia, including speakers of an 
interesting Aleut-Russian mixed language on Mednyj
(Copper) Island. 


7.3.5.6 Semitic 


Two languages from the Semitic branch of Afro- 
Asiatic encroach on the area mapped. Arabic is 
spoken predominantly in the Middle East and North 
Africa, though also in parts of Turkey, and there 
are some speakers remaining in Tajikistan and 
Uzbekistan. Aramaic (Assyrian, Aisor) is spoken 
mainly in northern Iraq and adjacent parts of Iran 
and Syria, but there are some speakers in the South 
Caucasus. 


7.3.5.7 Sino-Tibetan 


Dunganese could structurally be considered a dialect 
of Mandarin Chinese, but it is recognized as a distinct 
language in Kyrgyzstan and Kazakhstan, in part 
reflecting the ethnic distinctiveness of the Dunganese 
(Hui), most of whom live in China. 


7.4 The Mongol speakers of Russia


Alan Sanders


The Buryat (buryaad) were conquered by Jochi, son
of Genghis Khan, in 1207 and colonized from the
seventeenth century onwards by Russians. They live
mostly in the Buryat Republic and the autonomous
districts centered on Aginskoye (Chita region) and
Ust-Ordynsky (Irkutsk region), which in early Soviet 
times were both part of the Buryat-Mongol Autono- 
mous Republic. Generally there is a division between 
western and eastern dialects and cultures, the Buryat 
from west of Lake Baikal having been converted to 
Russian Orthodoxy, while those from east of the lake
were converted from shamanism to Lamaism and
used the old Mongol script as the principal literary
medium. The main ethnic groups are the Bulagat,
Ekhirit and Khori Buryat. The Bulagat (Bulgad) 
settled on the Lena River and its tributaries (north- 
west of Lake Baikal); their name may mean ‘sable 
hunters’ or be a variant of Buryat. The Ekhirit 
(Ehrid) settled on the Barguzin River and tributaries 
(north-east of Lake Baikal); their name may mean 
‘squirrel hunters’. They converted to Orthodoxy 
officially, as they had been converted earlier to 
Lamaism, but remained shamanists. The Alar Buryats 
from western areas of the republic are semi-sedentary 
and live in hexagonal wooden gers (houses shaped 
like the traditional round felt tents). The Khori 
Buryat (Khori Tümed), from the Uda valley south-
east of Lake Baikal, nomadized into Mongolia in the
mid-seventeenth century, moved to the Nerchinsk 
area east of Chita on the Shilka River some 25 years 
later, and finally settled in the Aginskie steppes to the 
south in the eighteenth century. 

Buryat has numerous dialects (named after the
areas where their speakers live), and the language has
preserved some ancient features (e.g. personal endings
on verb forms) and Middle Mongol words
(twelfth- to sixteenth-century) found in the Secret 
History of the Mongols. The Secret History is the oldest 
extant Mongol account of the life and times of 
Genghis Khan. Part legend and part family chronicle, 
it was recorded some time after his death in 1227. The 
Tsongool dialect of Buryat, somewhere between 
Buryat and Khalkha, is spoken in the Noikhan area, at 
the confluence of the Selenga and Khilok rivers. The 
Aga Buryat dialect is spoken in Aginskoe (autono- 
mous district), famous for its Aga lamasery (datsan), 
a former part of the Buryat Republic which was 
absorbed into Chita region. Some Buryat are Buryat- 
ized Tuva or Tungus. 

There were unsuccessful attempts to introduce 
the Cyrillic script amongst the Buryats in Russia in the 
nineteenth century, but they continued to write in 
Buryatized classical (literary) Mongol until the intro- 
duction of Jamtsarano’s Latin script (with one extra 
letter) in 1929-31. This was displaced by a Cyrillic 
alphabet, with three additional letters, in 1938. 

The Buryat were shamanist rather than Buddhist, 
but were brought by Russian influence to Orthodox 
Christianity, especially those dwelling west of Lake 
Baikal. Officially promoted as the Buryat national 
epic since the 1940s, the Geser is actually of Tibetan 
origin. After the 1917 revolution the western Buryat 
tribes became part of Soviet Russia as the Buryat- 
Mongol republic, while the eastern Buryat came 
under the short-lived Far Eastern Republic as the 
Mongol-Buryat republic. Reunited in the USSR, the 
Buryat territory became the Buryat-Mongol 
Autonomous Soviet Socialist Republic (ASSR), but 
the word ‘Mongol’ was later removed amidst Soviet 
fears of ‘Pan-Mongolism’. 

Some Jungarian or Oirad Mongols settled in 
Buryat territory in the fifteenth century. More settled 
in the area of Nizhneudinsk (western Irkutsk region) 
and the former Zungarbukhotsky district in the 
seventeenth and eighteenth centuries, after the 
destruction of the Jungarian state. 

The Kalmuck or Kalmyk (khalimag) are Jungar- 
ian Oirad Mongols whose ancestors were the 
Khoshuud and Torguud, Ööld and Dörvöd, some
300,000 of whom settled on the Volga in Russia in 
1616 to escape the Qing (Manchu). It is suggested 
that the name khalimag means ‘those who stayed 
behind’ (as the ice broke up on the Volga), when 
125,000 of them returned to Jungaria in 1771. Those 
who remained in Russia now live mostly in the 
Kalmyk Republic (Khal'mg Tangch), the former 
Kalmyk ASSR, in the Caspian steppes. After Stalin’s 
abolition of the Kalmyk ASSR in December 1943 for
alleged wartime collaboration with the Germans, they 
were settled in other areas of the Volga valley (Astra- 
khan, Rostov, Volgograd and Stavropol) and parts of 
Siberia (Orenburg) and Central Asia. The Kalmyk 
republic was restored in 1957, but as a result of the 
scattering of the population only about 10 per cent 
still spoke their mother tongue. Since the collapse of 
the USSR, the Kalmyk have been encouraged to 
restore their culture and language, which contains 
many Russian loans. The Kalmyk national epic is 
the Jangar, which appeared in the eighteenth century 
and contains Buddhist elements, though the Kalmyk 
were originally shamanists. The Kalmyks used the 
modified classical Mongol script (tod bichig) until the 
introduction of a modified Cyrillic script, followed by 
Romanization (1931-8), and then a new Cyrillic script. 

The Oirad dialect is spoken by ‘Sart Kalmuck’, 
descendants of Ööld and Torguud Mongols who in
1880 emigrated from Jungaria to Chelpek and Börü
Bashi, east of Lake Issyk Köl, in the modern
Kyrgyzstan Republic. 

The Khalkh (Halh) Mongols living in Russia do 
not have an administrative territorial unit of their 
own, having mostly gone to the USSR (1924-90) to 
study or work there, and stayed on. 

The Oirot (Oirad) or Altai-Kizhi of the southern 
Altay Republic are partly Mongolized Turks. Their 
language is Turkic, but some of the vocabulary of 
livestock herding, for example, like the lifestyle of the 
herdsmen, is basically Mongol, under the influence of 
the successive rule of the Mongol Empire and the 
Jungarian khanate. The classical Mongol script was 
used until 1929, when they adopted a Latin script 
with six extra letters; they were then obliged in 1937 
to adopt a Cyrillic script (two extra letters). Their 
territory was called the Autonomous Oirat region 
from 1922, and renamed the Gorny (mountainous)
Altai in 1948, the Russian name of the people, altaitsy,
being derived from this. 

The Tuva (Urianghai, Soyot) of the Republic of 
Tuva (Tyva, former Tuvan ASSR) are also partly 
Mongolized Turks. Like the Altay, their traditional 
lifestyle and culture are similar to that of the Khalkh 
Mongols. Their language is Turkic, but until the 
twentieth century it had no written form, and literate 
Tuva read and wrote the classical Mongol (Uighur) 
script or its ‘clear script’ (Oirad) variant. The
administrative divisions of Tuva are still called kojuun 
(banners). The central dialect is the basis of the 
national spoken and literary language. The north- 
eastern dialect (Todzhi), with strong development of 
vocabulary reflecting a hunting way of life, displays 
nasal coloration. The south-eastern dialect shows 
considerable Mongol influence, while the western dia-
lect shows interaction with Altai dialects like
Telengit.

After experimenting with a Latin script from 
1930, the Tuva were given a Cyrillic script in 1940. 
Like Mongolia, the Urianghai territory was part of 
the Qing Empire, whose governor was resident in 
Uliastai, but Russian and Soviet expansion led to the 
creation of a quasi-independent Tannu Tuva People’s 
Republic in 1924. This was annexed to the Soviet 
Union in 1944. 


8.1 Introduction


The great majority of the languages dealt with in 
this section belong to the Indo-European (or 
Indo-Hittite) family. Finnish, Karelian, the Saami 
languages and Hungarian, however, are members of 
the Uralic(-Yukaghir) family; Turkish, Gagauz and 
Tatar are Turkic languages attributed by some to the 
Altaic family; Maltese, Arabic and Berber are of the 
Afro-Asiatic family; and Basque, which has no con- 
vincingly established relationship with any other 
language, thereby counts as a language isolate, i.e. a
one-member family. The situation is further com- 
plicated by the relatively recent immigration of 
speakers of languages from other families such as 
Sino-Tibetan (Chinese), Eskimo-Aleut (Greenlandic) 
and Dravidian (Tamil). 

The Indo-European family has, in the view of the 
majority of scholars (cf. Ruhlen 1987: 56), ten 
branches, two of which — Anatolian and Tocharian — 
are extinct. Of the remaining eight, six are repre- 
sented in the area mapped: 


1 Hellenic
2 Italic
3 Celtic
4 Balto-Slavic
5 Germanic
6 Albanian


and the two other branches — Indo-Iranian and 
Armenian — are represented by (immigrant and 
nomadic) languages not localized on the maps, but 
mentioned in the accompanying texts: Indo-Iranian, 
comprising Indic (Romani, in various forms; Hindi, 
Urdu, Gujarati, Bengali, Sylheti, Panjabi, Parsi and 
Kashmiri) and Iranian (Kurmanji, Kirmanjki and 
Farsi); and Armenian (Armenian). 

Within each branch, the Indo-European lan- 
guages mapped may be related as follows: 


1 Hellenic
    Greek
    Tsakonian
2 Italic
    Romance
        Balkan Romance
            Rumanian
            Megleno-Rumanian
            Macedo-Rumanian
            Istrorumanian
        Italo-Romance
            Italian
            Sardinian
        Rhaeto-Romance
            Romansch
            Friulian
            Ladin
        Gallo-Romance
            French
            Occitan
            Ch. Is. French
        Ibero-Romance
            Catalan
            Spanish
            Galician
            Portuguese
            Kabuverdianu (Crioulo)
3 Celtic
    Brythonic
        Welsh
        Breton
    Goidelic
        Irish
        Scottish Gaelic
4 Balto-Slavonic
    Slavonic
        East
            Ukrainian
            Russian
        West
            Sorbian
            Polish
            Slovak
        South
            Bulgarian
            Macedonian
            Serbian
            Croatian
            Slovene
5 Germanic
    North
        West
            Icelandic
            Faroese
            Nynorsk
        East
            Bokmål
            Danish
            Swedish
    West
        English
        Frisian
            West Frisian
            East Frisian
            North Frisian
        Netherlandic/German
            Dutch
            German
            Letzeburgesch
6 Albanian
    Albanian
        Tosk
        Gheg


The Uralic-Yukaghir languages mapped all belong to 
the Finno-Ugric branch of the Uralic group, and are 
related as follows: 

Finno-Ugric
    Ugric
        Hungarian
    Finnic
        North Finnic
            Lappic
                Saami
            Baltic Finnic
                Finnish
                Karelian
                Livvi

Inevitably, the maps that follow present a 
somewhat oversimplified picture of the current dis-
tribution of languages in the area covered. Firstly, the
lines representing language boundaries often indicate
political rather than purely linguistic divisions: thus, 
at ground level, there is a gradual continuum and not 
a sharp divide between dialects/regional languages on 
either side of, say, the Dutch-German or the Spanish- 
French border, though it should be said that the 
inhabitants generally adhere to the nationally recog- 
nized written languages prevailing on their side of the 
border and are increasingly orienting themselves in 
oral communication to the national norms. 

Secondly, it has not been possible to indicate on the 
maps the extent to which the major agglomerations of 
Europe have become multilingual as a result of recent 
population shifts within the continent and of immigra- 
tion from other continents; some statistics are given, 
however, in the texts accompanying relevant maps. 

Thirdly, the maps give little impression of the 
degree to which Europeans are becoming steadily 
more competent in languages other than their native 
tongue as a result of education, international com- 
merce and communications, increased professional 
and recreational travel, and the effects of the mass 
media. Currently, English is clearly the most pres-
tigious ‘second language’ in this respect, with French
still playing an important role in European Union 
administration — and German, as the language 
spoken natively by more Western Europeans than any 
other, now also claiming greater recognition. 

Finally, the maps do no more than hint at the 
extent to which speakers of autochthonous minority 
languages (there are few states in Western Europe that 
are without linguistic minorities) are currently seeking 
to (re-)establish their identity, demanding substantial 
recognition in government, education and the media. 
There can be no doubt, however, that the survival of 
many of these languages through the twenty-first cen- 
tury is threatened by a deadly combination of, on the 
one hand, official apathy (in some cases, even wilful 
neglect) and, on the other, lack of interest among the 
population in preserving the endangered language, 
which they may see as more of a liability than an 
advantage. 

Major sources of information in compiling the 
maps have been Dauzat (1953), Straka (1979), Ruhlen 
(1987) and Gordon (2005); much of the statistical 
information is drawn from the web-based version of 
Gordon (2005). 


8.2 Albania, Kosovo/Kosova, 
Greece and Cyprus 


Albanian and Greek are alike in that each is the sole 
surviving representative of a branch of Indo- 
European. Furthermore, both languages are generally 
recognized to be members of the Balkan ‘linguistic 
area’: under mutual influence, Romanian, Bulgarian, 
Macedonian, Greek and (especially Tosk) Albanian 
have developed a number of shared structural features. 


8.2.1 Albania 


Albanian (known to its speakers as shqip) has two 
major dialects, Gheg and Tosk, spoken within 
Albania to the north and south respectively of the 
Shkumbin River; Gheg is also used in Kosovo/Kosova 
and in parts of the Former Yugoslav Republic of 
Macedonia, while it is Tosk that is used by the com- 
munities of Albanians in Greece, mainland Italy and 
Sicily. Tosk is the basis for the contemporary official 
language, which since 1909 has been written in a 
Roman alphabet. The development of Albanian has 
been heavily influenced by Latin, Modern Greek, Ser- 
bian, Turkish and Italian, to such an extent that its 
direct historical derivation from Indo-European has 
been somewhat obscured. There is a substantial 
Greek-speaking minority, and in southern Albania 
there is a group of some 400,000 ‘Armini’, of whom 
about 12 per cent speak Macedo-Rumanian or Aru- 
manian, a language which has been distinct from 
other Rumanian for 1,000 to 1,500 years. 


Table 8.1 Languages spoken in Albania 


Language Numbers of speakers (date of source) 
Tosk Albanian 3,119,000 (1998) 
Gheg Albanian 300,000 (1980) 
Greek 60,000 (1989) 
Vlach Romani 60,000 (1991) 
Macedo-Rumanian 50,000 (1995) 
Macedonian 15,000 (2001) 


8.2.2 Kosovo/Kosova 


Kosovo/Kosova (the Serbian and Albanian names 
respectively), an autonomous province of Serbia, is 
currently being administered by the United Nations. 
Until the conflict of 1999, Kosovo/Kosova had a 
90 per cent Albanian population (1.8m), who spoke 
Gheg Albanian. A large number of these were forced 
to flee to neighbouring countries such as Albania, 
Macedonia, Montenegro and Bosnia-Herzegovina. 
Current numbers of Albanian speakers are uncertain. 


8.2.3 Greece 


Greek is spoken by 98.5 per cent of the population, 
both on the mainland and throughout the Greek 
islands. It is characterized by a wide variety of dia- 
lects, all of which (except Tsakonian, used by some 
300 shepherds in the eastern Peloponnese) are ultim- 
ately derived from the koine (the spoken Greek of 
approx. 350 BC to AD 400). Alongside these dialects, a 
written and spoken form of Greek, Demotic (dhimo- 
tiki), which resembles the dialects of the most popu- 
lous areas of mainland Greece, has established itself as 
the normal form of communication in Greece. It 
is Demotic that is ‘used throughout the education 
system, in government, journalism and broadcasting’, 
but katharévousa, an archaising form of the language 
that retains several properties of Ancient Greek, 
persists ‘in the legal profession, the army, and the 
Orthodox Church’ (Price 1998: 203; see also Trudgill 
2000: 246-50). Pontic, a form of Greek originally 
spoken on the Black Sea but now unintelligible to 
Greek speakers, has established itself in Greece with 
some 200,000 speakers in and around Salonica. All 
forms of Greek are written in the Greek alphabet. 

Greece houses speakers of Albanian, Bulgarian, 
Macedonian (known as Slavic in Greece), Macedo- 
Rumanian (or Arumanian), Megleno-Rumanian and 
Turkish, as well as Romani, Ladino (Judeo-Spanish) 
and Armenian (see Trudgill 2000: 250-60). 


Table 8.2 Languages spoken in Greece 


Language Numbers of speakers (date of source) 
Greek 9,859,850 (1986) 
Macedo-Rumanian 200,000 (1995) 
Macedonian/Slavic 180,180 (1986) 
Arvanitika Albanian 150,000 (2000) 
Turkish 128,380 (1976) 
Balkan Romani 40,000 (1996) 
Bulgarian 30,000 (1986) 
Tosk Albanian 10,000 (2002) 
Megleno-Rumanian 3,000 (2002) 
Vlach Romani 1,000 


8.2.4 Cyprus 


The divided island of Cyprus has a majority of 
Greek speakers, largely in the south, and a substantial 
minority of Turkish speakers, largely in the north. 
There are also small and rapidly diminishing 
numbers of Arabic (in the village of Kormakiti) and 
Armenian speakers. Cypriot Greek has retained 
certain archaic features. 


Table 8.3 Languages spoken in Cyprus 


Language Numbers of speakers (date of source) 
Greek 578,000 (1995) 
Turkish 177,000 (1995) 
Cypriot Arabic 1,300 
Armenian 2,740 (1987) 


8.3 Romania 


The official language of Romania is Rumanian 
(18,741,356 speakers in 2002). (Note: The spelling 
‘Rumanian’ has been adopted here for the name of the 
language to reflect the phonological development of 
unstressed Latin /o/ to Rumanian /u/, whereas the 
country will be indicated as Romania in keeping with 
its adoption of România as its official name.) 
Since the extinction of Dalmatian, spoken near 
Dubrovnik in Croatia, in 1898 (though Istriot, still 
spoken by no more than 1,000 people in the villages 
of Rovinj and Vodnjan on the Istrian peninsula in 
Croatia, may be related to Dalmatian), the various 
forms of Rumanian are the sole surviving member of 
the group of Romance languages known as Balkan 
Romance. The major language of Romania and of the 
adjacent Moldova is Daco-Rumanian, in contra- 
distinction to the Macedo-Rumanian (or Arumanian) 
spoken in Albania, Bosnia-Herzegovina, Bulgaria, 
Greece, the Former Yugoslav Republic of Macedonia 
and Serbia, the Megleno-Rumanian spoken north of 
Salonica (Greece) and the Istrorumanian of the Istrian 
peninsula (the latter not to be confused with Istriot). 

Alongside the speakers of Rumanian and of 
Macedo-Rumanian (especially in the south-east 
province of Dobrudja), Romania is inhabited by 
speakers of various other languages, as shown in 
Table 8.4 (figures from the 2002 census), as well as 
small numbers of speakers of Bulgarian (6,747), 
Greek (4,146), Czech (3,339), Polish (2,755) and Yid- 
dish (1,100). The speakers of Hungarian are con- 
centrated in the Transcarpathian provinces, part of 
Hungary in the nineteenth century but incorporated 
by the Treaty of Trianon into Romania after the First 
World War (and again after the Second World War). 
There have been German speakers in Transylvania, 
and further south, since the Middle Ages; this 
German enclave was one of the largest of its kind in 
Eastern Europe, but in recent years has declined rap- 
idly through emigration to Germany. 


Table 8.4 Minority languages spoken in Romania 


Language Numbers of speakers 
Hungarian 1,447,544 
Vlach Romani 241,617 
Ukrainian 57,762 
German 45,129 
Russian 29,890 
Turkish 28,714 
Serbian/Croatian 26,732 
Crimean Tatar 21,482 
Slovak 16,108 
Note: All figures from 2002 census 


The major dialect division within Rumanian is 
between Moldavian (in the north and in Moldova) 
and Wallachian (in the south), the latter being the 
basis of the standard language. Rumanian manifests 
many typical characteristics of Romance languages; 
at the same time, its structure shows a number of 
areal features, that is, structural characteristics shared 
by various geographically adjacent but genetically 
distant languages. Unsurprisingly, the vocabulary of 
Rumanian has undergone considerable influence 
from the languages of the adjacent peoples. Although 
the basic lexical stock remains Latin in origin, 
Rumanian has imported many words from Slavonic, 
but also from Turkish and Albanian; in the last 200 
years, there has been conscious borrowing from 
French with a view to ‘reromanizing’ the vocabulary, 
and very recently English has become a major source 
of vocabulary extension, many under the watchful eye 
of the Romanian Academy. 


8.4 Hungary and 
Hungarian-speaking areas 


Hungarian, called by its speakers Magyar, is the 
tongue of the great majority (98 per cent) of the 
inhabitants of Hungary. Its dominance may explain 
its lack of official status in the Hungarian constitu- 
tion. The distribution of native speakers is shown in 
Table 8.5. Hungarian is also spoken by a sizeable 
minority in Romania (1,447,544 in 2002), and in 
adjacent parts of Slovakia (597,400 in 1993, or 11.2 
per cent of the population), Serbia (in the Autono- 
mous Region of Vojvodina: 450,500 in 1986), Austria 
(22,000 in 1995) and Slovenia (9,240 in 1991), as well 
as in Ukraine (176,000 in 2001), and in Israel (70,000 
in 1998); about half a million emigrants to the USA 
(450,000), Canada (87,000) and Australia (5,750) still 
command Hungarian. In the past, the Hungarian 
kingdom was considerably more extended than 
present-day Hungary: hence the relatively large num- 
ber of Hungarian speakers in surrounding countries. 


Table 8.5 Languages spoken in Hungary 


Language Numbers of speakers (date of source) 
Hungarian 10,298,820 (1995) 
German 250,000 (1988) 
Romani 170,000 (1995) 
Rumanian 100,000 (1995) 
Croatian and Serbian 32,130 (1988) 
Slovak 11,562 (2000) 
Slovenian 4,984 (2000) 


Most German speakers dwell in the north-west 
or in the southern counties of Baranya and Bács- 
Kiskun. The majority of Romani (of whom there are 
some 600,000 in Hungary, making them the country’s 
largest minority) have adopted Hungarian as their 
native language. They are located across the entire 
country. The Rumanian speakers are concentrated in 
the south-east, and speakers of Croatian and Serbian 
in areas next to the Croatian and Serbian borders. 
Most of the minorities are bilingual in their native 
language and Hungarian; at the same time, Hungary 
has a long tradition of promoting and protecting the 
linguistic rights of its minorities. 

There are eight recognized dialects of Hungarian: 
Western, Trans-Danubian, Palóc, Southern; North- 
Eastern and Tisza (also spoken in bordering parts of 
Romania); and Transylvanian and Székely, spoken 
exclusively in Romania. Yet there is relatively little 
dialectal differentiation within the Hungarian- 
speaking community, a fact partially attributable to 
the influence of the Hungarian Academy, set up in 
1830. Furthermore, literacy in Hungary is high 
(currently 98–99 per cent). 

Hungarian belongs to the Finno-Ugric branch of 
the Uralic phylum. Within the Finno-Ugric family, 
Hungarian, together with the Ob-Ugric languages, 
makes up the Ugric grouping. But whereas the 
Ob-Ugrians are traditionally a sub-Arctic people, 
Hungarians are culturally Central Europeans; and 
Hungarian is now radically divergent from the 
Ob-Ugric languages in all aspects of its structure. At 
the same time, Hungarian remains a Uralic language 
and as such is typologically different from all the 
(Indo-European) languages that surround it. 


8.5 Germany, Austria, 
Liechtenstein and 
Switzerland 


The official language of Germany, Austria and Liech- 
tenstein is German. In Switzerland, German, French 
and Italian are official languages, with a special status 
for Romansch. 

German dialects (or regional languages) divide 
into two groups: Low German, characterized by 
the form maken for ‘make’, and High German (sub- 
dividing into Middle and Upper German), with 
machen. This is an ancient distinction, discernible 
in the earliest documents; an intermediate group 
(Central German) is also often recognized. The terms 
‘Low’ and ‘High’ do not properly refer to any differ- 
ence in prestige, but rather to the geographical distri- 
bution of the dialects, Low German in the Northern 
plains, and High German in the Southern, more 
mountainous areas. 

It was High German that was destined to provide 
the basis for the standard written language: Luther 
composed his translation of the Bible in Eastern 
Middle German (a form of High German), which he 
took to be understandable to speakers of all dialects. 
The invention of the printing press, gradual accept- 
ance of Luther’s German in the Catholic south, polit- 
ical unification in 1871 and universal education in the 
20th century have all contributed to greater linguistic 
unification; yet Barbour (2000: 157) remarks that it 
‘turns out ... to be extremely difficult to argue the 
case for a single German language ... on purely lin- 
guistic criteria’. Many speakers of German also use 
a dialect/regional language: the maps show the 
general distribution of Low Saxon, East Low German, 
Low Franconian, Upper Saxon, Central German, 
Upper Franconian, Upper High German and Bavaro- 
Austrian. 

Throughout Germany, and especially in urban, 
industrial centres, there are heavy concentrations of 
people whose native language is not German. To 
some considerable extent, these are families of origin- 
ally migrant workers (the somewhat dated German 
term is Gastarbeiter), but there are also many refugees 
from conflicts and economic penury across the 
globe. The figures in Table 8.6 indicate the largest 
groups. 

As shown on the map, there are various small 
areas with traditional minority-language speakers. 
Sorbian (more traditionally, Wendish) is a West Slavic 
language used by (largely bilingual) inhabitants of 
Lusatia; a distinction is drawn between Upper and 
Lower Sorbian (around Bautzen and Cottbus respect- 
ively). Danish (North Germanic) is spoken near the 
German border with Denmark. East Frisian (West 
Germanic) is spoken in the villages of Strücklingen, 
Ramsloh and Scharrel in Saterland, while North Fri- 
sian is used on the islands of Sylt, Föhr, Amrum and 
Helgoland, and on the mainland around Niebüll; the 
two languages are not mutually intelligible. In 2000, 
there were some 85,000 speakers of Balkan, Sinte and 
Vlach Romani in Germany. 

In Austria, the official language German is uni- 
versally spoken, with South Bavarian the prevalent 
regional language in the Tyrol and in Styria; other 
important dialects are Alemannic (300,000 in 1991) 
and Walser. The figures in Table 8.7 also indicate 
various minority languages. 


Table 8.6 Languages spoken in Germany 


Mapped 
Language No. of speakers (date of source) 
German 82,133,000 (1998) 
Sorbian 29,000 (1996) 
Danish 20,963 (2000) 
East Frisian 11,000 (1976) 
North Frisian 10,000 (1976) 

Not mapped 
Language No. of speakers 
Turkish 2,107,426 
Serbian/Croatian 652,000 
Italian 548,000 
Kurmanji 541,311 
Russian 360,000 
Greek 314,000 


Table 8.7 Languages spoken in Austria 


Language Numbers of speakers (date of source) 
German 8,140,000 (1998) 
Serbian/Croatian 103,000 (1991) 
Turkish 68,000 
Polish 39,000 
Slovenian* 30,885 (2000) 
Kurmanji 23,000 
Hungarian 22,000 (1995) 
Note: * Slovenian is officially recognized as a regional language 


In Switzerland, the majority speak an Alemannic 
dialect of German (Schwyzertüütsch), but read and 
write standard German. Four of the Swiss cantons 
(Jura, Vaud, Neuchâtel and Geneva) and parts of 
two others (Fribourg and Valais) are French- 
speaking, one (Ticino) is almost entirely Italian- 
speaking (with a Lombard dialect) and another 
(Graubünden) is trilingual, with speakers of 
German, Italian and Romansch, a Rhaeto-Romance 
language. Romansch has since 1996 been recognized 
as the official language for relations between 
Romansch speakers and the Swiss government. Num- 
bers of speakers of the four national languages are 
shown in Table 8.8. 


Table 8.8 The national languages of Switzerland 


Language No. of speakers (date of source) 
German 4,215,000 (1990) 
French 1,272,000 (1990) 
Italian/Lombard 498,000 (1990) 
Romansch* 40,000 (1990) 
Note: * Most Romansch speakers are bilingual with German 


In addition there are substantial numbers of 
speakers of immigrant languages, e.g. Serbian- 
Croatian (142,000), Spanish (117,000), Portuguese 
(86,000). 

In Liechtenstein, the 30,300 inhabitants speak 
Alemannic German and read and write standard 
German. 


8.6 Italy, San Marino, Vatican 
City, Malta and Corsica 
(France) 


The official language of Italy is Italian, with French, 
German and Serbo-Croatian recognized regionally; 
the official language of San Marino is Italian, and that 
of Corsica is French. The official languages of the 
Vatican City are Italian and Latin, and those of Malta 
are Maltese and English. 

Italy is characterized by considerable linguistic 
diversity, with geographically well-separated dialects 
or regional languages being mutually unintelligible. 
Since political unification (1861), gradual progress 
has been made towards the codification of standard 
Italian. Important factors in this standardization have 
been conscription, growing social mobility and the 
influence of the media. Most inhabitants of Italy have 
available to them a continuum ranging from a 
regional dialect to the standard language. It was the 
Tuscan dialect (specifically that of Florence) that 
formed the basis for the current standard language, 
the starting-point being Dante Alighieri’s Divine 
Comedy in the early fourteenth century. For centuries 
thereafter, the question of which standard should be 
adopted for the peninsula was hotly debated (la ques- 
tione della lingua), but the matter is no longer a con- 
troversial issue. 

The dialects/regional languages of Italian fall into 
four groups: the northern group, spoken to the north 
of a rough line from La Spezia on the west coast to 
Rimini on the east, which merges into the Occitan/ 
Provençal dialects of southern France (notably 
Emilian, Ligurian, Lombard, Piedmontese and 
Venetian); the Tuscan group, which also includes 
Corsican, spoken by 200,000 people (1976) on the 
French island of Corsica, where French has been the 
official language since 1769, after the island was ceded 
to France by the Genoese; the central group (Latium, 
Umbrian and the Marches); and the southern group, 
which includes Abruzzese, Calabrian, Pugliese, Cam- 
panian, Lucanian, Sicilian and Sardinian. Sicilian, 
which differs markedly from Standard Italian, is 
sometimes regarded as a separate language; Sardinian, 
which retains in its phonology more aspects of Latin 
than any other vernacular, is fairly generally seen as 
a separate language from Italian (with four major 
variants), and is recognized as an official language in 
Sardinia. 

Other Romance languages spoken in Italy are 
French, in the Aosta valley; Franco-Provençal in the 
Aosta valley, but also as far afield as certain com- 
munities in Puglia and Calabria; and Occitan, along 
various tributaries of the Po by the French-Italian 
border. Italy houses two Rhaeto-Romance languages 
(although some scholars deny they are linked): 
Friulian is spoken, and to some extent written, in the 
east of the province Friuli-Venezia Giulia; and Ladin 
is spoken in the Val Badia and other Dolomitic valleys 
east of Bolzano. Catalan is still spoken in the port 
of Alghero, Sardinia by descendants of the Aragon 
empire. 

Among the most prominent non-Romance lan- 
guages are German (used in and around Bolzano and 
recognized as an official language in the Alto Adige) 
in various southern Bavarian variants; some Aleman- 
nic German (Walser) is heard in the Aosta Valley; 
Slovenian is used in the Gorizia–Trieste area adjacent 
to Slovenia; and, in villages scattered across southern 
Italy and Sicily, (Tosk) Albanian is still actively spoken 
and taught; there are also some pockets of Italiot 
Greek, now largely among older people, and Croatian. 
The speakers of these last languages descend from 
refugees and mercenaries from the distant past. 
Numbers of speakers in Italy, San Marino and Vatican 
City are shown in Table 8.9. Other sizeable language 
communities in Italy are speakers of Somali (51,000), 
Chinese (40,000), English (29,000) and Maltese 
(28,000). 


Table 8.9 Languages spoken in Italy 


Language No. of speakers (date of source) 
Italian (including dialects and Sardinian) 55,000,000 
Friulian 794,000 (2000) 
French, Occitan and Franco-Provençal 270,000 
German (all varieties) 263,000 (2000) 
Slovenian 100,000 (1987) 
Albanian 80,000 to 100,000 (1987) 
Ladin 30,000 (2001) 
Balkan, Sinte and Vlach Romani 23,000 (1980/1990) 
Catalan 20,000 (1996) 
Greek 20,000 (1987) 
Croatian 3,500 (1987) 


The language of the great majority of the 
inhabitants of Malta is Maltese (Afro-Asiatic). Maltese 
(some 300,000 speakers; 1975) is not a dialect of 
Arabic, but a language in its own right, related to 
western Arabic dialects, but with Romance influ- 
ence; it is the only Afro-Asiatic language using the 
Latin alphabet. There are 2,400 speakers of English 
(1993). 


8.7 Spain (the Canaries), 
Andorra, Portugal (the 
Azores), the Cape Verde 
Islands and Gibraltar 


The official language of Spain is Spanish; the official 
languages of Andorra are Catalan and French; the 
official language of Portugal is Portuguese, while 
the Cape Verde Islands recognize Portuguese and 
Kabuverdianu; the official language of Gibraltar is 
English. 

Spanish (known to its speakers as español or castel- 
lano) is a Romance language ultimately deriving from 
the Vulgar Latin of the Roman armies that took over 
control of the Iberian peninsula in the third and 
second centuries BC. After the fall of the Roman 
Empire, the Romance dialects were subject to influ- 
ence from Germanic and Arabic. The central role of 
speakers of Castilian in the Reconquest of Spain from 
Islam helped to establish it as a national norm. The 
existence of this norm (watched over by the Real 
Academia de la Lengua, set up in 1714) impeded the 
free development of the various dialects/regional 
languages (see Map 68). 

Contemporary Spain contains 17 autonomous 
communities. In some of these, regional languages 
have recognized status alongside Castilian (itself 
spoken by 28,173,600 people in 1986, not quite three- 
quarters of the population): Aragonese (at least 
11,000 speakers) is recognized in Aragon; Asturian 
(about 100,000 speakers) in Asturias and in Castile 
and Leon; Catalan (6,472,828 native speakers; 1996) 
in Catalonia, Valencia and the Balearic Islands; 
Basque, with 580,000 speakers in Spain (1991), is 
recognized in Navarra and the Basque Country; and 
Galician (3,173,400 speakers; 1986) in Galicia. Cata- 
lan, Galician and Basque have the status of co-official 
languages in their respective communities. This 
upgrading in status has gone hand-in-hand with a 
blossoming of the regional languages, many of which 
had suffered centuries of repression. There are 
between 40,000 and 140,000 speakers of Calé (Iberian 
Romani). Many immigrants are Spanish-speaking 
(from Latin America); other major groups are Arabic 
(200,000), Western Farsi (25,000) and Chinese 
(20,000). 

Basque (known to its speakers as Euskara) is 
spoken, and is regionally recognized as an official 
language, in the provinces of Guipúzcoa, in parts of 
the provinces of Vizcaya, Navarra and Álava, and in 
Northern Spain, as well as in the western and 
southern areas of the French département of 
Pyrénées-Atlantiques (615,000 speakers; 1976); there 
is considerable dialectal differentiation, Guipuzcoan 
being dominant. Basque is a language isolate; that 
is, no conclusive evidence has been found for gen- 
etic connections with any other language, past or 
present. It is thought by many to be the sole sur- 
vivor of the languages spoken before the Romaniza- 
tion of Western Europe. Most Basque speakers are 
bilingual in Basque and Spanish (or, in France, 
French), but attach great importance to their lan- 
guage, the public use of which was repressed in the 
Franco era. 

The early history of Portuguese (spoken by 
9,869,000 people in Portugal; 1998) ran parallel to 
that of Spanish. It derives from Old Galician, as used 
in the north-west corner of the Iberian peninsula. 
After the Reconquest of the north-west, the area of 
modern Galicia gave its allegiance to Castile, while the 
south became independent (and Portugal has 
remained so except for the period 1580-1640). As the 
Moors were pushed ever southward, so the political 
centre of Portugal moved in a southerly direction. 


The standard language focuses around the dialect of 
Lisbon, but also incorporates several northern traits, 
so that there is large mutual intelligibility throughout 
Portugal and the Azores. Since 1999, there has been 
official recognition of the linguistic rights of the 
speakers of Mirandese (10,000 speakers), a variety 
of Leonese spoken in and around the town of 
Miranda in the very north-east of the country. 
There are also some 15,000 speakers of Galician in 
the northern province of Trás-os-Montes, 5,000 
users of Calão (Iberian Romani) and 500 of Vlach 
Romani. Many immigrants from Brazil, Mozam- 
bique, Angola and so on speak Portuguese; the 
largest groups of non-Lusophone immigrants speak 
Spanish (42,000), Arabic (27,000) or Ukrainian 
(numbers unknown). 

The map shows some smaller language-areas, 
including three island communities shown in insets. 
The Azores and the Cape Verde Islands are Portu- 
guese-speaking, with the great majority of the latter’s 
population speaking Kabuverdianu (393,943 speakers; 
1998), a Portuguese-based creole. The Canaries are 
Spanish-speaking (1,170,224; 1978). 

In Andorra, the majority language is Catalan 
(31,000), followed by Spanish (24,600) and French 
(2,400). The 31,000 inhabitants of the British depen- 
dency of Gibraltar use English (the official language; 
3,300) and Spanish (about 21,500); there is also a 
sizeable group of Moroccan Arabic speakers (2,900). 


8.8 France, Monaco and 
the Benelux 


The official language of France and Monaco is 
French, while the Netherlands gives official status to 
Dutch and, in Friesland, also to West Frisian; Belgium 
recognizes Dutch, French and German as official lan- 
guages, and Luxembourg awards this status to French, 
German and Letzeburgesch. 

The traditional division of Gallo-Romance into 
two languages, French (Langue d’Oil) and Occitan 
(Langue d’Oc, often known by the name of one of 
its regional variants, Provençal), no longer represents 
reality: the pervasive use of French throughout 
France, Monaco and neighbouring areas of Belgium 
and Switzerland has reduced Occitan to the status of 
a regional language. In recent years, however, it has 
been revived in the schools and universities of 
southern France, with a generally agreed standard- 
ization of the orthography. Franco-Provençal 
describes dialects used to the north-east of the 
Provençal area and adjoining parts of Switzerland and 
Italy. ‘There is within most regions a spectrum of 
variation from “pure” patois ... through français 
régional ... to the standard language’ (Harris 1987: 
213) in keeping with the dialect groupings shown on 
Map 69. 

Within the boundaries of France, French is the 
native language of some 58,683,000 people (1998). 
There are also a large number of linguistic minorities 
to be found; those which are geographically concen- 
trated have been mapped. Breton is a Celtic language 
(Brythonic branch). Fewer than 10 per cent of the 
inhabitants of Brittany are monolingual Breton 
speakers, but the number of speakers has remained 
fairly constant, with the language a focus of national- 
ist sentiment. Basque (for details, see Section 8.7) is 
spoken in the western and southern areas of the 
département Pyrénées-Atlantiques, where the lan- 
guage is seen as a vital expression of cultural and eth- 
nic identity. Catalan is spoken in the département 
Pyrénées-Orientales, and the spoken language of 
Corsica is Corsican, a Tuscan dialect of Italian 
(see Map 67). Some 1.5m speakers of Alemannic 
German are found in Alsace, with fewer in Lorraine; 
and there are some 80,000 Dutch (Flemish) speakers 
in the north-east corner of France (the Westhoek 
‘western corner’). Some 50,000 Romani speakers live 
in France. 

Of the immigrant languages, Algerian, Moroccan 
and Tunisian Arabic (1,364,000), Italian (1m), Portu- 
guese (750,000) and the Berber languages Kabyle 
(537,000) and Tamazight (150,000) are dominant, 
with the Creole French of the Lesser Antilles 
(150,000) and Turkish (135,000) also very well repre- 
sented. These immigrant languages are concentrated 
in the major conurbations. The numbers of speakers 
of the languages mapped are shown in Table 8.10. 


Table 8.10 Regional minority languages in France 


Language No. of speakers (date of source) 
Alemannic German 1,500,000 (1988) 
Breton 500,000 (1991) 
Corsican 341,000 (2001) 
Catalan 100,000 (1996) 
Dutch/Flemish 80,000 
Basque 76,200 (1991) 


In Monaco, Ligurian and Provençal are used alongside 
the official language, French, but with fewer than 
20 per cent of the population being native 
to Monaco, English is also much used. 

Belgium is divided into three ethnolinguistic 
regions (Dutch, French and German) in a federal 
structure. The language of the northern region is 
Dutch (known to its speakers as Vlaams, with 
Brabants, East and West Flemish dialects); that of the 
south is French (Walloon and Picard dialects), with 
German being spoken along the eastern border, to the 
north of Luxembourg. There are some 30,000 speakers 
of Letzeburgesch in the Belgian province of 
Luxembourg. The capital, Brussels, is situated within 
the Dutch-speaking area, but is itself officially 
bilingual Dutch–French, with French being numeric- 
ally dominant. There are sizeable migrant worker 
communities in urban areas, with 280,000 speakers of 
Italian, 127,700 of Moroccan, Algerian and Tunisian 
Arabic, 80,000 of Portuguese, 70,000 of Spanish and 
63,600 of Turkish, to name but the largest groups. 
Table 8.11 gives the numbers of speakers of the lan- 
guages mapped. 


Table 8.11 Regional minority languages in Belgium 


Language No. of speakers (date of source) 
Dutch at least 5.6m 
French at least 4m 
German 150,000 (1988) 


In Luxembourg, the mother tongue of at least 
300,000 of the 448,569 population (2002) is Letze- 
burgesch; German is taught in school as a second lan- 
guage, and French as a third, and these are the major 
languages of the press and the courts respectively. 
Letzeburgesch is linguistically a distinctive Franco- 
nian dialect of German. There is a large population 
of Portuguese-speaking immigrants (estimated at 
100,000) and speakers of Italian (20,800), satisfying a 
once-serious labour shortage. 

The language of the great majority (12,360,338 
in the year 2000) of inhabitants of the Netherlands — 
and the official language — is Dutch (or Netherlandic, 
known to its speakers as Nederlands). There is con- 
siderable dialectal variation, but also rapid movement 
towards standardization; three regional languages 
have been given government recognition in recent 
years, those of Drente, Groningen and Westerwold. 
In 1976 there were also some 700,000 speakers of 
West Frisian (known simply as Fries in the Nether- 
lands), which shares official status with Dutch in 
the north-westerly province of Friesland. Various 
waves of immigrants have brought many new 
languages to the Netherlands, notably Indonesian 
(300,000), Tarifit Berber (200,000), Turkish (192,000), 
Sranan (180,000), Papiamentu (80,000), Yue Chinese 
(70,000), Ambonese Malay (45,000), Kurmanji 
(40,000) and Moroccan Arabic (30,000). 


8.9 United Kingdom and 
Republic of Ireland 


By far the most prominent language of the United 
Kingdom and of the Republic of Ireland is English 
(Indo-European; Germanic). Although there are 
countless minority languages in the UK, English is 
absolutely dominant ‘in education, in the media, in 
the workplace, in government and in all aspects of 
British life [and it continues to be the case that] 
minority languages and literacies only have a legitim- 
ized place within minority institutions such as the 
home, the temple, church, mosque or the local com- 
munity association’ (Martin-Jones 1984: 427). 

The major (rural) English dialect groupings are 
indicated on the map. There are also three major 
varieties of Scots: Lowlands Scots (used in the south, 
east and far north of the country), Highlands and 
Islands Scots (a form of Scottish English that has 
replaced Scottish Gaelic) and Ulster Scots (in parts of 
Northern Ireland). Linguists have recently devoted 
attention to the specific nature of urban dialects,
finding these to be characterized by ‘structured
heterogeneity’. Within any one city there can be great
variation, but that variation is linked to the social 
structure of the city’s population, with differences in 
income, lifestyle, ethnicity and education all being 
reflected in linguistic distinctions. 

Since the 1950s, Britain has experienced con- 
siderable immigration and a concurrent radical 
increase in the number of community languages. 
Immigrants from the Caribbean brought with them 
various West Indian English-based Creoles. In recent 
years, British Jamaican Creole, differing in various 
ways from Jamaican Creole, has evolved among 
younger people. British Black English, on the other 
hand, can with more justice be seen as a form of 
English, but one influenced in all aspects by British 
Jamaican Creole. After the partition of the Indian 
subcontinent in 1947 and the Africanization of vari- 
ous East African countries in the 1970s, there arrived 
a good number of speakers (perhaps half a million) 
of such languages as Hindi, Urdu, Panjabi, Gujarati, 
Parsi, Sylheti, Bengali, Kashmiri and Tamil. Recent 
conflicts have brought speakers of Pashto from 
Afghanistan and the Kurdish languages Kurmanji and
Kirmanjki from Turkey, Iraq and Iran, and swelled 
the numbers of Arabic speakers. Earlier influxes 
from Hong Kong and Singapore, and the arrival of 
many ethnic-Chinese Vietnamese boat people made 
Chinese (principally Cantonese, Hakka and Yue) a 
language much heard in Britain. The tensions on 
Cyprus added to the ranks of Greek (200,000) and 
Turkish (60,000) speakers. Italian (200,000) is also 
well represented, as in most northern European 
nations. There are also significant numbers of speakers 
of Romani and AngloRomani. 

Of the 3.9m inhabitants of the Republic of 
Ireland (Poblacht na hÉireann) only around 70,000
are native users of Irish (a Celtic language of the 
Goidelic branch), though some 32 per cent of the 
population can speak the language to some degree 
(Census 1991). Otherwise the native language is Eng- 
lish. The areas in which Irish (or Gaeilge) is spoken 
are dotted around the coast (see Map 63) and are 
collectively known as the Gaeltacht. Constitutionally, 
Irish is the first national language, English the second; 
the equality of the languages was restated in an Act of 
2002. There are also some 95,000 speakers of Irish in 
Northern Ireland (UK). 

The minority languages mapped are all, with the 
exception of Channel Islands French, Celtic lan- 
guages. Whereas Cornish died out in the eighteenth 
century (though it has been revived by some 500 
enthusiasts) and Manx in 1974 with the death of its 
last native speaker (here, too, there are attempts at 
revival), Welsh (Britannic Celtic) and Scottish Gaelic 
(Goidelic Celtic) have survived — the former rather 
more vigorously — despite a gradual decline in the 
number of speakers, both monolingual and bilingual. 
The map shows the area within which at least 20 per 
cent of the population can speak Welsh, which has
official status and strong local government support;
particularly in the north-west and south-west, there 
are areas where Welsh is the majority language. And 
in Scotland, the map shows those areas with around 
10 per cent of the population able to speak Gaelic, 
with the largest concentration on the Outer Hebrides; 
a small number of speakers may be encountered 
throughout the Scottish Highlands, and there are 
estimated to be over 15,000 Gaelic speakers in Glas- 
gow. Channel Islands French exists in two dialects, 
those of Jersey (including Sark) and Guernsey, and is 
a survival of Anglo-Norman French; both French and 
English are official languages of the Channel Islands. 


Table 8.12 Regional languages in the British Isles 


Language                  No. of speakers (date of source)

in the United Kingdom
English                   58,649,000 (1998)
Welsh                     508,098 (1991)
Irish                     95,000 (2004)
Gaelic                    58,650 (2003)
Channel Islands French    14,000 (1976)

in the Republic of Ireland
English                   3,800,000
Irish                     70,000


8.10 Denmark, Norway, Sweden 
and Finland 


The official languages of Denmark, Norway, Sweden 
and Finland are Danish, Norwegian, Swedish, and 
Finnish and Swedish respectively. Between the three 
first-named languages, all of which descend historic- 
ally from Nordic, there is a certain degree of mutual 
intelligibility. That they are called languages rather 
than dialects of Scandinavian is above all attributable 
to the political separateness of the countries in 
which they are spoken. As far as Danish and Swedish 
are concerned, that separateness began in the 
mid-sixteenth century, after Sweden had achieved 
independence from Denmark in 1526. The Danish 
written norm centred on Copenhagen, while the 
court in Stockholm encouraged the development of a 
specifically Swedish norm. 

Denmark is almost exclusively Danish-speaking; 
there are small minorities of German speakers by the 
German border, and of Faeroese and Inuktitut 
speakers (not mapped) from the Faroe Islands and 
Greenland respectively. Not officially recognized as a 
language is Scanian (Skåne), spoken in extreme
south-west Sweden and referred to in Denmark as
‘East Danish’; Vikør (2000: 109) states that its
speakers have been ‘Swedicized linguistically’. The
major immigrant language is Turkish, with some 
30,000 speakers. Numbers of speakers are shown in 
Table 8.13. 


Table 8.13 Languages spoken in Denmark 


Language     No. of speakers (date of source)
Danish       5,000,000 (1980)
German       23,000 (1976)
Inuktitut    7,000 (1990)

Sweden is 93 per cent Swedish-speaking (7,825,000
speakers). There is a sizeable minority, of some 280,000
speakers, of Finnish and Tornedalen Finnish, and some
5,900 (partly nomadic) speakers of Saami. Five Saami
languages have been mapped within Sweden: Lule, Pite,
Northern (the largest group), Southern and Ume Saami.
After 1945, Sweden received a relatively large number of
refugees from other European countries: it is official
government policy to encourage bilingualism with Swedish
among these immigrants. The best-represented immigrant
languages are Serbian/Croatian (120,000), Greek (50,000),
Western Farsi (35,000) and Spanish (35,000). Tavringer
Romani is spoken by some 25,000 people.

Since its total independence in 1905, Norway 
has recognized two standard forms of Norwegian, 
Bokmål and Nynorsk. The former is an adaptation
and updating of the written norm imposed under 
Danish rule; it is used by 74 per cent of Norwegians 
(3.4m of 4.6m). The latter is a modernized version of 
a language constructed by the nineteenth-century 
autodidact linguist Ivar Aasen from his observations 
of rural dialects; it is used largely in country districts 
(850,000). There are some 21,000 speakers of 
Swedish, 16,000 of Lule, Pite, Northern and Southern 
Saami, 12,000 of Danish, 6,500 of Romani, and 5,350 
of Finnish. The major immigrant languages are 
Vietnamese (99,000) and Urdu (and other languages 
of Pakistan; 17,000). 

Whereas Danish, Swedish and Norwegian are 
(North) Germanic languages, Finnish and Saami are 
Uralic (more specifically Finno-Lappic) languages. 
Finnish (known to its speakers as suomi) is the 
language of well over 90 per cent of the population of 
Finland. Finnish arose in a unified written form in 
the nineteenth century as a compromise among 
various dialects, and played an important role in 
the national awareness that contributed to Finland’s 
independence in 1917. The official languages are 
Finnish and Swedish. Alongside a sizeable group of 
Swedish speakers in Åland and other south-western
areas, there are speakers of other Balto-Finnic lan- 
guages: Karelian, spoken in two villages close to the 
Russian border, Livvi (or Olonetsian; not mapped) 
and Estonian. 


Table 8.14 Languages spoken in Finland 


Language           No. of speakers (date of source)
Finnish            4,700,000 (1993)
Swedish            296,000 (1997)
Karelian           10,000 (1994)
Russian            10,000
Estonian           6,000 (1993)
Livvi              5,172 (2000)
Saami languages    2,550 (1995)


The three varieties of Saami spoken in Finland 
(Inari, Northern and Skolt) are termed languages 
above all because of the almost complete lack of 
mutual intelligibility; their speakers are spread over 
a wide area (Saamiland) that also includes the 
Kola peninsula in Russia. The Saami population 
has declined to some 4,700, and the number of 
speakers of Saami has declined correspondingly; 
many Saami use Tornedalen Finnish, a language of 
the Torne valley in northern Sweden, as their second 
language. Because of the transhumance involved in 
the rearing of reindeer, many Saami lead a nomadic 
existence so that precise localization on a map is 
impossible. 


8.11 Greenland, Iceland, the 
Faroes and Svalbard 


The official languages of Greenland are Greenlandic 
(or Inuktitut, an Eskimo-Aleut language) and 
Danish. The native tongue of the great majority of 
Greenlanders is Greenlandic, known to its speakers as 
Kalaallit oqaasii — of the 44,000 speakers, some 40,000 
speak West Greenlandic, 3,000 East Greenlandic and 
800 North Greenlandic (used in and around Thule). 
The minority language (7,830 speakers; 1986) is 
Danish, which is largely confined to the major 
towns. Although since Home Rule the role of Green- 
landic in public affairs has been expanding, Danish is 
still the dominant language of secondary education, 
commerce and administration. Yet Danish has had 
little influence upon Greenlandic, almost certainly 
because of the marked typological differences 
between the two languages. Greenlandic has been 
written for almost two centuries and was the first
native American language to be thoroughly described, 
by Samuel Kleinschmidt in 1851. Standardization was 
guided by a language commission set up in 1956, and 
a new orthography, based on the majority West 
Greenlandic dialect (concentrated around the capital 
Nuuk/Godthåb), was introduced in 1973 to replace
Kleinschmidt’s. 

The absolutely predominant language of Ice- 
land’s 293,966 inhabitants is Icelandic, though there 
is a minority of Danish speakers (2,250; 2001). Ice- 
land was colonized by speakers of Old Norse in 
the late ninth century. The language changed little in 
the following millennium, so that present-day Ice- 
landers have little difficulty in reading the Sagas and 
Eddas of the twelfth and thirteenth centuries. The 
modern language has preserved all the morphological 
characteristics of Proto-Germanic. Stringent efforts 
are made to keep the written language free from 
external influence: thus words for modern inventions 
are created rather than borrowed from foreign 
languages. 

The Faroe Islands have been an integral part of 
Denmark since 1380, and were granted local auton- 
omy in all but matters of foreign policy in 1948. The 
official languages are Faroese and Danish. Faroese is
similar to, but not mutually intelligible with,
Icelandic. It has been a written language only since the
nineteenth century. Faroese is spoken by all 45,400 
inhabitants of the islands. 

Svalbard (pop. 3,309; 1991) is a group of islands 
situated halfway between the north shore of mainland 
Norway and the North Pole. Although probably visited 
by the Vikings, the islands were not settled until the 
1890s, when the Norwegians began mining coal. They 
have been recognized as part of Norway since 1920. 
The inhabitants of Svalbard are Norwegian-speaking. 


9.1 General remarks


Virtually the entire area covered by the maps in this 
section is the historical domain of the Hamito- 
Semitic language stock, which shares a number of 
structural resemblances, in, for example, the primacy 
of the root system, the forms of the pronouns and 
the morphology of the verb. It has four subdivisions: 
the Cushitic family of parts of the Sudan, Ethiopia 
and Somalia; Ancient Egyptian and its descendant 
Coptic, both now extinct; Berber; and the Semitic 
languages, with Hebrew and some Aramaic dialects 
surviving from the northern branch, and Arabic and 
the Modern South Arabian (MSA) and Ethiopian 
Semitic languages representing the southern branch. 
The whole of the Middle East and North Africa 
is now dominated by a complex network of Arabic 
dialects, which are typologically divided into Eastern 
and Western, the boundary between the two coincid- 
ing with the eastern frontier of Libya. These dialects 
were once confined to the Arabian Peninsula, with 
some intrusion in late antiquity into parts of Mesopo- 
tamia and Syria, but subsequently were carried by the 
Islamic conquests of the seventh and eighth centuries 
and later expansions throughout the Middle East and 
North Africa, checked only by the mountain barrier 
of the Ethiopian highlands, the savannah belt of the 
Sudan and the rivers Niger and Senegal. Their success 
in replacing the indigenous languages of this vast 
territory has been attributed to the fact that they 
were moving into zones of similar language type, 
such as Aramaic, Coptic and to some extent Berber, 
whereas the alien structure and the historical tenacity 
of the Iranian and Turkic languages guaranteed their 
survival, albeit with massive lexical borrowing from 
Arabic in the religious and intellectual domains. 
Christian and Jewish communities, however, 
were able to maintain isolated pockets of their own 
idioms in small, settled communities of Aramaic 
speakers in northern Syria and Iraq, and adjacent 
parts of Turkey and Iran, while the religious status of 
Hebrew and its survival as a written medium during 
the Middle Ages led to its renaissance as the official 
language of the State of Israel in 1948. The Semitic 
languages of Ethiopia, particularly Amharic and 
Tigrinya, thrive in protected isolation in the high- 
lands of the Horn of Africa, with some related non- 
Arabic languages on the south coast of Arabia. In the 
impenetrable terrain of Ethiopia numerous minor 
languages of the Cushitic stock of Hamito-Semitic 
are spoken alongside the Semitic tongues, while in 
the west of the country and the southern Sudan a 
bewildering mosaic of languages of the Nilo-Saharan 
phylum mingles with the southernmost representa- 
tives of the Arabic dialect diaspora. Finally the Berber 
stock survives in healthy competition with Arabic in 
the north of Africa, particularly Algeria and Morocco. 
Climatic and geographical factors contribute to 
this sharp delimitation of the Arabic dialect area. It 
corresponds closely to the zone of summer drought 
with an average rainfall of less than 5 inches (127 
mm) per annum, except in the Lebanon and Ethiopia. 
Here the role of the nomadic bedouin is a major 
determinant in shaping the social environment. A
precise geographical barrier is set in the north by the 
mountain systems of south-east Anatolia, Kurdistan 
and western Iran, though permitting some spillage 
into the Arabic area of Turkish and Kurdish, while 
some pockets of Luri and Gurani exist near the 
Persian border. The coast of the Levant also forms a 
mountain barrier with its transversal faulting, which 
encourages the establishment of relatively discrete 
dialect forms as against the more fluid situation in 
the Syrian desert. A low coastal plain on the Persian 
Gulf terminates the easterly downward-tilting plate of 
peninsular Arabia and occasions a measure of Persian 
influence on the language and culture of the area. The 
mountain barrier resumes in Oman and contains the 
remainder of the peninsula as far as Sinai, whence it 
resumes its course down the west coast of the Red Sea 
to merge into the highland system of Ethiopia. The 
northern limit of Arabic-speaking Africa is also 
predominantly mountainous, except in Libya and a 
few stretches of coastal plain further west, finally 
levelling off southwards down the Atlantic coast of 
Mauritania. 

Contained within this mountainous periphery 
are two vast desert complexes, interrupted only by 
the river systems of the Tigris-Euphrates and the 
Nile. The stony Syro-Mesopotamian desert connects 
directly with the Nafūd and Dahna’ wastes of Saudi
Arabia to form a region occupied mainly by camel- 
and cattle-breeding nomads, but punctuated occa- 
sionally by concentrations of wells or oases where 
they can spend the dry summers and which permit 
the establishment of overland communication routes 
between the major zones of settlement. The great 
Empty Quarter of South Arabia is almost entirely 
uninhabited. The sparsely populated desert zones to 
either side of the Nile continue through Libya and the 
western Sudan to link up with the Sahara and create 
an area of over 3,500,000 square miles to which only 
nomadic tribes can acclimatize. 

The population throughout the Middle East and 
North Africa is concentrated in the more mountain- 
ous periphery and particularly the river regions, and 
there are few major cities in the desert zone. Baghdad 
had attracted some 4,797,000 inhabitants in 2000, 
while in the Nile Delta Cairo could boast of some 
10,552,000 inhabitants in the same year and
Alexandria 4,113,000. By contrast the major cities of the
Levant and northern Arabia are more modest, Beirut
and Damascus having just over 2 million inhabitants
and Aleppo about 1,700,000. In North Africa only
two cities top the million mark, Casablanca with
2,940,600 (1994) and Algiers with 1,885,000 (2000).


Table 9.1 The consonantal system of Proto-Semitic


                   Plosive    Fricative
Bilabial           p b
Interdental                   …
Dental             t d ṭ      …
Palato-alveolar               …
Velar              k q g      …
Pharyngeal                    …
Laryngeal                     …

Semi-vowels
Bilabial           w
Palatal            y

The dialects of the settled zones and urban 
centres, which Versteegh sees as originating in late 
antiquity from pidginized versions of Arabic develop- 
ing in contact with Greek- and Aramaic-speaking 
communities, are generally innovative and responsive 
to external influences. Equally, however, they tend not 
to be clearly marked off from neighbouring dialects 
and a great deal of overlapping occurs. Considerable 
reduction in the grammatical inventory has taken 
place in terms of the underlying Classical Arabic 
(CA) paradigm, explicable in terms of a long settled 
history. The bedouin dialects, on the other hand, are 
marked as more conservative, preserving more of the 
classical system, and the boundaries between them are 
more distinct, especially where they are protected by 
geographical factors. This is nowhere more in evi- 
dence than in the peninsula itself and in North Africa. 

In all the Semitic languages consonants pre- 
dominate over vowels. The full system for proto- 
Semitic is particularly relevant to Arabic, the most 
conservative of the Semitic family, and may be set out 
as in Table 9.1. The only departures from this system
in Classical Arabic are a fricative pronunciation for
the bilabial p (> f), velar g becomes affricate j, the
interdental fricative ṯ̣ becomes emphatic ẓ, and the
lateralized dental ś shifts to š; while s and š merge into
s, except in MSA. Losses in Hebrew and Aramaic
include all the interdentals (ṯ > š, ḏ > z, ṯ̣ > ṣ, ḍ > ṣ)
and the lateralized dental ś (though retained in
Hebrew writing). Velars ḫ and ġ > pharyngeals ḥ and ‘.
In Aramaic ṯ̣ > ṭ, ḍ > ‘. The Ethiopian Semitic
languages also pronounce the bilabial plosive p as f but
have created two independent phonemes p and ṗ; all
interdentals again are lost (ṯ > s, ḏ > z, ḍ > ṣ, though
the last is retained in writing), as are the velar
fricatives (ḫ > ḥ, ġ > ‘). In Amharic, all the laryngeal
sounds are reduced to zero. A notable innovation of
the Ethiopian group is an ejective articulation for the
Semitic emphatics for which evidence exists also in
Modern South Arabian. The Semitic vowel system
consists of a triad a, i, u, to which corresponds a
lengthened set ā, ī, ū, along with the diphthongs aw
and ay.


Table 9.1 (continued)

            Lateral    Lateralized    Rolled    Nasal
Bilabial                                        m
Dental                 ś              r         n

The word in Semitic basically consists of a root
and a scheme, the root being constituted of three, 
occasionally more, successive consonants forming a 
sort of semantic skeleton for the word, which is then 
given its morphology and closer semantic definition 
by the insertion of a pattern of vowels and the add- 
ition of affixes. Thus Arabic ktb indicates ‘writing’, 
specified to kataba ‘he wrote’, yaktub ‘he writes/will
write’, kātib ‘scribe’, kitāb ‘book’, maktaba ‘library’.
The verbal system is aspectual, indicating whether the 
action is complete or still in process. This leads to two 
basic verbal forms, the Perfect formed with pro- 
nominal suffixes, as Arabic katabtu ‘I wrote’, and the 
Imperfect with pronominal prefixes and suffixes, as 
aktub ‘I write’, yaktubūna ‘they write’. Apart from the
base form of the verb a system of measures has 
evolved, varying according to the specific language, 
where — by altering the base form by vocalic or con- 
sonantal modifications, and by prefixes — the applica- 
tion of the root notion can be modified, as Arabic 
‘alima ‘know’, a‘lama ‘inform’, ‘allama ‘teach’, ta‘allama 
‘learn’. It is likely that the Semitic noun originally had 
a triple flection indicating subject, object and posses- 
sive by the suffixes -u, -a, -i, respectively. Present in 
Classical Arabic, it is now lost in all modern lan- 
guages. Number was indicated by a system of singu- 
lar, dual and plural but the dual survives only in 
bedouin Arabic dialects and some ossified words 
elsewhere. A feature of the southern Semitic lan- 
guages is the availability of ‘broken’ plurals formed by 
vocalic and suffixal changes to the singular form. 
Otherwise plurality is shown by suffixation. Gender is 
limited to masculine and feminine. Pronouns exist as 
independent forms or as suffixed object and posses- 
sive forms. 
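
The root-and-scheme principle described above can be sketched in a few lines of Python. This is only an illustration, not a morphological analyser: the function name apply_pattern and the notation in which the digits 1, 2 and 3 mark the slots for the root consonants are assumptions made for this example, and only triconsonantal roots are handled. The patterns themselves are read off the Arabic forms quoted in the text.

```python
def apply_pattern(root: str, pattern: str) -> str:
    """Slot the three root consonants into a vowel-and-affix pattern.

    In `pattern`, the digits 1, 2 and 3 stand for the first, second and
    third consonant of `root`; every other character is copied unchanged.
    (The digit notation is a convention invented for this sketch.)
    """
    if len(root) != 3:
        raise ValueError("this sketch handles triconsonantal roots only")
    return "".join(root[int(ch) - 1] if ch in "123" else ch for ch in pattern)


# Patterns taken from the forms of the root ktb 'writing' cited in the text.
KTB_PATTERNS = {
    "1a2a3a": "he wrote",
    "ya12u3": "he writes/will write",
    "1ā2i3": "scribe",
    "1i2ā3": "book",
    "ma12a3a": "library",
}

# Patterns for the measures of the root ‘lm 'knowing' cited in the text.
MEASURE_PATTERNS = {
    "1a2i3a": "know",
    "a12a3a": "inform",
    "1a22a3a": "teach",
    "ta1a22a3a": "learn",
}

if __name__ == "__main__":
    for pattern, gloss in KTB_PATTERNS.items():
        print(apply_pattern("ktb", pattern), "-", gloss)
    for pattern, gloss in MEASURE_PATTERNS.items():
        print(apply_pattern("‘lm", pattern), "-", gloss)
```

The same pattern applied to a different root yields the corresponding form of that root, which is the sense in which the consonants act as a semantic skeleton and the vowels and affixes supply the morphology.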

Literature is actively cultivated throughout the 
region under review, with particular emphasis on the 
novel, the short story and drama. Amongst more 
traditional genres one may mention national poetry, 
which is deeply rooted in all the cultures discussed. A 
form of Classical Arabic continues in use throughout 
the Arabic-speaking world as a means of mass com- 
munication overriding the problems of compre- 
hension between dialects. Modern Standard Arabic is 
essentially the same as the classical form, from which 
however it is not descended, but it lacks some of the 
grammatical nuances of the latter, notably in the sup- 
pression of otiose inflections. It is employed in mass 
communications of all kinds, in writing and in con- 
versation, and is taught in schools. It is in this lan- 
guage that literacy must be evaluated. Generally this 
ranges from as low as 17 per cent in Mauritania, a 
predominantly bedouin culture, to 79 per cent in 
Jordan and 71 per cent in Kuwait, but normally runs 
from 40 to 60 per cent. The use of Hebrew is compul- 
sory in Israel and the literary language differs little 
from its biblical and Mishnaic origins. Its central 
position in Judaism ensures its active cultivation, 
though some sects regard it as too sacred to employ as 
a spoken medium. Nevertheless literacy stands at 88 
per cent. Amharic and to some extent Tigrinya are the 
literary languages of Ethiopia, where writing has been 
actively encouraged and developed for almost a cen- 
tury, resulting in a very remarkable corpus of litera- 
ture. One of the more impressive achievements of the 
Mengistu regime was to raise Ethiopia’s standard of 
literacy from one of the lowest figures in Africa (des- 
pite the country’s rich medieval literary experience) 
to a respectable 55.2 per cent of a total population of 
53,200,000 at the time. 


9.2 Neo-Aramaic dialects 


The Assyrian or Neo-Aramaic dialects are remnants 
of ancient Aramaic, a language group first attested in 
inscriptions of the ninth century BC from north Syria
and the Caucasus, and which became the lingua 
franca of the Persian, Assyrian and Babylonian 
regions, with sporadic occurrences in North Arabia 
and Egypt. It was the language of Palestine in the time 
of Christ, and in late antiquity and the Middle Ages
was the vehicle for an extensive literature in Classical
Syriac at the Christian centre of Edessa, and for 
various Jewish writings, notably the Babylonian 
Talmud. It survives today in the dialects of small, 
isolated communities of Jews and Christians in the 
Anti-Lebanon, in northern Iraq and in parts of Turk- 
ish Kurdistan and Iranian Azerbaijan, where it is the 
medium of settled communities of agriculturalists. 
Isolated émigré communities also have established 
themselves in Georgia (Mtskheta, Gardabini, Tbilisi, 
Kutaisi, Samtredia and Zestafoni), Armenia (Yerevan 
and Leninakan) and Azerbaijan (Kirovabad and 
Armavir). There is also a diaspora in the United States 
and in the larger cities of Russia, including Moscow 
and St Petersburg. No reliable figures can be cited for 
the number of speakers of these languages, particu- 
larly after the dramatic political events of the 1980s 
and 1990s, but there were an estimated 4,000 Ma‘lūla
speakers in 1936 and roughly 20,000 for the Ṭūrōyo
group in 1985. A recent estimate for the total number 
of Aramaic speakers suggested 300,000, including 
about 25,000 in the Russian territories (according to 
a 1970 census). A figure for Iraq in 1974 speaks of 
200,000 there. Very few Jewish speakers of the dialects 
survive outside Israel, where the figure stood as high 
as 28,000 in 1976. Christianity is the dominant 
religion of the Assyrians, those of Urmia and 
Kurdistan being Nestorians while the inhabitants 
of Ṭūr ‘Abdīn are Monophysite Jacobites. A large
number are Uniates and a very few are Greek 
Orthodox. 


9.2.1 Western Neo-Aramaic 


Western Neo-Aramaic has survived in the mountains 
of the Anti-Lebanon north of Damascus, in the 
villages of Ma‘lūla, Bakh‘a and Jubb‘adīn, the first a
Christian community, the latter two both Muslim. 
Perhaps the most conservative form of the language 
and comparable to the early Jewish and Christian 
forms of Palestinian Aramaic, it preserves the long /ō/
articulation of the phoneme ā, the plural ending -éya
on masculine nouns and an aspectual verb conjuga- 
tion. The influence of Arabic is dominant in the 
phonetic system, the grammar and the lexicon. 


9.2.2 Eastern Neo-Aramaic 


The Eastern Aramaic dialects, which are more 
innovative than the Western, perhaps because of their 
linguistic environment, provide an uncertain number 
of dialects, for the most part mutually incompre- 
hensible, formerly scattered over a wide area of 
south-eastern Turkey, northern Iraq and north- 
western Iran, with the addition of the isolated 
Mandaic on the Shatt al-‘Arab, that is, mostly in areas 
of Kurdish, Turkish and Iranian speech. However, as 
a result of persistent religious persecution in more 
recent times, the majority no longer live in their 
original homelands and many have emigrated, par- 
ticularly to Israel, Turkey and Iran. The latter regions 
seem to have been their original homeland and many 
large communities remain on the north-western, 
western and south-western shores and in the hinter- 
land of Lake Urmia (Rezaiye), and in various Iranian 
cities such as Tabriz, Teheran and Hamadan. The 
Turkish vilayet of Van and the Kurdistani mountains 
are home to several communities speaking minor dia- 
lects. For centuries they have lived as immigrants in 
Iraq in and around Ba‘qūba near Mosul, while in the
1930s the Khābūr tributary of the Tigris river
attracted some thirty communities of Aramaic
speakers from Lake Urmia. They refer to themselves
as Assyrians, ’assūrāi, but the term athurāi, used of
the mountain communities, is regarded as the more
correct form. Speakers of the Ṭūr ‘Abdīn dialect call
themselves assorzi.

Characteristic phonological peculiarities of these 
dialects include the preservation of the long ā, while
pharyngeal ḥ tends to shift to velar ḫ. The verbal
system has diverged considerably from Semitic. Of the
finite forms of the verb only the imperative remains, 
and suppletive forms have been created based on
participles and infinitives, as pātih-PART + n-PRON >
pathin ‘I open’; ptih-PART + lī-PRON + SUFF > ptihlī
‘I opened’. By this means a complex paradigm of
finite tenses has evolved, which contrasts markedly 
with the largely aspectual system of Arabic. The 
system of derived stems or measures is much simpli- 
fied in Aramaic and essentially reduced to the inten- 
sive and causative forms. In the syntax, simple and 
compound sentences occur, the latter mostly in 
writing. Sentences may be verbal or nominal. Word 
order is not rigid in the spoken language and SVO 
(Subject—Verb—Object) tends to predominate, unless 
after verbs of speech when VSO is more normal. 
Asyndetic sentences are very common, but subordin- 
ating conjunctions based usually on relative construc- 
tions are available. Typologically, Eastern Aramaic 
stands closest to the Aramaic of the Babylonian 
Talmud, Classical Syriac and Mandaic. 

The dialects of eastern Aramaic present a 
complex picture. Ṭūrōyo is spoken in the vicinity
of Mardin, to the south-west of Lake Van on the 
Turkish-Syrian border, and is one of the best 
described. Urmi is centred on the town of Rezaiye on 
Lake Urmia, with, formerly, a Jewish subdialect in 
Salamas to the north, the two being mutually unintel- 
ligible, while the related Zakho and Sulaymaniyya 
subgroups lie to the south and south-east respectively. 
Quite distinct from these is Hartevan, an isolated 
Sprachinsel to the west of Urmi in the Siirt province 
of Turkey. Fillihi, or Alqosh, and the Zakho subgroup 
on the Khābūr river are spoken by communities of
emigrants from Iran and Turkey respectively. 

A form of modern standard Assyrian based on 
the Urmi dialect was devised by missionaries in the 
1840s for the publication of religious and educational 
books, which were printed on local presses using 
the Syrian Nestorian script, and a modest output of 
literature and journalism was achieved. Within the 
former Soviet Union in the 1920s and 1930s, a Latin- 
based alphabet was devised for Assyrian and success- 
fully used for a time, but did not really do justice to 
the phonology of the language. 


9.2.3 Mandaic 


Mandaic was the language of a sect of Gnostics who 
occupied the territories to the south of the Tigris and 
Euphrates rivers. A number of their religious writings 
produced between the third and the eighth centuries 
AD has survived. Communities of Sabians (not the 
Sabaeans, for whom see Section 9.4.1.5), numbering 
some hundreds only, have been found in Khorram- 
shahr and Ahwaz in Iran. They have preserved their 
old idiom, but little information is available on it. 
Around 20,000 ethnic Mandaeans in Iraq now speak 
Arabic. 


9.3 Hebrew 


Hebrew belongs to the Canaanite branch of the 
Semitic language family. Its heyday as a literary 
medium was during the biblical period from about 
1200 to 200 BC; it was subsequently supplanted by
Aramaic in Palestine, but continued in use for the
composition of various apocryphal and rabbinic
writings in the early centuries of the Common Era. 
Somewhat artificially maintained in the Middle Ages 
as a written language among Jewish communities 
throughout Europe and particularly the Middle East, 
where it was subjected to a strong influence from lan- 
guages it came into contact with — particularly Arabic 
in the areas of syntax and intellectual and cultural 
calques in the lexicon — it nevertheless provided the 
vehicle for a rich religious and poetical literature. 
A nineteenth- and twentieth-century revival of the 
language is associated with the name of Eliezer Ben
Yehuda (1858-1922), who laboured to make a spoken
idiom of the literary language by incorporating 
material from medieval Hebrew, Arabic and Aramaic 
and by the use of word derivation. The Balfour
Declaration of 1917 led in 1922 to Hebrew being
accorded official status within the Jewish community
in Palestine and when the State of Israel was founded 
in 1948 the Jewish population of Palestine already 
spoke what is termed Palestinian Hebrew. Inevitably 
the Jewish authorities made it state policy to adopt it 
as the official language for the young nation, along 
with Arabic for the Palestinian Arab population. 

Although the Israelis had to adapt what was 
effectively a somewhat enriched version of the ancient 
biblical language to the intellectual, social and polit- 
ical demands of a modern environment, the new lan- 
guage did not fundamentally alter in its structure, and 
what ensued amounted to the revival of an effectively 
dead language. This was done with remarkable effi- 
ciency when one considers that to many, if not most, 
immigrants Hebrew was a totally new experience. In 
the earlier years there were tensions between teachers 
who felt it their duty to propagate the language as if it 
were still biblical Hebrew, and writers who preferred 
to model their language on the later Mishnaic model. 
While the sacred, historical roots of Hebrew doubt- 
less will continue to force normative expectations on 
the Israelis’ view of their language, by now it has 
developed an internal dynamic, which will ensure its 
survival as an independent member of the Semitic 
family, selecting elements, whether eclectically or 
arbitrarily, from the many layers of its historical 
origins. The language of writers and intellectuals may 
hold a prestigious position through its archaizing 
norm, but the influx of a large element of Oriental 
Jews into the population has brought about many 
deviations from the ideal standard. 

It is estimated that 83 per cent of Israelis are 
Jews, so that the number of speakers of Hebrew in 
Israel may be put at about 5,000,000, with 88 per cent 
literate (including presumably Arabs). The acquisi- 
tion of Hebrew is obligatory for newcomers. Many 
are bilingual with European languages, particularly 
English, or with Arabic, and the recent arrival of large 
numbers of Ethiopian Falashas has also introduced 
Amharic and Tigrinya to the scene. The Arab element 
in the population — Muslims, Christians, Samaritans 
and Druzes — numbers just over 1,000,000. 

In a number of areas of Hebrew, profound 
changes have taken place. The centrality of the tri- 
consonantal root and the primary semantic values as 
exemplified by consonants have been maintained, 
but distinctions among laryngeals and emphatics 
have largely been given up. Vowel quality is no longer 
phonemically distinctive. Biblical Hebrew continues 
to provide the basic morphological system, though 
with some loss, such as the imperative, which is 
largely replaced now by the appropriate part of the 
future. Similarly the biblical tense system was felt 
inadequate for modern needs and replaced by a new 
trilinear system with a present indicative formed with 
a pronoun and present participle. In syntax the 
possessive particle šel has reduced recourse to pronominal
suffixation on nouns and to the possessive
construct. A tendency has developed to form nominal 
and adjectival compounds, which otherwise are alien 
to Semitic. And the creation of diminutives has 
spread. The lexicon draws strongly on biblical and 
Mishnaic Hebrew, allowing the innate word-building 
potential of the language to expand it as appropriate. 
There are numerous Arabic and English loans, in the 
latter case including loan translations. 


9.4 Arabic dialects 


It is estimated that over 200 million people speak one 
or another of the many and varied forms of dialect 
Arabic. It is of course very difficult to define the pre- 
cise borders of a dialect and at least in the settled 
parts of the region one might speak rather of sets of 
dialect continua, where groups of more or less related 
forms imperceptibly merge into one another, with 
sets of isoglosses crossing tribal boundaries in such a 
way as to present a large number of possible outlines, 
which do not necessarily correspond to the social 
divisions of the area. The one clear distinction that
can be made within the Arabic dialect area is between
the bedouin and the settled dialects. The determining 
factors are operative within the desert regions of the 
Arabian Peninsula, Iraq, Syria, Egypt and the Sudan, 
and the whole of North Africa, irrespective of any 
local classification, and a bedouin dialect can always 
be recognized. The settled and, in particular, the 
urban dialects tend to be innovative and succumb 
readily to external pressures on their phonological 
and grammatical systems. 

The bedouin dialects, however, are very con- 
servative. Universal throughout them is the voiced 
articulation of the phoneme q as /g/ or /j/, sedentaries
preferring an unvoiced pronunciation, /q/, /k/ or /'/. 
In the morphology they tend to preserve gender dis- 
tinction in the second and third person plural of the 
pronoun and verb, the use of the dual number, and 
the internal passive (CA qutila, where settled dialects 
have recourse to the derived forms). Because of the 
tribal and territorial identity felt by Arab bedouins 
and their lack of trust in outsiders, their dialects are 
found to be more discrete as far as the speakers are 
concerned, although their nomadic way of life again 
creates difficulties in accurately delimiting them. 
Consequently, in these maps the areas defined are 
often very tentative, except in the case of purely urban 
dialects or those where clearly marked geographical 
factors delimit an area more precisely. 


9.4.1 Eastern Arabic dialects 


The Eastern type of Arabic is marked clearly off 
from the Western by a line corresponding roughly to 
the Egyptian boundary of Libya, with some antici- 
pation of its main features evident in the Western 
Delta and the bedouin dialect of the Western Desert 
of Egypt. The Eastern dialects are generally more con- 
servative compared with the Classical Arabic para- 
digm in their phonology and syllabication. In the 
Western dialects factors relating to word stress, 
probably deriving from the Berber substratum, have 
led to a reduction in the internal vowelling of 
individual words, which appear at times to consist 
almost entirely of consonants. 

The Eastern dialects can be classified into 
North and Central Peninsular Arabic, spreading into 
Southern Iraq and Syria; Hijazi Arabic, which is
the basis of the Syro-Palestinian group and of 
Egyptian and Sudanese; and Yemeni and South 
Arabian dialects; leaving Oman as a mixed zone. A 
handful of so-called Modern South Arabian lan- 
guages are spoken on the south coast of the Yemen 
and in Dhofar. These do not belong to the basic 
Arabic type but are rather to be grouped with the 
Semitic languages of Ethiopia. The major group- 
ings of dialects tend to be mutually intelligible, at 
least at close range, and the biggest gap in intercom- 
prehensibility is between Eastern and Western 
dialects. 


9.4.1.1 Shammari dialects 


A convenient starting point for investigation of the 
Eastern forms is central and northern Najd, where 
a dialect configuration has developed reflecting the 
history of tribal movements there over the past two 
centuries. Central to this is the role of the dialects of 
the Shammari tribes, who originate from Jabal 
Shammar. Here are the home grounds of a largely 
nomadic tribal federation, whose annual search for 
spring grazing and summer quarters has led them 
north across the Nafūd to the southern regions of
the Tigris, Euphrates and Kārūn rivers of southern
Mesopotamia, where some of them have become 
settled agriculturalists. The Shammar themselves 
preserve several features in their language other tribes 
on the Gulf and elsewhere would characterize as 
bedouin, while towards Mesopotamia the influence 
of their non-Shammari neighbours becomes per- 
ceptible in their speech. The Dhafir in the east of 
their territory employ a mixed dialect with traces of 
their neighbours, the ‘Aniza, and a few Mesopota- 
mian features. 


9.4.1.2 ‘Anazi dialects


The ‘Aniza, unlike the Shammar, are not indigenous 
to the area they now inhabit. They originated in 
western Hijaz between Mecca and Syria, but were 
displaced by movements of the Dawasir and moved 
round to the west and east of the Shammar. The
‘Amarat branch now camps in Mesopotamia some
way north of Najaf, but has a central Najdi dialect, 
while their neighbours to the south, the Shammar, 
retain their northern Najdi form. In taking over the 
central Syrian desert they have split the Shammar 
into two. Both Shammar and ‘Aniza subgroups meet
in their summer grounds in Mesopotamia but move
elsewhere for their spring grazing, the Wuld Sulayman
and Wuld ‘Ali, for example, towards the Hijaz, the
Ruwala to Syria, and the ‘Amarat, the Sba‘a and the
Dhafir towards the Euphrates. The close contact 
maintained by the tribes between themselves and 
with the settled agricultural and urban peoples of the 
region occasions many influences from one dialect to 
another, which permit tentative classifications to be 
made. When they live in close contact in a sort of 
continuum there is much dialect overlapping, but 
with the more geographically defined groups such 
as the Shammar of Jabal Shammar this is much less 
the case. We are now confronted with an Arabian 
subgroup of dialects represented in a zone including 
southern Iraq, southern Persia, Kuwait, al-Hasa,
northern Saudi Arabia and the bedouin zone of the 
western borderlands of Iraq, while the riverlands of 
southern Iraq with Khūzistān form a Mesopotamian
subgroup. Neither subgroup is genetically based, but 
purely territorial. 

Since groupings of dialects tend to centre around 
foci of population in Iraq and the Gulf region, the 
influence of the North and Central Arabian dialects is 
strongly felt in the latter also. Thus the ‘Anazi ‘Awazim
and Rashayida represent the oldest layer of the popu- 
lation of Kuwait, which always had strong cultural 
and commercial links with lower Mesopotamia, and 
their dialect has now adapted to that of the settled 
population, while the Baharna of Bahrayn — albeit 
Shi‘i, and having no contacts with interior Arabia — 
nevertheless speak a dialect that is basically Arabian. 
The original tribes of al-Hasa have moved north- 
wards and their dialects have been replaced by those 
of tribes from the borders of the southern Rub‘
al-Khali desert, such as the ‘Ajman and Al Murra. The
remainder of Iraq and the Syrian desert presents a 
mixed scenario, again reflecting early tribal move- 
ments. The bedouin influence again is dominant, 
even in the capital Baghdad, whose dialect is basically 
of a bedouin type, employed also by the Arab population
of Khūzistān. The Shammari Tayy occupy land
in the extreme north of Iraq near Mosul, while ‘Anazi
tribes are present in desert Syria in the Fid‘an, ‘Amarat 
and Ruwala. In the mountainous north and east 
of the country, Arabic is contiguous with areas of 
Kurdish and Iranian speech, and in Turkey is found in 
isolated pockets surrounded by speakers of Turkish. 
Although good descriptions exist of individual dia- 
lects in northern Iraq, little comparative study has 
been done. Four small groups, the Haddidin, N‘aym, 
Fadl and Mawali, scattered between Damascus and
Aleppo in Syria, are remnants of an early phase of 
the Arab movement into Syro-Mesopotamia in the 
Middle Ages. 

The Iraqi dialects form a unity in the form of the 
second person feminine singular and plural of the 
imperfective with tkitbīn/tkitbūn after the Classical
Arabic pattern but against other Arabic dialects which 
tend to drop the -n. The most remarkable division, 
however, is exemplified by the reflexes of the word for 
‘I said’ (CA qultu), where all bedouin groups employ 
a form gilit, attesting to the Najdi tendency to dissolve 
final consonant clusters with an anaptyctic vowel, as 
do the settled populations in the south, whereas the 
settled peoples of the north employ qəltu, along with
virtually all non-Muslim communities (Christians
and Jews). Thus qəltu is attested as a substrate form
in Baghdad and Basra alongside gilit of the main
population, while it is the regular form in the settled
communities of Mardin and other communities in
Turkey, and in Tikrit, and strays over the border into
Syria at Dayr al-Zūr, Palmyra (Tadmur), the settled
peoples near Aleppo and the Hawran. It is characteristic
also for Khūzistān and Bukhara in Uzbekistan.

No accurate figures are available for the speakers 
of the dialects within Iraq, Syria, and Saudi Arabia 
since they are not constrained by international fron- 
tiers. The population of Iraq is estimated to be about 
25,000,000, 97 per cent of them Muslim and 3 per 
cent Christian, though many have Kurdish or Turkish 
as a first language. Syria’s population numbered 
16,482,000 in 2000, with 90 per cent Muslims and 
10 per cent Christians, and Saudi Arabia 21,607,000, 
including many immigrant workers, all Muslims. The 
population of Kuwait is just over 2,000,000, while 
Bahrayn has roughly 700,000, many of whom also 
speak Farsi or Urdu. 


9.4.1.3 Hijazi and Syro-Palestinian dialects 


The Hijaz is a highly distinctive and culturally 
influential region of Arabia consisting of a coastal 
plain and a long chain of mountains butting sharply 
against the desert and steppe lands of central Arabia 
(al-ḥijāz = ‘the barrier’). As the source of the Islamic
faith, its dialect is regarded by traditional Muslim 
scholars as that of the Qur'an. Demographically it has 
several urban centres, but is a region of pastoralists 
and agriculturalists, with a number of nomadic 
tribes. Little work has so far been done on the dialects 
of the Hijaz, other than the urban ones of Mecca, 
Jidda and Medina, which display an overall unity 
(and curious affinities with Sudanese Arabic), but 
sufficient textual material does not exist to make any 
general observations possible. The most thoroughly 
investigated region is ‘Asir and the nearby coastal
Tihama, where sedentary groups outnumber the 
bedouin, though living in close symbiosis. The 
Tihama displays marked distinguishing features in its 
material culture, which may reflect its proximity to 
Africa. A number of phonological features have been 
noted that set the southern Hijazi dialects off against 
Najdi, notably in the Classical Arabic nominal patterns 
fo‘ala and fa‘alat, which produce in the Hijaz fa‘al and 
fa‘alat / fa‘alan / fa‘alah as opposed to fi‘al and flat /
filah or filat. The more northerly tribes have been 
listed on the map for reference since little is known of 
them, but it should be noted that the Najdi Harb have 
a Hijazi form of their dialect. 


9.4.1.4 Syro-Palestinian dialects 


The Hijazi type of Arabic, however, provides the 
source for the dialects of the Negev and the Sinai, 
Jordan and the Lebanon (Syro-Palestinian), and 
carries over into Egypt. A number of urban centres — 
such as Jerusalem, Beirut and Damascus — have estab- 
lished strikingly similar prestige dialects, which are 
rapidly influencing the speech of the younger 
generations. The Palestinian troubles have had an 
adverse effect on the task of preserving the old 
dialects untouched, so that, though the situation in 
Israel, Jordan and Lebanon is particularly well 
treated descriptively, a paucity of knowledge of the 
Anti-Lebanon and the Syrian highlands has some- 
what obscured the overall picture. The best classifica- 
tion remains that of Cantineau, who divides the 
region into Palestinian—Jordanian, North Syrian and 
Lebanese—Central Syrian dialect groups. The most 
prestigious is that of urban Jerusalem, population 
493,500 in 1989, which lies in a region of settled 
village dialects. These distinguish themselves by the 
articulation of CA q as /k/ and k as /tʃ/. The dialects
of the southern bedouin, on the other hand, along 
with the Hawran, preserve the Hijazi practice of rendering
CA q as /g/, and k in some environments as /c/.
They also share with North Syrian dialects the pro- 
nunciation biktub/baktub for the third person mas- 
culine and first person singular imperfective of the 
verb, as opposed to central Syria, Damascus and Leb- 
anon with byiktub/biktub. 

To the north of these lie the related dialects of
Lebanon and Central Syria, which realize the contrast
noted above by yiktub or byiktub/’iktub or biktub. 
The Lebanese dialects can be classified into four 
groups, all of which show a tendency to diphthongize 
long vowels in final syllables. A Central-North group 
spoken by a homogeneous population of Maronite 
Christians, and a few Sunni Muslims, is characterized 
by giving the ā phoneme two separate articulations,
/al and /a/ > /0/. The Central-South group, with a 
mixed population of Maronites and Druze, preserves 
the q phoneme and the interdentals, particularly d. 
The Southern group is spoken by a population with 
about 80 per cent Shi‘is and, apart from a marked
tendency to imāla (a > e), which recurs in the North
Syrian dialects, shares a number of phonological and 
morphological peculiarities with the Central—South 
group. Finally the Northern group is again of mixed 
population, with Maronites and Sunni Muslims. 
Unfortunately the picture has been very much com- 
plicated by the recent civil war in the Lebanon and 
it is difficult to know precisely what the present situation
is, particularly for Beirut and the Shi‘i population
generally. An isolated pocket of Arabic speech at
Kormakiti in Cyprus has been characterized as Leba- 
nese or Syrian in type. Spoken by a very few survivors 
of the Maronite settlement there, the dialect has lost 
the passive voice and most of the internal plural 
forms of the noun, in marked contrast to Syrian 
Arabic, but does maintain imāla. The apparent presence
of many Greek loanwords in the dialect may be
the result of code-switching rather than actual 
borrowing. 

Providing a bridge between the bedouin dialects 
of the Negev and southern Jordan and the sedentary 
dialects of the Nile Delta, the Sinai Peninsula was till 
recently one of the least understood regions of the 
Middle East, but recent research has identified five 
dialect groups, all bedouin, for three of which, on the 
Mediterranean and Red Sea coastal regions, we now 
have a reasonable amount of material. The southern 
part of the peninsula remains undescribed but its dia- 
lects are likely to be of similar type to the coastal 
ones. Most of the bedouin tribes of Sinai claim des- 
cent from Hijazi groups who arrived as immigrants 
between the fourteenth and eighteenth centuries, and 
the peninsula now supports a population of about 
50,000 divided among ten tribes. With the arrival of 
television and other modern means of communica- 
tion and the development of tourism, the region is 
now heavily exposed to Egyptian influence and the 
prestige dialect of Cairo is widely used in everyday 
conversation. 

Again no accurate figures can be given for the 
speakers of the Syro-Palestinian dialects. There are 
833,000 Arabs in the State of Israel, one-sixth of 
whom are Christian, as against 5,083,000 in Jordan, 
8 per cent of whom are Christian, and about the same 
number in the Lebanon, 25 per cent adhering to 
eleven sects of Christianity, the remainder belonging 
to five branches of Islam. 


9.4.1.5 Yemeni dialects 


The dialects of Yemen, with a population of 
20,000,000, both Sunni and Shi‘i, form a unity distinct
from those of the Hijaz or central Arabia. This is 
probably to be attributed to geographical factors, the 
mountainous terrain in the north with the impene- 
trable barrier of the great southern desert to the east, 
as much as to historical ones, the Yemen being the 
seat of the ancient Himyari or Sabaean culture up to 
the sixth or seventh centuries and densely populated 
to this day by settled and nomadic tribes with ancient 
recorded histories. The lexicon is replete with terms 
peculiar to the area, and the Yemenis take great pride 
in their linguistic and cultural separateness, which is 
reflected even in their classical literature. Showing a 
close relation to the Yemeni dialects are those of 
South Yemen, a predominantly bedouin region, and 
Hadramawt, which is more settled and is contiguous 
in the east with the Omani-type dialects of Dhofar. 

A provisional classification has been proposed 
for the dialects of north Yemen. The Tihama group
on the west coast is clearly marked off from the highland
and plateau dialects, with such features as a
glottal realization /ʾ/ for the phoneme ‘, a form of the
indeterminate noun with the termination -u (m) and 
a feminine subject suffix -n on the third person singu- 
lar perfective verb, yielding bdakan for bdakat ‘she 
went’. This feminine marker is also found on pro- 
nouns. There is of course a clear geographical barrier 
in the mountains between the Tihama peoples and 
those of the highlands and one of the most remark- 
able features encountered in the south of the high- 
land group near Ta‘izz is the preservation there of 
an ancient k- base for the first and second persons 
singular subject suffixes of the perfective verb (t- in
all other forms of Arabic), which may be a relic of 
forms found in Old South Arabian, a non-Arabic lan- 
guage related to Ethiopian Semitic. Throughout the 
highlands and plateau dialects there are complex 
phonological distinctions and much lexical variation. 
The northern dialects around Sa‘da mark themselves 
off from other plateau forms in eliminating the 
masculine and feminine distinctions of the verb and 
the forms of the pronoun, bringing them more into 
line with Hijazi to the north. 

Of the dialects of South Yemen only those of 
Aden, a decidedly mixed dialect as one might expect 
of a major port, and of Dathina and Hadramawt have 
been reasonably described, but no general classifica- 
tion has been made. Dathina shows features usually 
associated with Eastern Arabian dialects. 


9.4.1.6 Omani dialects 


The dialects of Oman, population 2,903,165 (1993 
census) — a vast territory consisting of a fertile and 
well-populated coastal plain with bedouin and settled 
inhabitants, a settled mountainous centre similar in 
many respects to the Yemeni highlands, and vast 
deserts to the south, sparsely populated by bedouin — 
display a number of discrete characteristics, including 
affrication of the phoneme q to /g/ or /j/, and k to 
/¢/ or /¢/. Four major groupings have been identified,
two representing the settled populations of the
mountain core and two the bedouin of the coast and 
the desert. A heterogeneous bundle of features unites 
all dialects except that of Buraymi in Trucial Oman, 
but a clear dichotomy emerges between settled and 
nomadic. There has been little study of these in depth 
and the older works are concerned with settled forms, 
such as Banū Kharūṣ, though Zufar in the south-west,
which is in a bedouin area, is an exception. 


9.4.1.7 Egyptian Arabic 


Egypt represents the furthest west extension of 
Peninsular-type dialects and forms a bridge to the 
domain of the Western Arabic dialects beginning in 
Libya. The population of 76,000,000 makes it the
heaviest-populated Arab country, yet the vast major- 
ity, entirely settled, is concentrated in the Delta and a 
narrow strip of land to either side of the Nile as far as 
Aswan. The deserts to the east and west are sparsely 
populated by bedouin, apart from a few isolated oases 
in the Western Desert and a few mountain com- 
munities on the edge of the Red Sea. Very little is 
known of the dialects of either area but the Western 
Desert has been found to contain the first indications 
of the Libyan form, as might be expected from its 
bedouin character and the absence of any geo- 
graphical barrier with Libya itself. 

Until recently, little was known of the many 
varieties of Arabic spoken in Egypt, other than that 
of Cairo (population now above 10,552,000), the 
prestige form not only for educated Egyptians but for 
international communication in many other coun- 
tries as well. Several isoglosses converge to mark off 
the dialects of Cairo and the Delta from those of the 
Nile valley, corresponding to the old political and 
geographical division between Upper and Lower 
Egypt. The eastern division of the Delta is relatively 
thinly populated and the dialects there stand close to 
the bedouin dialects of Palestine. The phonemes j and
q are distinctive for the Delta in their well-known
realizations as /g/ or /j/, and /g/, /’/ or /q/, respectively,
with /g/ and /’/ characteristic for Cairo, whereas /j/
and /q/ appear to predominate in the Nile Valley. But 
it must be stressed that all dialect boundaries are very 
fluid, as one would expect in a settled region. The 
Fayyūm, with the neighbouring Banī Swēf, forms
a discrete grouping within the Nile Valley, with yet
another for the less densely populated Asyūṭ–Minya
stretch, known to have been infiltrated by bedouin
from the Maghrib in the Middle Ages. The Asyūṭ
boundary marks a major north-south cultural divide 
within Egypt, which is reflected in the lexicon. 
The desert oases of Bahariyya, Farafira, Dakhla and 
Kharga are geographically isolated and conservative 
in their dialect features, but their speech is often of a 
hybrid character, indicating a mixed population. In 
general, however, they conform to the Nile Valley 
type. Although about 6 per cent of the population of 
Egypt is Coptic Christian and historical indications of 
this can be detected in the general dialect geography, 
this has no relevance at the present day. 


9.4.1.8 Sudanese dialects 


The Sudan, with a population of 36,500,000 in 2003, 
is perhaps the most problematic region of the Arabic- 
speaking world. Although Arabic is the principal lan- 
guage, it is impossible to estimate the number of 
its users as a first language. In the Sudan as a whole, 
70 per cent of the population are Muslims, with 
about 20 per cent following indigenous religions and 
5 per cent Christians. The two latter groups are con- 
centrated in the south and speak about a hundred 
non-Semitic languages (see Section 9.7, below), but 
the predominant role of Islam suggests that the over- 
whelming majority of the population will be accus- 
tomed to use Arabic, if only as a second language. 
The Sudan is also the least-researched zone of 
Arabic speech in terms of scholarly description. An 
abundance of texts from all regions has been collected 
and furnished with generally inadequate grammatical 
descriptions and vocabularies, and the Sudanese 
themselves are deeply conscious of their traditional 
values to the point of giving every encouragement to 
presenting their oral culture in the press and on radio, 
but the only dialect analysed to the satisfaction of a 
linguist is that of the camel-breeding Shukriyya, who 
claim descent from ‘Ali ibn Abi Talib, with about
200,000 speakers between the Atbara and the Blue 
Nile. Historically it is known that many of the Arabic- 
speaking tribes came to the Sudan in the Middle Ages 
from Egypt — for instance, the sedentary Ja‘aliyyin 
(Ga‘aliyyin) on the Nile — or from the Hijaz (as the 
Juhayna, who claim descent from Qahtan and
Himyar), with some immigration from Libya in the 
nineteenth century. 

The sparse linguistic information makes it 
impossible to offer any classification of the dialects. 
The Ja‘aliyyin dialects, spoken by sedentaries ranged 
along the bend of the Nile and living in close sym- 
biosis with the bedouin to either side, seem to form a 
discrete group and trace their origins to Egypt, as is 
confirmed by their distinctive speech forms. Some of 
the Juhayna (Guhayna) tribes, notably the Shukriyya, 
the Kababish in Kordofan, and the Rufa‘a, entered the 
Sudan from the Red Sea and in that sense will form a 
unity. The prestige dialect is that of Omdurman, 
while the cattle-breeding Baggara nomads of Kordo- 
fan and Darfur have a highly distinctive dialect with 
many non-Semitic features in it. The articulation /g/ 
for the phoneme /j/ prevails in the Sudan along with 
other Egyptian features. On the Red Sea coast of the 
Sudan, between Tokar and the Eritrean border, are 
found a few pockets of pure Peninsular Arabic of 
Hijazi and Yemeni type spoken by the Rashayda and 
Zubaydiyya, but no study has been made of them. 


9.4.1.9 Chadian dialects 


Chad, with a population of about 10,000,000, con- 
tinues the Sudanese variety of Arabic, represented by 
about 750,000 speakers who maintain the tradition 
that they escaped from the Mamlūks of Egypt in the
fourteenth and fifteenth centuries. Arabic stands
alongside French as the official language. Those living 
in the steppes of Kanem, Batha and Wadday are 
nomadic cattle-breeders claiming Juhayna ancestry, 
while sedentary cultivators occupy the area between 
10 and 13 degrees north, from Wadday to Bornu in 
the west. Some forms of urban Arabic have developed 
and the language of the sultan’s court is emerging as a 
kind of koine. There is much diversity in their forms 
of Arabic but the language is basically homogeneous, 
the differences being largely of a phonological nature, 
and forming little or no hindrance to communica- 
tion. A number of Arabic phonemes have been lost 
(j, , z 5 & q t and d) and have to be realized by other 
means. Among the more remarkable innovations may 
be cited the use of al- as a relative marker, the expan- 
sion of biliteral roots into triliterals by means of an 
added vowel or consonant, and the realization of the 
Classical Arabic emphatic consonants as injectives. 
Pockets of Arabic speech also occur in the border 
regions of North Cameroon, Nigeria and Bornu. The 
influence of local Nilo-Saharan languages on these 
dialects is strong, though the Islamic background of 
the region has ensured a thorough familiarity with 
Classical Arabic and the development of a vigorous 
scholarly tradition in that language. News broadcasts 
on the radio from the capital N’Djamena are 
delivered in a sort of pidgin Arabic. 


9.4.2 Western Arabic family 


The Western Arabic dialects stand in sharp contrast 
to the Eastern group in terms of phonology and 
morphology, and the lexicon contains many items 
peculiar to the Maghrib. They are spoken over 
the northern and Saharan regions of North Africa, 
from Libya to the Atlantic coast of Morocco and 
Mauritania, and are totally incomprehensible to 
Arabic speakers of the east. Their phonology is 
marked by a striking collapse of the system of short 
vowels familiar in Eastern Arabic, to the extent that 
in Jijelli there is only one short vowel phoneme! 
Short vowels in open syllables tend to disappear so 
that their presence in the language is statistically 
much reduced compared with other forms of Arabic, 
particularly in the urban forms of the Moroccan 
dialect group. A further distinguishing feature is the 
presence of an n- prefix before the first persons singular
and plural of the imperfective verb, and a -u suffix
on the latter as well. Within the family there are many 
varieties of dialects, but their classification has proved 
much simpler than is the case elsewhere. 

There is a very clear distinction between the 
settled and bedouin forms, which may be attributed 
to the history of arabicization of the region. In the 
early years of the Islamic conquest of North Africa 
only a few centres of Arab power were established, 
such as at Qayrawan and Fez, and subsequently at a 
number of ports associated with them, while the rest 
of the region remained Berber-speaking. Later the 
older cities, such as Constantine, were arabicized also. 
In the eleventh century, massive invasions of the 
bedouin Banū Hilal and Banū Sulaym into the non-Arabic-speaking
regions led to the spread of Arabic
from Alexandria down into the Fezzan and along the 
Mediterranean coast to western Morocco, and down 
the Atlantic coast as far as the Senegal River, leaving 
the Berbers in isolated pockets in the more moun- 
tainous regions. The striking cleavage between settled 
and bedouin dialects that resulted from this is still 
very much in evidence today. The settled dialects 
appear in two forms, those of eastern Algeria as far 
as Tripoli in Libya, and a western group covering 
western Algeria, Morocco and Mauritania. The 
reduction in the short-vowel system as one proceeds 
westwards is striking. A distinction can be drawn 
between the urban dialects of the cities and those of 
the villagers of such settled regions as the Sahel of 
Tunisia, the hinterland of Jijelli and Philippeville in 
Algeria (the latter heavily influenced in their phono- 
logical systems by their urban neighbours), and the 
Ghumāra and Sanhaja regions of Morocco.

There are four main types of bedouin dialects in
North Africa. Their effect was strong enough to kill
off the old city dialects of the region. Sulaymi Arabic 
covers the area between Alexandria in the east and 
Mejerda in the west. A very conservative form is 
found in the south of Tunisia, where such typical 
Eastern bedouin Arabic features occur as the preser- 
vation of gender distinction in the second and third 
persons of the plural, and the internal passive of the 
simple verb. The Hilali dialects have two forms. The 
eastern covers the area from central Tunisia into Alge- 
ria as far as the Awrés, while a central form is found 
in the Saharan districts of Algeria, the western part of 
Constantine and the central Tell. It is also spoken by 
camel-breeding nomads of the Sahara and shows 
signs of spreading. Finally, in western Algeria and 
Morocco, the Ma‘qili dialects represent the speech of 
bedouin invaders of the thirteenth century who grad- 
ually spread from eastern Morocco to the south-west, 
capturing Mauritania in the fifteenth century, where 
the Hassaniyya dialect is spoken by around 3,000,000 
people. The population statistics for the other North 
African states accord with their histories of bedouin 
encroachment and their settlement patterns. Libya 
and Tunisia had 5,114,000 and 9,619,000 inhabitants 
respectively in 2000, the latter including a very few 
Jews, while Algeria and Morocco had 31,471,000 and 
28,778,000 respectively, with a large proportion of 
Berber speakers. 

In the islands of Malta and Gozo — with popula- 
tions of 394,583 (2001) and 26,064 (1990), all Roman 
Catholics — a form of Western Arabic, originating in 
Tunisia, but with certain Eastern features such as the 
imāla, has been spoken by the Christian population
for centuries. With the island freed from Muslim rule 
in the thirteenth century, this has now developed into 
a language in its own right, with several rural and 
urban dialects. Its speakers are bilingual with English. 
It was also recorded for parts of Sicily and for Pantel- 
leria as late as the eighteenth century. There are a 
number of borrowings from the Romance Italian and 
Sicilian languages in Maltese. It is now written in a 
modified Roman script, officially adopted in 1934, but 
the Arabic origin of the language is evident. The 
phonemes ʿ and ġ have coalesced and in the present-day
language are apparent only in the pharyngealized
articulation of adjacent vowels. The merging of the
phonemes ḫ and ḥ has necessitated the creation of a
special letter ħ. Maltese has a considerable literature,
including some fine poetry. The dialect of Gozo dif- 
fers slightly from that of the main island. 


9.5 Modern South Arabian 
family 


These non-Arabic languages are limited to a small 
area around the Dhofar region of Arabia and the 
island of Soqotra in the Indian Ocean. They are remnants
of a language that was probably spread over the
whole of South Arabia in ancient times and they 
share many distinctive features with the Ethiopian 
Semitic languages. The most prestigious of them is 
Mehri, with a southern dialect spoken in the hinter- 
land of Mukalla, and a northern one spoken by semi- 
nomadic camel-breeders around Negd in Dhofar, 
some bilingual with Shehri. In all there are about 
70,000 Mahra, but many of these speak only Arabic. 
Close to southern Mehri is the Harsūsi language,
which is spoken by about a thousand people. They are 
not related to the Mehri speakers, but have the same 
high social status. About 25,000 cattle-breeders of
varying social standing, some of whom were origin- 
ally serfs, speak Jibbali (or Shehri) in the mountains 
of Dhofar and on the coast, with a distinct dialect 
spoken on the Kuria Muria islands. Bathari is spoken 
by communities of pastoral cave-dwellers and fisher- 
men of low social caste on the coast. They are few in 
number and serfs to the Mahra; their dialect is close 
to Mehri. Soqotri is spoken by about 60,000 people of
varying social class, including the bedouin and semi- 
nomadic cattle-breeders of the high central plateau, 
fishing communities on the coast, and the former ruling
class who were of Mahra origin.

These languages are not mutually comprehen- 
sible and all have highly complex phonological 
systems revealing much fluidity, the result of environ- 
mental conditioning. Notable features of their phon- 
ology are the preservation of an ejective articulation 
for the Semitic emphatic consonants, as in Ethiopian 
Semitic, a reduction of the pharyngeal ‘, and the 
presence of voiceless and voiced laterals, ś and ź. The
verbal system basically conforms to the North Semitic 
paradigm, but with no fa- or n- prefix stems. Syl- 
labication of the imperfective verb follows the Ethio- 
pian Semitic pattern (perfective katéb, imperfective 
yakétab, jussive yaktéb, as compared with Ethiopic 
nigiri, yanigar, yangiir), while the subject suffixes 
of the first and second persons of the perfective
verb have the Ethiopic k-base as opposed to North
Semitic t-.


9.6 The Berber languages 


The Berbers represent the aboriginal population of 
North Africa and are to be found from Siwa Oasis in 
western Egypt all the way west to the Atlantic coast of 
Morocco, and southwards to the Zenaga in the south 
of Mauritania. Their territory has been substantially 
limited by the advance of the Arabs throughout 
history and it is noticeable that their heaviest concen- 
tration is to be found in Morocco, where the great 
confederations of the Zenata, Masmūda and Sanhaja
inhabit the mountains of the Rif, and the Middle and 
High Atlas, and in Algeria, where they concentrate in 
Kabylia in the north, the Awrés in the south-east and 
the Tell country, with further groups along the border 
between Algeria and Morocco, in the Saharan oases of 
Algeria and Morocco and the great triangular zone 
between Ghadamis, Timbuktoo and Zinder, which is 
the home of the Touareg. There they are reasonably 
protected from the Arabs. On the other hand, in 
Tunisia and Libya they are reduced to isolated pockets 
in mountainous regions and again in the southern 
oases or on the island of Jarba and are far fewer in 
number, sometimes only a few hundred to a com- 
munity. In Tunisia, for example, in a population of 
9,619,000 they account for only 1 per cent, as opposed 
to about 30 per cent in Algeria, with a population of
31,471,000, and 40-50 per cent in Morocco, with a
population of 28,778,000. No figures are available on 
Mauritania. While they tend to maintain contact with 
neighbouring Berber communities, they no longer 
have any sense of a common bond and their peculiar 
distribution is reflected in the fact that it is rarely 
possible to find a homogeneous grouping of dialects 
save where geographically determined, as in Kabylia 
and Awrés, and parts of Morocco. 

Generally, it is no longer fashionable to conceive 
of the Berber language as a bundle of dialects but 
rather as a common language with a basic vocabulary 
split up into a large number (something like 5,000) of
local forms. There are estimated to be over 5 million
Berber speakers, most of whom are bilingual with 
Arabic, but they are still receding before the advance 
of the prestigious Arabic. Although the Berber dia- 
lects have been well described, there is still no com- 
plete table of correspondences for their phonology on 
a comparative basis that might lead to a classification, 
and they do not in any case appear to follow any 
traditional classification of the populations. There is a 
noticeable division into occlusive and spirant dialects, 
but every phenomenon isolated occupies a district 
peculiar to itself. Mutual comprehension is usual 
between neighbouring communities, but this recedes 
with distance. Communication with neighbours in 
any case tends to be achieved in Arabic. 

A basic system of phonemes common to the 
whole Berber family has been established, but environ- 
mental factors have complicated the picture enor- 
mously, as has the widespread influence of Arabic on 
the language. Basset’s system as modified by Galand 
(1960) is as shown in Table 9.2.


Table 9.2 The phonological system of Berber

                     Plosive   Fricative   Nasal   Lateral   Rolled
Consonants
  Labials            b         f           m
  Dentals            t d ḍ                 n                 r
  Sibilants                    s z ẓ
  Palatal sibilants            š ž
  Velars             k g
  Uvular                       ɣ
Semi-vowels          y w
Vowels               a i u


Emphaticization is particularly noticeable, with such phonemes as
emphatic s, r, l and z occurring, while the vocalic triad
a/i/u undergoes much environmentally conditioned 
gradation. Consonants as well as vowels may be short 
or long, a factor reflected in the lexicon and the 
morphology. Much as in Arabic, morphology is 
effected by means of a consonantal root (which may 
have from one to four radicals, but usually two or 
three) combined with an inflection. The patterning of 
vowels in the root provides the grammatical identity, 
such as noun or verb, number, and so on, while the 
inflection completes the item. The extreme phono- 
logical and morphological complexity of Berber often 
makes the task of identifying the root very difficult. 
Again as in Arabic, verbs may be simple or derived, 
derivation being achieved by prefixes, but the system 
is much simpler than Arabic, providing only causa- 
tive, passive and reciprocal. The verb is basically 
aspectual, and tenses and moods are created by means 
of inflectional suffixes. Since there is no tradition for 
written Berber, syntax conforms to the expectations 
of spoken grammar, employing expressive procedures 
such as topicalizing or emphasizing by means of 
anticipation. Parataxis is the norm and there is little 
subordination. 


9.7 The languages of the Sudan 
(Nilo-Saharan languages) 
David Appleyard 


It has been estimated that about half the population 
of the Sudan speak languages other than Arabic as 
their first language. These non-Arabic languages of 
the Sudan mostly belong to the Nilo-Saharan phylum, 
which is also spread across adjacent African countries 
from the western border lands of Ethiopia through 
Kenya, Tanzania and Uganda in the east, and through 
Chad and the Central African Republic, extending as 
far as Mali and parts of Niger, Upper Volta and Benin 
in the west. The largest number of Nilo-Saharan lan- 
guages, however, is spoken within the Sudan. Aside 
from these, also spoken in the Sudan are a number of 
Niger-Congo languages (such as Zande, Fulbe/Fulani, 
Talodi) and (excluding Arabic) a small number of 
Afro-Asiatic languages: Beja (Cushitic stock), Hausa 
(Chadic stock) and two other Chadic languages, 
Kajakse and Kujargé. Hausa, spoken in parts of 
Darfur, arrived from further west as a language of 
trade. 

The pressure exerted by Arabic upon the other 
languages of the Sudan is immense, particularly in 
the Nile Valley and western Darfur, such that many 
languages have recently become extinct — for instance, 
Birgid (Nubian family) — or are likely to die out in the 
near future. In western Darfur, for example, competence
in indigenous languages amongst young
people has decreased during the last few decades, and
as Arabic has spread as a lingua franca and language
of instruction wherever trade and government services
have spread, there is now an almost universal
knowledge of Arabic. Most non-Arab populations
are multilingual, with trilingualism (mother tongue, 
another vernacular and Arabic) being more prevalent 
than simple bilingualism. The extent of this spread of 
Arabic is closely tied up with attitudes towards
‘sudanization’, that is, the adoption of what has been
called the Nile Valley lifestyle. Sudanization is the 
principal detribalizing factor and is closely allied 
with islamization and arabicization. In the southern 
Sudan, therefore, where these factors are more actively 
being resisted, the non-Arabic languages are perhaps 
in a better position. 

The total number of languages spoken in the 
Sudan is well over 100, but the problem of distinguishing
language from dialect makes it difficult to
be more precise. Of these, Arabic as the official
national language obviously enjoys the overwhelm- 
ingly dominant position, though in this context it has 
to be kept in mind that there are in fact not one, but 
several, varieties of Arabic current in the Sudan, ran- 
ging from Classical Arabic as the language of Islam, 
through Modern Standard Arabic as the language of 
the state apparatus, to at least three varieties of col- 
loquial Arabic: Sudanese colloquial, Chad colloquial 
and several pidgins (e.g. Juba Arabic). Reflecting the 
different attitude of the southern Sudan, however, 
in 1972 English was acknowledged as the principal 
language of the region, as no one indigenous lan- 
guage holds a particularly dominant position. Up to 
1969 English was also the language of secondary 
school education in the north. 

In ancient times Meroitic, the language of the 
kingdom of Meroé centred on the Nile north of 
the sixth cataract, came to be written in a script of 
its own, deriving from the contemporary, cursive 
Egyptian script. The linguistic affiliation of Meroitic 
is still disputed by scholars and connections with two 
of the major language groups of the modern Sudan 
have been suggested: Nilo-Saharan (and in particular 
East Sudanic) and Afro-Asiatic (Cushitic), the latter 
being the less defensible. Historically, the only other 
non-Arabic language to attain the status of a written 
medium in the Sudan was Old Nubian during the 
heyday of the Christian kingdoms (approximately 
eighth century AD). Old Nubian was written in an
alphabet of its own, combining the Coptic alphabet 
with certain characters deriving from the old Meroitic 
script. 


9.7.1 Nilo-Saharan 


The Nilo-Saharan phylum probably contains around 
ninety distinct languages and dialect clusters, focused 
on the White Nile from the southern frontier of 
Egypt to Uganda, and the Blue Nile in the area of the 
Sudanese—Ethiopian border, with a westward exten- 
sion across the Sahel as far as Mali. The phylum is a 
complex one and individual stocks and families such 
as Central Sudanic and Nilotic, for example, are in 
turn large and complex so that current work on the 
Nilotic family is still revealing an increasing number 
of languages. Various schemes of classification of 
Nilo-Saharan have been proposed, though not all the
details of genetic affiliation have yet been worked out; 
one such may be represented by Figure 9.1. A more 
recent (1989) modification of this scheme based on a 
close comparison of pronominal stems suggests a 
finer definition into ‘Peripheral Nilo-Saharan’, where 
the picture of genetic affiliation is less clear, and 
‘Core Nilo-Saharan’, where a clearer pattern may be
observed (Figure 9.2). 

Nilo-Saharan
  Songhay
  Saharan
  Maba
  Fur
  East Sudanic
    Eastern E. Sudanic: Nubian, Surma, Nara, Eastern Jebel
    Western E. Sudanic: Nyima, Temein, Tama, Daju
    Nilotic: West Nilotic, East Nilotic, South Nilotic
    Kuliak
  Central Sudanic: Bongo-Bagirmi, Kresh, Moru-Logbara-Madi, Mangbetu-Asua, Mangbutu-Efe, Lendu
  Berta
  Kunama
  Komuz

Figure 9.1 Nilo-Saharan phylum (typical scheme)


Nilo-Saharan
  ‘Peripheral’: Songhay, Saharan, Maba, Berta, Kunama (+ Kuliak, + Fur)
  ‘Core’
    East Sudanic: Nubian, Surma, Nara, Jebel, Nyima, Temein, Tama, Daju, Nilotic
    Central Sudanic: Moru/Madi/Logbara, Mangbutu-Efe, Mangbetu, Kresh, Baledha (Lendu), Aja, Bagirmi, Yulu-Binga, Sinyar, Bongo
    Komuz: Koman, Gumuz
    Kadugli-Krongo

Figure 9.2 Nilo-Saharan phylum (1989 scheme)


Whilst the details of internal Nilo-Saharan classification
may still be unsure, it is generally agreed
that the phylum may be divided into at least ten
stocks: Songhay, Saharan, Maba, Fur, the East Sudanic 
stock (further divided into two substocks, and thence 
perhaps nine families), the Central Sudanic stock 
(further divided into perhaps six families), Komuz, 
Berta, Kunama and Kadugli-Krongo, with Kuliak 
variously classified as a separate eleventh branch or as 
a member of the larger East Sudanic stock. Of these 
various subgroupings of Nilo-Saharan, only Songhay 
and Kuliak are not represented in the Sudan. 

The Nilo-Saharan languages show a wide range 
of structural types, some languages (e.g. Turkana, 
Kunama) possessing very complex morphologies. 
Work on comparing the grammatical patterns of 
Nilo-Saharan has in effect only just begun and is 
hampered by the disparate amount of data available 
for the various languages. Some languages, such as 
Turkana again and some of the Nubian languages 
(Kenuz-Dongola dialects) have been well described, 
whilst about others (Daju or Tama, for example) we 
know very much less. 

Many Nilo-Saharan languages possess morph- 
ologies of an agglutinative or inflectional nature. 
Turkana (Nilotic stock, Central Nilotic family), for 
example, is a highly agglutinative language, as are 
Fiyadikka-Nobiin (East Sudanic stock, Nubian family)
and Kunama (single-language stock). Word order in
Nilo-Saharan varies between SVO (Subject-Verb-Object),
as in Temein (Western East Sudanic substock),
and SOV, as in Meidob (Nubian family).


9.7.1.1 Saharan 


The Saharan stock is generally divided into three 
branches: Kanuri, Teda-Daza and Berti-Zaghawa, of
which only the last is represented in the Sudan.
Berti is now extinct. 

Zaghawa (own ethnic name beri, language 
name béria) has 108,000 speakers (1983) in western 
Sudan (Wadai-Darfur) plus a further 77,834 in Chad 
(1993 census). The Bideyat are sometimes regarded as 
a separate group, but are to all intents and purposes
identical to the Zaghawa. 


9.7.1.2 Maba (Maban) 


The name Maba is strictly speaking that of one of the 
languages of this stock, hence the name Maban for 
the whole group. As there is, however, a danger that 
this might be confused with Mabaan, a Nilotic lan- 
guage of the Northern subgroup, the name Maba 
will be maintained here for the whole stock. Of the
three subgroups of Maba, namely Maba proper (also called
Mabang), Masalit and Runga (more properly Aiki)-Kibet,
only one (Masalit) is represented in the
Sudan.


Masalit (own ethnic name masald, language 
name kaana masald) has 240,000 speakers in total 
(66,710 in eastern Chad (1993 census) and 173,810 in 
western Sudan). Masalit is now coming under heavy 
pressure from Arabic, particularly around the fringes 
of its area and in the urban and semi-urban com- 
munities. There is one dialect distinct from the 
main language: Surbakhal. Many Masalit are at least 
bilingual in the neighbouring languages, such as 
Daju, Maba and Erenga. 

The Mimi (Mima) language, spoken within the 
Sudan by a small number of nomadic pastoralists 
across Wadai-Darfur and into Kordofan, but princi- 
pally in Chad, has been classified as Maba by some 
linguists, but is now generally regarded as belonging 
to the Fur stock. The confusion seems to have arisen 
through the varied use of the name Mimi referring to 
more than one language. Mimi proper (also called 
Biltine) is closely related to Fur. Of the total number 
of perhaps 50,000 speakers, only 5,000 live within the 
Sudan. 


9.7.1.3 Fur 


The Fur stock consists of two single-language
branches: Fur proper and Mimi (see above). Fur (own
ethnic name furakay) — 500,000 speakers (Bender 
1983) — is spoken widely in Darfur and in scattered 
colonies as far east as Gedaref on the Ethiopian bor- 
der. The homeland of the Fur has been subjected to 
increasing arabicization since the late seventeenth 
century, and neighbouring languages such as Berti 
and Birgid have since disappeared, but Fur itself 
continued to expand westwards. Because of this 
expansion and the associated effects of slavery and 
population resettlement within the region, the Fur- 
speaking population is now very mixed. In urban 
contexts, Fur is now giving way to Arabic. 


9.7.1.4 East Sudanic stock 


The East Sudanic stock of Nilo-Saharan comprises 
the largest number of languages of the entire phylum 
and presents the most complex picture of genetic 
classification. In one major classification, East 
Sudanic is further divided into three substocks: 
Eastern, Western and Nilotic, whereas elsewhere 
Nilotic is seen as one family of a two-substock system. 
The Nilotic substock or family is itself particularly 
complex and involves at least twelve — and perhaps as 
many as twenty — distinct languages. 


9.7.1.4.1 Eastern East Sudanic


Nubian


The Nubian family (see Figure 9.3) comprises the 
most northerly Nilo-Saharan languages, spoken along 
the Nile up to the Sudanese—Egyptian frontier. Other 
members of the family are scattered further west 
across Darfur (Meidob and the extinct Birgid) and 
North Kordofan. 

Meidob (own ethnic name tid, language name 
tid-n-aal) has 50,000 speakers (1993). There are two 
dialects, Urrti and Shalkota. 

Birgid is now extinct. 

Kenuz-Dongola (own names konuuz and
dongolawi, hence also called Dongolawi) has 180,000
speakers (1996) in the Sudan plus a further 100,000
in Egypt. Kenuzi (also called Kenzi) and Dongolawi
form distinct, but closely related dialects.


Figure 9.3 Nubian family (tree diagram; nodes: Proto-Nubian, Old Nubian, Central Nubian, Nobiin, Meidob, Birgid, Kenuz-Dongola, Hill Nubian)

The Hill Nubian group is a dialect or language 
cluster spoken in the Nuba Hills of Kordofan and 
comprising perhaps four branches: Kadaru (12,360 
speakers in 2000) together with Debri; Dilling (own 
name warki-m-be, with 5,295 speakers in 1984); 
Thaminyi (own name faminyi-ni-be) together with 
Kodhin (own name kodin ni ai); Dauri (also called 
Dair — 1,000 speakers in 1978). 

Nobiin: spoken along the Nile both in the Sudan 
(295,000 speakers in 1996) and in southern Egypt 
(200,000 speakers). Dialects are Mahas and Fiyadikka. 


Surma 


According to a recent analysis, the Surma family 
comprises at least ten branches, spoken both in the 
Sudan and in Ethiopia. The precise subgrouping of 
the family is not as yet clear, though one classification 
that has been proposed sees a fundamental division 
between Majang and the rest of Surma (Southern 
Surma), which is in turn divided into a South- 
western and a South-eastern group (see Figure 9.4). 


Surma
  Majang
  Southern
    South-western: Murle, Didinga; Zilmamu, Bale
    South-eastern: Kwegu, Muguji; ‘Nuclear Surma’ (Me’en, Mursi, Tirma)

Figure 9.4 Surma family


Of the Surma languages, only Murle, Didinga
and Tirma are spoken in the Sudan. 

Didinga (own name 630 ki @id’ina) has 100,000 
speakers (2000). There are two closely related dialects, 
Chukudum and Lowudo. It has been conjectured that 
Murle and Didinga should be regarded not as separ- 
ate languages, but as dialects of the same language. 

Murle (own name murlén) has 60,000 speakers in 
the Sudan (2000), with a further 6,000 in Ethiopia. 

Tirma (own name surma) has 3,305 speakers in
the Sudan (2000) and is also spoken in Ethiopia. It is
closely related to the Suri and Mursi of Ethiopia, with
speakers totalling between 10,000 and 15,000.


Jebel languages 


The Jebel (or Eastern Jebel) family of languages, 
named after the hills (Arabic jebel) in Blue Nile 
Province in Eastern Sudan where they are spoken, 
forms one of the least-known branches of East
Sudanic. Recent research has shown that there are at 
least four Jebel languages and not just one (Tabi) as 
previously thought: 

Tabi (more properly called Gaam and also 
known as Ingessana — own name jog gaam ‘people of 
the hill’, language name kor e gaam ‘language of the 
hill’) had 67,161 speakers in 2000. The three most
important dialects are Kukur, Soda and Bau. 

Aka (own language name wlu-ka-fa-c-aka ‘lan- 
guage of the people of Aka’): 300 speakers (Bender 
1989). 

Kelo (own language name wndu-faa-keelo ‘lan- 
guage of the people of Kelo’): a couple of hundred 
speakers. 

Molo (own language name twra-ko-molo ‘lan- 
guage of Molo’): a couple of hundred speakers. 


9.7.1.4.2 Western East Sudanic


Daju: 143,053 speakers (2000). Many of the younger
Daju of Darfur now speak Arabic. Daju properly
consists of a group of dialects or very closely related 
languages. These have been classified into two groups: 
Eastern Daju (Shatt and Liguri) and Western Daju 
(Nyala-Lagowa, Sila, Mongo, Nyalgulgule and Beygo). 

Nyima (also called Nyimang): 70,000 speakers 
(1982). 

Tama: 60,000 speakers in the Sudan, consisting of 
44,500 Tamok, 7,500 Erenga, 6,000 Jabaal and 1,500 
Orra, plus a further 63,000 speakers in Chad (1993). 
The sources for the Tama cluster are confusing and 
contradictory. Other names applied to Tama ethnic 
groups are Mararit, Sungor (the same as Erenga?), 
Mileeri (same as Jabaal) and Mun. 

Temein (own name lonot na roye): 10,000 speakers 
(1984). There are three languages subsumed under 
Temein: Ronge (Temein proper), Doni (also called 
Jirru) and Dese. 


9.7.1.5 Nilotic stock 


Languages belonging to the Nilotic stock are spoken 
in Ethiopia, Kenya, Uganda, Tanzania and Zaire, as 
well as in the Sudan. Nilotic is generally further 
divided into three families: North Nilotic (also 
called West Nilotic), Central Nilotic (also called East 
Nilotic) and South Nilotic. Only members of the first 
two families are represented in the Sudan. 


9.7.1.5.1 North Nilotic family 


Anywa (own language name d°a anwa ‘mouth of 
Anywa’; also called Anuak, Anwak, etc.): 52,000 
speakers in the Sudan (1991) plus a further 45,646 in 
Ethiopia (1998). 

Burun: 18,000 speakers (1977). A number of 
dialects are identified: Ragreig, Abuldugu (also 
called Bogon), Maiak, Mufwa (also called Mopo) and 
Mughaja. 

Colo (own language name do Joolo — also called 
Shilluk): 175,000 speakers (1982). 

Jieng (own name jier; also called Dinka): 450,000 
speakers (1982). There are many dialects: Abiem, 
Agwok, Apuk, Awan, Lau, Malwal, Paliet, Palioping 
and Tuic. 

Lwo (own name dé Ito; also called Luo, Luwo, 
Dhe Lwo): 80,000 speakers (1983). Closely related Lwo 
dialects are also spoken in Uganda, Tanzania and 
Kenya. 

Mabaan (also called Jumjum): 25,000-50,000 
speakers in the Sudan plus a further 2,000 in 
Ethiopia. 

Nuer (own language name fog nddd ‘mouth of 
Nuer’, hence also called Naath): 740,000 (1982) 
speakers in the Sudan (including 3,000 Western 
Jikany, 17,600 Eastern Jikany, 12,500 Lou, 1,100 
Nyuong, 2,500 Thiang, 5,900 Bul, 2,400 Jagai, 6,700 
Laak, 4,900 Leik and 1,600 Door) plus 64,907 
speakers (1998) in Ethiopia. 


9.7.1.5.2 Central Nilotic family 


Central (or Eastern) Nilotic comprises about thirty 
languages and dialects spoken in Ethiopia, Uganda, 
Kenya, Tanzania and Zaire, as well as in the Sudan. 
The two largest Central Nilotic languages, Teso and 
Maasai, are spoken outside the Sudan. The Central 
Nilotic family has been classified internally as in 
Figure 9.5. None of the Ongamo-Maa languages and 
only a few of the Teso-Turkana group are spoken in 
the Sudan. 

Bari (own name kituk nd bari): 420,000 speakers 
in the Sudan (2000), together with speakers in 
Uganda and Zaire totalling about 454,000. The Bari 
dialects of the Sudan are Kuku, Nyangbara and 
Pojulu. 

Lotuko: 185,000 speakers (1982). Dialects are 
Dongotono, Oxoryok (Koriok), Logiri (also called 
Lango), Lomya, Lopit, Lowudo and Lokoya. 


Teso-Turkana group or cluster 


Jiye-Toposa: 100,000 speakers in the Sudan (2000),
plus a further 10,000 in Ethiopia. Jiye and Toposa are
different dialects, though both are spoken by the
Sudanese Jiye. Turkana is spoken by at least 250,000
people, mainly nomadic and moving within north-western
Kenya, as well as a few over the border into
Ethiopia and the Sudan.


Figure 9.5 Central Nilotic family

Nyangatom (own name dyatoom) is another 
member of the Turkana cluster spoken in the Sudan, 
as well as in Ethiopia. 


9.7.1.6 Central Sudanic stock 


The internal classification of the Central Sudanic 
stock of Nilo-Saharan is as yet unclear; one analysis 
divides it into maybe six families whilst another sug- 
gests ten subgroupings or families. A recent study 
(1989) tentatively identifies a ‘core group’ of lan- 
guages, consisting of Bagirmi (Sara languages), Yulu- 
Binga, Sinyar (Shemya) and Bongo, and a series of 
‘peripheral’ languages, consisting of Moru-Madi, 
Mangbutu-Efe, Mangbetu, Kresh and Lendu
(Baledha). Quite a different scheme of the internal 
classification of Central Sudanic, also recently pro- 
posed, is illustrated in Figure 9.6. 

Figure 9.6 Central Sudanic stock


9.7.1.6.1 Moru-Madi family 


Moru: 70,000 speakers (1982). Dialects are Agi, Andri, 
B’alib’a, Kadiro, Lakamad’i, Miza and Moruwad’i. 

Madi: 18,000 speakers in the Sudan (1982) plus a 
further 215,000 in Uganda. Dialects are Pandikeri, 
Lokai and B'urulo. 


9.7.1.6.2 Bongo cluster 


Bongo: 10,084 speakers (2000). Closely related is 
Baka with about 25,000 speakers in the Sudan (1993) 
and 1,300 in Zaire. 


9.7.1.6.3 Yulu-Binga cluster 


Yulu: 3,000 speakers (1987); Binga: 2,000 speakers. 
Yulu is also spoken in the Central African Republic. 
Many speakers of Yulu in the Sudan are bilingual in 
Kresh or Arabic. 


9.7.1.6.4 Kresh 


Kresh (own name kpaya, hence variant name Kpara, 
Kpala) forms a single-language subgrouping of Central
Sudanic. It is spoken by about 16,000 people as a first
language plus a further 4,000 as a second language.
Some Kresh is also spoken in the Central African
Republic. Dialects are Naka, Gbaya-Ndogo, Gbaya-Ngbongbo,
Gbaya-Gboko, Gbaya-Dara, Dongo and
Orlo (Woro). Aja, own name gbaya (200 speakers in
1993), is mostly regarded as a member of the same 
subgrouping as Kresh, or even as a dialect of the 
latter. Others regard Aja as a separate branch of 
Central Sudanic. 


9.7.1.6.5 Sinyar cluster 


Sinyar (also called Shamya, Shemya; own language 
name taar samyan ‘mouth of the Sinyar’): 12,321 
speakers (2000) in Chad and the Sudan. The Sinyar 
are generally trilingual, also speaking Fur and Arabic; 
many also speak Daju or Masalit as well. 

Fongoro: about 1,000 speakers. Most Fongoro 
have assimilated to the surrounding Fur and their 
language is likely to become extinct in the near future. 

Aja (see Kresh, section 9.7.1.6.4).


9.7.1.7 Berta stock 


Berta, which forms a discrete branch (stock) of Nilo- 
Saharan, is still somewhat poorly recorded. There are 
a number of dialects subsumed under the name 
Berta, and possibly more than one language will need 
to be identified here when more data are available. 
There are up to 150,000 speakers in total, 124,799 of 
whom live in Ethiopia (1998 census) and the 
remainder in the Sudan. The Berta (own name bor0a, 
language name ndui bdr61i) are also known in the lit- 
erature by the terms Beni Shangul, Shanqilla, Gamila, 
Watawit, Gebelawin and other names. There are 
several distinct dialects (Undu, Mayu, Fadashi, 
Gebeto), often grouped into two as Sudanese and 
Ethiopian Berta, though the Berta themselves regard 
their language as a single unit. The number of 
speakers was estimated in 1976 as 50,000 in Ethiopia 
alone, with perhaps a further 10,000-20,000 in the 
Sudan. 


9.7.1.8 Komuz stock 


The Komuz stock consists of two branches, Koman 
and Gumuz. The latter is regarded as a single lan- 
guage, while Koman consists of a number of closely 
related languages. 


9.7.1.8.1 Koman family 


Komo (also called Koma; own language name fa 
komo, ethnic name go-kwom): 10,000 speakers in the 
Sudan; perhaps 1,500 speakers in Ethiopia. 

Twampa (own ethnic name k’wani-m-pa ‘people 
of the homeland’, own language name t’wa-m-pa 
‘language of the homeland’; also known as Uduk in 
the literature): approximately 10,000 speakers. 


9.7.1.8.2 Gumuz 


Gumuz (own language name ingofi baxa ‘language
of people’); the people are also called Beni Shangul,
though this name more properly applies to one of 
the Berta groups in Ethiopia. Other names appearing 
in the literature are Gumis and Ganza, though again 
the latter is also applied to another people, the 
Omotic-speaking Hozo-Sezo. Gumuz has 120,424 
speakers in Ethiopia (1998 census), plus a further 
30,000-40,000 in the Sudan, making a total of 
perhaps 160,000. Dialects are Disoha, Dakunza, 
Dukuna, Sai, Sese, Dekoka, Dewiya, Kukwaya, 
Gombo and Modea. 


9.7.1.9 Kadugli-Krongo family 


The Kadugli-Krongo languages were formerly
regarded as belonging to the Niger-Congo phylum.
Such a classification would seem to rest primarily on 
lexical comparisons, whilst an examination of the 
grammatical evidence is far less suggestive of a Niger- 
Congo relationship. It is now mostly agreed that 
Kadugli-Krongo forms a separate branch of the Nilo- 
Saharan phylum. Nine languages have been identi- 
fied: Kamdang, Keiga, Kanga, Miri, Kadugli, Katcha 
(Katla), Tumma, Krongo and Tumtum. 


Kadugli (own name foma ma dalla, ka dalla): 
50,000-70,000 speakers. Some Daju speakers live amongst
them. 

Kamdang (also called Tulishi (own name ka- 
twlifi) in the literature): 8,628 speakers (1997). 
Dialects are Tulishi, Turuj and Kamdang. 

Krongo: 21,000 speakers (1984). 

Of the other languages not on the map, Keiga 
(also called Aigang; own name aigay) has 6,000
speakers, Kanga (own name kaya) 6,300 speakers, 
Katcha (own name ka-dolubi) 1,500 speakers, Tumma 
5,200 speakers and Tumtum 7,300 speakers. 


9.8 The languages of Ethiopia, 
Eritrea, Somalia and Jibuti 
David Appleyard 


The languages of Ethiopia and Eritrea belong to 
several different stocks: Semitic, Cushitic, Omotic, 
Nilotic, Komuz and the single-language ‘families’ 
Berta and Kunama. The first three of these all belong 
to the super-family, or phylum, Afro-Asiatic (the 
other members are Chadic, Berber and the extinct 
Ancient Egyptian) and share a significant number of 
inherited structural features. Almost all Omotic — and 
the large majority of Cushitic — languages are spoken 
within Ethiopia; other Cushitic languages spread 
across the Horn of Africa from the Sudan to Jibuti 
and Somalia, and into Kenya, with isolated pockets as 
far south as Tanzania. Whilst Ethiopia also possesses 
the largest number of living Semitic languages with 
perhaps as many as twenty, of which Amharic (the 
official language of Ethiopia) and Tigrinya rank 
second and third, respectively, after Arabic in terms of 
numbers of speakers, the traditional home of Semitic 
is in the Middle East and across the Red Sea in Arabia, 
from where it is generally accepted that Semitic 
speech entered Ethiopia some time during the first 
millennium BC.

The remaining language families and the single- 
member ‘families’ Berta, Kunama and Mikeyir all 
belong to the Nilo-Saharan superfamily or phylum 
and are restricted to the western and south-western 
fringes of Ethiopia. The other languages of the Nilo- 
Saharan phylum stretch across Africa over the Sahel 
and down the Nile, from Mali in the west to the 
borders of Ethiopia in the east, from southern Egypt 
in the north to Uganda, Kenya and Tanzania in 
the south. Of the numerous branches of Nilo- 
Saharan, those represented in Eritrea are Kunama 
(a single-language isolated branch) and Nara 
(another single-language branch). In Ethiopia there 
are the Surma languages, Nilotic (Nuer, Anywa, 
Nyangatom-Turkana), Berta (another single-language 
branch) and Komuz (Gumuz, Komo, Shita, Kwama). 
In addition to these, there is one small language in 
Ethiopia that to date has defied classification and 
must at present be regarded as a language isolate: 
Birale. 

The languages of the Ethiopian Literacy Cam- 
paign were, in 1984, Afar, Amharic, Gedeo, Hadiyya, 
Kambaata, Kefa-Mocha, Kunama, Oromo, Saho, 
Sidamo, Silt'i, Somali, Tigre, Tigrinya and Wolaitta, 
and thus included languages from each of the three 
Afro-Asiatic families spoken in Ethiopia, plus one 
Nilo-Saharan language. 

In addition to Somali and its dialects, three 
minority languages are also spoken in the Somali 
Republic: Boni (which is closely related to Somali), 
Swahili and Mushungulu, both of which are Bantu 
languages — Swahili, of course, being widely spoken 
throughout East Africa and as a first language along 
the East African coast from Baraawe in Somalia to 
Mozambique. 

The languages of the Jibuti Republic are Afar 
and Somali (Af-Ciise dialect, belonging to the North- 
ern group), both of which are spoken in the neigh- 
bouring countries, Afar in Ethiopia to the north and 
Somali in Ethiopia and Somalia to the south and 
east. 


9.8.1 Cushitic languages 


The Cushitic languages stretch from north-eastern 
Sudan (Beja), through Eritrea, Ethiopia, Jibuti and 
Somalia to Kenya and isolated pockets in Tanzania 
(South Cushitic). The majority of Cushitic languages 
are spoken by only a few thousand or, in some
instances, only a few hundred people. On the other 
hand, two languages, Oromo and Somali, are spoken 
by several millions. Only Somali has the status of an 
official language, both in Somalia and (together with 
Afar) in the Republic of Jibuti. In Ethiopia, within 
whose frontiers the largest number of Cushitic lan- 
guages is spoken, several Cushitic languages received 
the status of national languages as part of the literacy 
campaign undertaken after the Ethiopian Revolution 
of 1974; these are Afar, Oromo, Sidamo and Somali. 
Today, in Ethiopia, several other Cushitic languages 
have the status of national languages. 

Typologically, most Cushitic languages have 
complex morphologies and (with the sole exception 
of Yaaku) are verb-final languages, typically with SOV 
(Subject-Object-Verb) sentence structure and other
features of such languages: modifiers preceding heads, 
subordinate clauses preceding the matrix clause, and 
suffixing rather than prefixing as the preferred mor- 
phological device (including postpositions or case 
suffixes rather than prepositions). In common with 
other Afro-Asiatic language families such as Semitic 
or Berber, most Cushitic languages also have complex 
verbal inflections indicating not only categories such 
as person, tense or mood, but also an extensive range 
of derivations (causative, passive, reflexive, etc.). 

It is probably true to say that the majority of 
Cushitic languages also have a simple tone-accent 
system, the best studied instance being that of Somali, 
where tone plays an essentially morphological role, 
distinguishing gender, for example, in such pairs as 
ínan ‘boy’ and inán ‘girl’, or case, as between libáax
(non-subject) and libaax (subject), ‘lion’.

Another feature typical of many Cushitic lan- 
guages is complex morpho-syntactic devices for 
marking focus within the sentence, involving not only 
morphological marking (e.g. by means of an enclitic) 
on the focalized noun phrase, but often also special 
verb forms when the focalized noun is also subject of 
the sentence. 

Somali has been written in Somalia with the 
Roman alphabet since the introduction of the national 
orthography in 1972. The letters q, x and c, and the
digraph dh are used to denote sounds not found in
European languages: IPA /q/, /ħ/, /ʕ/ and /ɖ/, respectively.
In the Jibuti Republic since 1976, Afar, too, has
been written with the Roman alphabet, but rather
confusingly using different letters for those sounds it
shares with Somali: x for /ɖ/, q for /ʕ/ and c for /ħ/.
Both languages indicate long vowels and consonants 
by simple doubling. Prior to the introduction of the 
Somali National Orthography, there were a number 
of attempts to introduce invented alphabets for 
Somali, most notably the ‘Osmaniya’ alphabet 
devised by Cismaan Kenadiid in 1922, which had 
some success including a fortnightly journal. Somali 
has also been written in the Arabic script. Within 
Ethiopia, however, Somali and the other Cushitic lan- 
guages of the literacy campaign were written in 
modified versions of the Ethiopian syllabary. Since 
the 1990s many Cushitic languages of Ethiopia, most 
notably Oromo, have been written in the Roman 
alphabet. 

The Cushitic family is generally divided into four 
main branches, as in Figure 9.7. There is some debate 
about the status of South Cushitic as an independent 
branch; according to some scholars, it may be better 
handled as a subdivision of East Cushitic. One lan- 
guage, for instance, generally classified as South Cush- 
itic, Dahalo, has recently been shown to belong more 
closely to the Dullay-Yaaku branch. 

Figure 9.8 shows the composition of the Central 
Cushitic or Agaw family. Figure 9.9 shows the com- 
position of the large East Cushitic family, with Figures 
9.10 and 9.11 illustrating the details of the Omo-Tana
and Highland East Cushitic (sub-)groups,
respectively. Saho-Afar, Omo-Tana and Oromoid are
collectively often referred to as Lowland East Cushitic. 
It is as yet uncertain how the Dullay and Yaaku 
groups fit into the overall East Cushitic picture.

Note on figures for numbers of speakers: Censuses, 
particularly those giving population figures for 
some of the smaller ethnic and linguistic groups, are 
notoriously unreliable in Ethiopia. Even for larger 
groups there is sometimes extreme variation in pub- 
lished figures. It also has to be borne in mind in some 
instances that expressed membership of an ethnic 
group does not necessarily imply that the language 
associated with that group forms the first language of 
all individuals who so identify themselves. It should 
also be remembered that some of the figures given 
below are estimates and not the result of formal 
censuses. 


9.8.1.1 North Cushitic 


Beja (own language name ti Bedaawye) is the sole 
constituent of the Northern branch of Cushitic, with 
about 150,000 speakers in Eritrea (2001); principal 
dialects Bani Saamir, Arteiga, Hadendiwa, Bishari. 
Amongst the Beja in northern Eritrea there is a high 
degree of polylingualism, most especially with Tigre 
(Ethiopian Semitic) and Arabic. The movement of 
Beja into Eritrea from the Sudan has been particularly 
marked over the last three or four generations. 


9.8.1.2 Central Cushitic (Agaw) 


Awngi (own ethnic name awiya, awé; own language
name awni) has about 357,000 speakers (1998 census); 
two dialects, slightly differing between the provinces 
of Agawmodor and Matakkal. Formerly also spoken 
further east in the province of Damot. Awngi, like 
other Ethiopian Agaw languages, is under pressure 
from Amharic. 

Bilin (own name balin/balen) has about 70,000 
speakers in Eritrea (1995). Two dialects are recog- 
nized: Senhit (Targe/Targeq’ar) and Halhal (Taq’ar). 
The Bilin have various traditions of having migrated 
in two distinct groups from further south: one group 
from Lasta in the tenth century AD and the other in
the thirteenth century AD. It is likely that most, if not
all, Bilin are at least bilingual, if not multilingual, in
Bilin, Tigre and Tigrinya. Around 75 per cent of the 
Bilin are Muslims and also speak Arabic, particularly 
in the town of Keren. 

Kemant (own ethnic name kamanta; language
name kamantanidy) is an ethnic group of about 
172,000 people, though few who identify themselves 
as Kemant still speak the Kemant language: only 
about 1,650 are active speakers. It is a dialect cluster 
with two principal varieties, C’olga and Karkar;
varieties probably now extinct included ‘Falasha’ 
(also called Kayla), Dambiya and Qwara. All Kemant 
speakers are probably bilingual with Amharic.

Khamtanga (own ethnic name xamat/xamat,
language name xamtaya) has about 143,000 speakers,
of whom about 94,000 are monolingual (1998); it is a
dialect cluster including what appears in the literature 
as Khamta, Chamir and Lasta-Agaw. Most if not all 
Khamtanga speakers are bilingual, speaking Tigrinya 
(in the north) or Amharic (in the south). 

Kunfal (own language name agéwr) has about 
2,000 speakers (2000). Very little is known about this 
variety of Agaw; it appears to be closely related to, if 
not actually a dialect of, Awngi. 


9.8.1.3 East Cushitic 


9.8.1.3.1 Afar-Saho 


Afar (own name ‘afdér — Qafar in orthography) 
together with Saho, forms a discrete branch of (Low- 
land) East Cushitic, with about 1,140,000 speakers 
(1998 census), 160,000 in Eritrea plus about 980,000 
in Ethiopia; since Saho and Afar properly form a 
dialect chain with varying degrees of mutual intelligi- 
bility between dialects, there is some discrepancy in 
the classification of dialects, but generally at least two 
varieties of Afar are recognized, a Northern and a 
Southern dialect, though this is almost certainly a 
simplification. The Arabic name for the Afar, Danakil, 
is sometimes found in the literature, whilst in Ethiopia 
the term Adal is frequently used. In Jibuti, Afar is 
written in the Roman alphabet, but using different
letters for /ʕ/, /ɖ/ and /ħ/ (q, x and c, respectively) from
Somali (c, dh and x, respectively).

Saho, together with Afar, forms a discrete branch 
of (Lowland) East Cushitic; it has 180,000 speakers in 
Eritrea (2001) and about 23,000 in Ethiopia (1998 
census), with dialects Sassaorta, Miniferi, Irob, Hadu, 
forming a dialect chain with Afar. 


9.8.1.3.2 Oromoid 


Konso-Gidole (dialect chain, own names Karatti, 
Diraytd, Moésiya), together with Oromo, forms a dis- 
crete branch of (Lowland) East Cushitic, with about 
150,000 speakers of Konso (afa Karatti), about 50,000 
of Gidole (Dirayta, Dirasta) and 6,500 of Bussa 
(Mosiya) (all 1998 census); the numbers of speakers 
of other dialects are not known. From north to south 
the dialects are Bussa (Mésiya), Gidole (Dirayta, etc.), 
Mashile, Gato, Turo (Duro), and Konso (afa Karatti). 
The extremes of the chain — for example, Gidole 
(Dirayta) and Konso — are probably not mutually 
intelligible. 

Oromo (own name oromoo; language name afaan 
oromoo — or as a tribal-specific name such as afaan 
arsi, afaan boraana, etc.), together with the Konso- 
Gidole (Konsoid) dialect cluster, forms a discrete 
branch of (Lowland) East Cushitic. Estimates of
numbers of speakers vary widely, from 8 million
(1976) to 30 million (2001); the Kenya Oromo were registered
as numbering 70,869 in 1969. A realistic figure for the
total number of speakers for all varieties is probably
about 20 million. There are several dialects forming a
chain, with only a limited degree of mutual intelligi- 
bility between the extremes of the chain: (from north 
to south) Raya, Wollo (both of which may belong to 
the Central or Shoan dialect group), Central dialects 
(Shoa-Tulama; possibly also the dialects of Arussi 
belong here), Western dialects (Wellegga-Mac¢’a), 
Eastern dialects (Harar, Qottu) and Southern dialects 
(Boraan, Gabra and Kenyan Oromo-Orma, Waata). 
The earlier (Ethiopian) name for the Oromo, Galla, is 
now considered offensive and was officially replaced 
by Oromo after the 1974 revolution. Oromo, one of 
the national languages of Ethiopia, is now written there
in Roman script and is used in radio and television
broadcasts.


9.8.1.3.3 Omo-Tana 


Arbore (own (moiety) names Marlé, Gondorébba) is, 
along with Elmolo and Dasenech, a member of the 
Western Omo-Tana subgroup of (Lowland) East 
Cushitic languages, with about 4,400 speakers (1998 
census). All speakers are bilingual with Boraan- 
Oromo. The name Arbore (var. Ibore) is said to be of 
Boraan origin. 

Bayso is generally classified as a discrete sub- 
branch of Omo-Tana (Lowland East Cushitic); it has 
about 1,000 speakers (1995) on Gidicho Island and 
the western shore of Lake Abaya. Many speakers are 
bilingual with (especially) Wolayta (Omotic). 

Boni (Af-Boon) is a non-Somali Cushitic lan- 
guage on the verge of extinction, as reported by 
Lamberti (1986). The name Boni (Somali Boon, 
Swahili wa-Boni) is also applied to another Omo- 
Tana-speaking population in Kenya whose language 
appears to be different from that described by 
Lamberti as Af-Boon. Indeed, the term would seem to 
have a rather vague application and properly speaking 
is not an ethnonym. The Kenyan Boni language 
is most closely related to Somali and is spoken by 
some 4,000 people. It exhibits at least four dialect 
divisions: Bireer*?, Safarée?, Kidd gur, Kac'?. Most 
speakers are bilingual either with Orma-Oromo or 
Somali. 

Dasenech (own name Déddsanac) is, along with 
Arbore and Elmolo, a member of the Western Omo- 
Tana subgroup of (Lowland) East Cushitic languages, 
with 32,000 speakers (1998 census). It is also spoken 
in adjacent areas of Kenya and the Sudan; also called 
Galab in Ethiopia. 

Somali (own name Soomaali; language af 
Soomaali) is the most important member of the 
Eastern sub-branch of the Omo-Tana group of 
(Lowland) East Cushitic languages; it has about 12.5 
million speakers (2006). There are five or six main 
dialect groups with various subdivisions, and the 
standardized language, which is generally understood 
and forms the official language of the Somali Repub- 
lic, is developed from the most widespread of these; 
namely, the Northern dialect cluster (af-Ciise, also 
spoken in Jibuti, af-Geedabuursi and af-Isaaq), to 
which the Daarood group of dialects (af-Majeerteen, 
af-Warsangeli, af-Dolbohaante) also belongs. The
Northern (Daarood group) dialects are also spoken in 
Ethiopia (af-Ogaadeen, af-Degodiya) and in the south 
of Somalia to the west of the Lower Juba river. The 
other dialect groups are the Benaadir group, the 
Ashraaf group, the May dialects and the Digil dialects. 
Af-Jiddu may belong to this last group, or may be 
classified as a separate branch. The Ashraaf and May 
groups in particular show considerable divergence 
from the Northern and thus the standard dialect. 
Since 1972 the Somali language in Somalia has been 
written in the Roman alphabet and has been the sub- 
ject of an extensive literacy drive. In Ethiopia, Somali 
is also recognized as one of the national languages 
and was formerly written there in the Ethiopian
syllabary. 

The other Omo-Tana languages, Rendille and 
Elmolo, are spoken entirely in Kenya. 


9.8.1.3.4 Highland East Cushitic 


Burji (own name Daase) is the southernmost and 
most divergent of the Highland East Cushitic lan- 
guages; it has some 36,000 speakers (1998 census), 
including about 7,000 who have migrated to northern 
Kenya, especially around the town of Marsabit. 
Within the original area of settlement in the Amarro 
mountains, two dialect areas have been identified. 

Gedeo (own name Gedero), a discrete branch of 
Highland East Cushitic, has 637,000 speakers (1998 
census); it is also known as Derasa (Darasa). 

Hadiyya is the second largest of the Highland 
East Cushitic group of languages, with some 924,000 
speakers (1998 census). There are various dialects
of Hadiyya: Leemo, Sagago, Soro. Libido, which is
sometimes regarded as a dialect of Hadiyya, is prob- 
ably best regarded as a closely related, but separate 
language. The Hadiyya have spread in recent years to 
other parts of Ethiopia, most notably Bale and Arussi 
provinces. These groups are reported, however, to 
have shifted to the surrounding Oromo language. 

Kambaata, together with Alaba and Timbaro, 
forms a dialect cluster within Highland East Cushitic, 
with some 606,000 speakers, including 345,797 mono- 
linguals: 278,567 Kambaata, 51,541 Timbaro and 
15,689 Alaba (1998 census). The Timbaro dialect area 
is also known in the literature as Qabena. 

Sidamo (own name Sidaama) is perhaps the 
largest of the Highland East Cushitic group of 
languages, with almost 1.9 million speakers (1998 
census). In the past, the name Sidama (as against 
Sidamo) was used to denote what is now known as 
the Omotic family of languages. Sidaamo is also the 
name of the eponymous ancestor of the Sidamo. 


9.8.1.3.5 Dullay 


This is a dialect cluster forming a separate branch of 
(Lowland) East Cushitic and including such lan- 
guages or dialects as Dobase, Gawwada, Gollango, 
Harso and Tsamako. The total number of Dullay 
speakers is about 41,000 (1998). This group is also 
known in the literature as Werize. 


9.8.1.3.6 Yaaku and Dahalo 


Yaaku (also known as Mogododo) is spoken in Kenya, 
as is Dahalo, which used to be classified as a Southern 
Cushitic language but is now thought to form a 
distinct branch more closely allied to both Dullay and 
Yaaku. 


9.8.1.3.7 Southern Cushitic 


These languages (Iraqw, Ma’a, Alagwa, Gorowa, Asa, 
Burunge and Kw’adza) are all spoken in Tanzania. 


9.8.2 Ethiopian Semitic languages 


The Semitic languages of Ethiopia and Eritrea form a 
distinct group within the Semitic family, the original 
homeland of which, it is generally accepted, included 
the Arabian peninsula and much of the Fertile Cres- 
cent of Western Asia. Semitic speech was introduced 
into Ethiopia from southern Arabia, probably some time
during the first millennium BC, and indeed
the closest relatives of Ethiopian Semitic are the lan- 
guages of pre-Islamic southern Arabia, known col- 
lectively as Epigraphic South Arabian, as well as the 
surviving small, non-Arabic languages of Dhofar in 
Yemen and the island of Socotra (Mehri, Harsusi,
Sheri/Jibbali and Socotri), with which they share a
number of distinguishing features, such as glottalized
consonants (t’, k’, s’, etc., corresponding to the
emphatics of Arabic), and the presence of two prefix 
conjugations, one imperfective and the other modal: 
Ge'ez yaqibbar: yaqbar; Socotri iqdber: |-iqbér.

Of the Ethiopian Semitic languages, only 
Amharic (the official language of Ethiopia) and 
Tigrinya have more than a million speakers. Both 
are written in slightly modified expansions of the
Ethiopic syllabary used for writing Ge'ez (Ethiopic),
which was derived from a localized form of the
South Arabian script during the fourth century AD.
Within Ethiopia, further modifications of the 
Ethiopic syllabary have also been used for writing 
other languages, Semitic (e.g. Chaha), Cushitic (e.g. 
Oromo) and Omotic (e.g. Wolaitta). 

Ge'ez probably ceased to be spoken before the
end of the first millennium AD, but was retained as
the normal written language until the second half of 
the nineteenth century, when Amharic began to take 
over this role. Ge'ez is still the language of traditional 
learning and is much used in the liturgy of the 
Ethiopian Orthodox Church. The earliest surviving 
examples of written Amharic may date as far back as 
the fourteenth century, and there are also examples
extant from the seventeenth and eighteenth centuries,
but not until the nineteenth century did Amharic 
become the regular written language for secular 
affairs. Amharic now has one of the largest vernacular 
written literatures in Sub-Saharan Africa. Again it is 
not until the nineteenth century that any substantial 
evidence for written Tigrinya is found, though there 
are a few earlier indications. Tigre, too, possesses a 
small written literature. The dominant role of 
Amharic in much of the twentieth century, as the 
official language of Ethiopia and the language of 
the ruling group, both under the imperial and the 
Marxist regimes, necessarily eclipsed the develop- 
ment of other written languages in the country, 
though efforts were made under the literacy 
campaigns of the later 1970s and the 1980s to 
expand literacy to other languages than Amharic and 
Tigrinya. Outside the sphere of Highland Christian 
culture, there are examples of Harari written in 
Arabic script dating from the sixteenth century and 
perhaps earlier. 

Typologically, the modern Ethiopic Semitic 
languages have many features in common which 
set them apart from other Semitic languages. For 
example, all modern Ethiopian Semitic languages 
make use of the verb meaning ‘to say’ in verbal 
expressions of the type Amharic zamm ala ‘be quiet’, 
diiss ali ‘be happy’, and so on. Considerable use of 
auxiliaries is also made in the development of the 
tense system in the verb. For the person familiar with 
other Semitic languages, however, the most notable 
differences are all in the field of syntax, where the 
normal sentence pattern is SOV, and qualifiers 
(including qualifying clauses) precede the head. The 
SOV pattern is assumed to have developed under the 
later influence of the indigenous Cushitic languages. 
Indeed, as might be expected if this is so, the syntax of 
Ge'ez, representing an older stage of Ethiopian
Semitic, shows much less such influence and pre- 
serves older common Semitic patterns. However, a 
few syntactic features remain in the modern lan- 
guages, too, that are not typical of the SOV type, such 
as prepositions. 

The Ethiopian Semitic (or Ethio-Semitic) lan- 
guages are divided into two geographically defined 
groups, North Ethiopic, containing Tigre and 
Tigrinya as well as Ge'ez, and South Ethiopic, incor- 
porating the remaining twenty or so languages and 
dialects. It is now generally agreed, therefore, that 
Ge'ez is not the immediate ancestor of all Ethiopian 
Semitic languages, but represents only one of possibly 
a number of closely related Semitic dialects that must 
have been spoken in Ethiopia around the beginning 
of the Common Era. Figure 9.12 shows the sub- 
divisions of Ethiopian Semitic. 


9.8.2.1 North Ethiopic 


Tigre (own ethnic name tagre), together with Tigrinya 
and the extinct Ge'ez, forms the North Ethiopic
branch of Ethiopian Semitic; it has about 800,000 
speakers (1997). The main dialects are Mansa‘, Habab 
and Bogos; there is also some bilingualism with 
Cushitic Beja and Bilin. It is with some justification 
that Tigre may be regarded as the most archaic of the 
modern Ethiopian Semitic languages. 

Tigrinya (own ethnic name also tagre; language 
name tagranna or tagray) has about 4.5 million 
speakers (1998 census), mostly in Ethiopia, with 
1,200,000 in Eritrea (2001). In the present century the 
influence of Amharic on Tigrinya has been wide- 
ranging, particularly in the area of vocabulary, 
though it remains to be seen whether the recent and 
increasing autonomy of Tigrinya will reverse this 
situation. 


9.8.2.2 South Ethiopic 


9.8.2.2.1 Transversal South Ethiopic 


Amharic (own name amaraya, earlier amharamnna) 
has 17.4 million speakers and is spoken by many 
more throughout Ethiopia as a second language.

Figure 9.12 Ethiopian Semitic
Notes: 

* East Gurage: Zway, Silt'i, Innaqor, Walane 
+ Northern Gurage: Soddo, Gogot, Muxir 


There are several dialects of Amharic (Gondar- 
Bagemder, Gojjam, Willo, Manz, etc.), none of which 
differs very markedly from any other, but the use of 
Amharic first as the language of the court (ydnagus 
q’angq"a) and now as the official language of Ethiopia 
has promoted a kind of standardization at least in 
the written language, based on the language of Addis 
Ababa. The role of Amharic as the dominant 
language has also meant that, in its written form at 
least, it has adopted many items of Ge'ez vocabulary. 
Indeed, in the expansion of the Amharic lexicon since 
the 1974 revolution Ge'ez has provided a ready source 
for new vocabulary and lexical patterns. 

Argobba has about 11,000 speakers (1998 census). 
A northern variety is still spoken near the town 
of Ankobar, but a southern variety spoken on the 
outskirts of Harar appears now to be extinct. It is very 
close to Amharic, and as a degree of mutual intelligibility
has been claimed, it is probably best
regarded as a divergent dialect of Amharic.

Harari (own name gésindn ‘the tongue of the 
city’) has 21,000 speakers (1998 census). Harari is 
spoken only in the town of Harar, is surrounded on 
all sides by two Cushitic languages, Oromo and 
Somali, and — like many other minority languages in 
Ethiopia — is subject to the influence of Amharic. 

Note on the term ‘Gurage’: The name ‘Gurage’ 
has traditionally been applied to a number of peoples 
and their languages to be found clustered together 
along the Rift Valley, south-west of Addis Ababa. Lin- 
guistic research (Hetzron 1972), however, has shown 
that the languages so designated belong to two differ- 
ent groups: East Gurage, which is more closely 
related to Amharic-Argobba and Harari, and North- 
ern Gurage, which — together with Western Gurage 
and the now probably extinct Gafat — forms a distinct 
branch of South Ethiopic. The supposed origin of a 
single ‘Gurage’ ethnos in an ancient Tigrean military 
colony is quite spurious and can only be dated to the 
writings of the traditional Ethiopian scholar Alagqa 
Tayyd. The term ‘Gunnan-Gurage’ has been coined
by Hetzron to describe the Northern + Western
Gurage grouping; that is, Outer South Ethiopic minus
Gafat. The 1998 census gives a total of about 1.9 million
speakers of all varieties subsumed under the old term
‘Gurage’.

East Gurage consists of Innaqor, probably best
regarded as a variant of Silt’i; Silt’i itself (own name 
salt’e), with 827,764 speakers (1998 census), which 
includes the variant called Ulbarag or Urbarag; 
Walane; and Zway (own ethnic name zay), with 4,880 
speakers (1994), spoken on the islands in Lake Zway. 


9.8.2.2.2 Outer South Ethiopic 


Gafat is probably now extinct and replaced by 
Amharic. The most extensive documentation of Gafat 
comes from a manuscript translation of the Song of 
Solomon made for the eighteenth-century traveller 
James Bruce. Apart from Gafat, Outer South Ethiopic 
consists of two branches, Northern Gurage and 
Western Gurage. 

Northern Gurage includes three languages: 
Gogot, Muxir and Soddo (own name kastaninna), 
also called Aymiillél in the literature. 

Western Gurage consists of the following languages:
Chaha (own name céxa), the most important
‘Gurage’ language both in terms of number of speakers
and because of the activities of the Catholic mission
at Endeber and, more recently, the status of Chaha as
one of the languages of the literacy campaign; Enar;
Ezha (own name dza); Gumar (own name gomara);
Gura; Gyeto (own name gyeta); Indagaf (own name
andigand); Innamor (own name inor), with 280,000
speakers; and Masqin, with 25,000 speakers (2002).


9.8.3 Omotic languages 


It is only just over thirty years ago that linguists 
identified the Omotic family as a separate branch of
the Afro-Asiatic phylum. Previously the languages
now classified as Omotic had been grouped within 
the Cushitic family under the heading of ‘Western
Cushitic’ or, more confusingly, ‘Sidama’. Others had 
placed what are now known as the Southern Omotic 
languages outside Afro-Asiatic altogether and had 
categorized them as ‘Nilotic’. These problems of clas- 
sification were essentially due to the scant informa- 
tion possessed by scholars on most of the Omotic 
languages, and even today the majority of Omotic 
languages are far from adequately recorded. Another 
factor in the problem of Omotic classification is that 
the better-known languages, such as Wolaitta or Kefa, 
present many similarities — especially in vocabulary — 
with adjacent Cushitic languages such as Highland 
East Cushitic and Konso, in particular. This is doubt- 
less due to a long period of contact and interaction, 
which is also reflected in the similarity of many other 
cultural traits. At the other end of the geographical 
distribution of Omotic, the Mao languages (Hozo- 
Sezo, Mao of Didessa and Mao of Bambeshi) show 
the influence of neighbouring Nilo-Saharan Komuz. 
The details of internal Omotic classification are 
still in the process of being worked out, though the 
generally held picture is as represented in Figure 9.13. 
Aside from the activities of missionaries, none 
of the Omotic languages was written until Wolaitta — 
in terms of numbers of speakers indubitably the 
most important member of the family — was adopted 
as one of the languages of the Ethiopian literacy 
campaign after the 1974 revolution. Wolaitta is now 
written in an adapted form of the Ethiopic syllabary. 
Typologically, there is some degree of difference 
amongst the various branches of Omotic, though 
most have general features that can be categorized as 
typical of the Ethiopian language area and which can 
be found both in Cushitic and Ethiopian Semitic. 
Such features are the presence of glottalized phon- 
emes, a dominant SOV word order, modifiers preced- 
ing the head, focus-marking systems, inflectional 
systems in verbs marking person, tense, and so on, 
and derived stems. The Southern Omotic languages 
stand somewhat apart here and are less typically 
‘Ethiopian’. Their relationship with North Omotic, 
however, can be demonstrated by a small but signifi- 
cant number of shared morphemes. Many Omotic 
languages also have tonal systems, some best des- 
cribed as tonal-accent systems as in a number of 
Cushitic languages, whilst others like Gimira (Bench) 
have what can be more accurately described as a true 
tonal system with many contrasts (up to five in 
Bench): kar¹ ‘male genitalia’, kar² ‘mud-wasp’, kar²³
‘game played with stones’, kar³ ‘circle’, kar⁴ ‘broad
leaf’, kar⁵ ‘clear’.
The scientific study of Omotic is still in its
infancy and new data continue to be collected. Many
Omotic languages are poorly recorded, often as no
more than short word lists. Even the names of the
various languages and dialects continue to be corrected
and revised, and some confusion still exists in
this area. The following list can only therefore be a
summary of the state of the art.


Figure 9.13 Omotic

Notes:
2. South Ometo: Zayse-Zergula, Koyra, Gidicho, Kachama
3. Gonga: Kefa, Mocha, Anfillo, Bworo
4. Dizoid: Dizi, Sheko, Nao
5. Mao: Hozo-Sezo, Mao of Bambeshi, Mao of Didessa, Ganza
6. Aroid: Aari, Hamer, Banna, Karo, Dime


9.8.3.1 Ometo 


The Ometo group consists of a number of closely 
related dialects and languages forming a continuum 
and constituting the largest division of Omotic, in 
terms both of numbers of languages and numbers of 
speakers. Thus, the North Ometo group probably 
contains over 3 million speakers, over a million of 
them accounted for by Wolaitta alone. Within Ometo,
three major divisions have been identified. The first is
North Ometo, which is essentially a huge dialect cluster
intergrading from Gofa and Malo in the west through
the varieties of Gamo in the east, and north to
Wolaitta and its associated dialects; the Kullo-Konta
cluster, spoken across the Omo River, also belongs
here. The second major division of Ometo is South
Ometo, whose members are all spoken along the eastern
interface between Omotic and Cushitic in the Rift
Valley; it includes a number of distinct languages, such
as Zayse, Kachama and Gidicho. The third member of
the Ometo group is the single language Chara.


9.8.3.1.1 North Ometo 


This subgroup consists of Basketo (dialects Doko,
Dolo), which has some 58,000 speakers (1998 census);
Dorze, with 21,000 speakers (1998 census); Gamo
(own language name gemo dona), which together with
Gofa and Dawro forms a dialect cluster with 1.2 million
speakers (1998 census); Male (the preferred
name for Malo), which has some 54,000 speakers
(1998 census); Oyda, one of the least-known Ethiopian
languages, with perhaps 16,500 speakers (1976);
and Wolaitta (own name welaitta; also known
as Wolamo/Welamo in the literature), which has 1.2
million speakers (1998 census). Kullo and Konta,
formerly taken to be separate languages, are now
included as dialects within the Gamo-Gofa-Dawro
cluster.


9.8.3.1.2 South Ometo 


This consists of four languages: Kachama (own
ethnic name gat'ame; also known as Haruro), with 
about 4,000 speakers (1998 census) on the southern 
island of Lake Abaya; Koyra (own name koore nuuna; 
also known as Koorete, Kore, Kwera, Amarro), with 
some 104,000 speakers (1998 census); Gidicho (500 
speakers? on the northern island of Lake Abaya), 
which is best classified as a dialect of Koyra; and 
Zayse-Zergula (dialect cluster), with 18,000 speakers 
(1998 census). 


9.8.3.1.3 Chara 


The other branch of Ometo, Chara (its own name is 
C'ara), has about 7,000 speakers (1998 census). 


9.8.3.2 Other North Omotic languages 


9.8.3.2.1 Gimira (Bench-She) 


This is a dialect cluster with two principal foci: Bench 
(own language name bend non ‘Bench mouth’) and
She (own regional name siiz dod ‘She country’). The 
total number of speakers for the entire cluster has 
been given as about 174,000 (1998 census). The name 
Gimira derives from the Amharic and Kefa name for 
the people. 


9.8.3.2.2 Yemsa 


Yemsa (own name yemsa) is also called Yem, and 
Janjero, Jinjero or Zinjero — from the Amharic name — 
in the literature. The 1998 census showed 165,200 ‘in 
the ethnic group’ and 81,600 speakers of Yemsa, 
which is a single language forming a distinct branch 
of North Omotic. 


9.8.3.2.3 Gonga (Kefoid)


The Gonga languages (also sometimes called Kefoid 
after Kefa, the largest language of the group) con- 
stitute a cluster of fairly closely related languages and 
dialects. One, Anfillo — also called (Southern) Mao, 
not to be confused with the Mao that forms a 
separate branch of North Omotic — was thought to 
have perhaps 500 speakers (1990). 

Bworo (own name bor6; also called Shinasha) 
is properly a group of closely related but scattered 
dialects, of which Bworo (bor6) is the self-designation 
of only one. Other members of the group, which has 
also been called North Gonga, are Guba, Naga and 
Amuru, none of which varieties is more than fleet- 
ingly documented. Confusion also arises in the earlier 
literature from the fact that the term ‘Shinasha’ seems 
to have been applied to other groups, some of whom 
are not even Omotic-speaking. The total number of 
speakers was 19,878 in the 1998 census. 

Kefa (own name kafaa), also called Kafa, together 
with Mocha (moéa), forms a dialect cluster with 
around 570,000 speakers, of which 445,000 are mono- 
lingual (1998 census). Kefa itself consists of a number 
of slightly varying dialects or ‘argots’ associated with 
the various caste groups of traditional Kefa society, 
which are distinguished principally by vocabulary, 
but with some minor differences in phonetic realiz- 
ation and in morphology. As well as Mocha, another 
dialect, now recently extinct, close to Kefa was Garo 
(also called Bosha). 


9.8.3.2.4 Dizoid 


The three languages in this group are Dizi (own 
ethnic name dizi, language name dizi nuu), also 
referred to as Maji after the name of the largest town 
in the region, with 21,000 speakers (1998 census); 
Nao (own ethnic name naro), now also called Nayi,
with 3,656 speakers (1998 census); and Sheko
(other names are Dorsha, Dorsa, Bulla), with about 
24,000 speakers, 13,600 of them monolingual (1998 
census). 


9.8.3.2.5 Mao 


Mao is perhaps the least known of the Omotic 
groups, and such work as has been done has until 
recently been beset by confusion of terms, as the 
name ‘Mao’ is also applied to other remnant peoples 
of western and south-western Ethiopia. The classifica- 
tion of Mao as Omotic or not has also been a concern 
of recent years, the evidence now pointing to the 
present classification. The name Mao is taken from 
the word for ‘man, person’ (Hozo moo, Sezo maay,
etc.). The languages grouped under the heading
Mao are Hozo (own language name hooz wandi
‘Hozo mouth’) and Sezo (own name seezi wani, ‘Sezo
mouth’), each with about 3,000 speakers (1995); 
Bambassi (own language name mao kole) — 5,000 
speakers (1992) — one dialect of which, Didessa, is 
sometimes listed as a separate language; and Ganza, 
one of the least-known Ethiopian languages, which 
has 5,400 speakers (2004). 


9.8.3.3 South Omotic (Aroid)


Aari (own name fraari; also known as Ari, Are, as 
well as by various tribal names such as Galila, 
Bako, Ubamer, etc.) has about 59,000 speakers (1998 
census). There is some dialectal differentiation across 
the region covered by Aari. 

Banna (own name bana apo ‘Banna mouth’) and 
Hamer (own name hamar apo ‘Hamer mouth’) are
now considered as dialects of a single language, with 
about 43,000 speakers (1998). Karo, perhaps best 
regarded as a dialect of Hamer, has 200 speakers 
(1998 census). Dime (own name dim rap ‘Dime 
mouth’) had 6,500 speakers in 1998. 


9.8.4 Nilo-Saharan languages 


For a full discussion on the classification of Nilo- 
Saharan, see Section 9.7. 


The Nilo-Saharan languages spoken within 
Ethiopia and Eritrea belong to at least four distinct 
families of Nilo-Saharan and a total of seven distinct 
groups within those families, together with one lan- 
guage, Mikeyir, whose adherence to the Nilo-Saharan 
phylum is not in doubt, but whose precise affiliation 
is as yet unclear. As with many of the minority lan- 
guages of Ethiopia, the available data here, too, are 
often rather scant, sometimes consisting of no more 
than a list of words. The classifications made by 
scholars from such data must therefore be of necessity 
regarded as provisional and open to re-assessment 
when more data become available. 

The Nilo-Saharan families represented in 
Ethiopia and Eritrea are (1) Nilotic — divided into 
four subfamilies, Northern Nilotic, Central Nilotic, 
Surma and Nara; (2) Komuz; (3) Berta; and (4) 
Kunama. Of these Nara, Berta and Kunama consist of 
one language each. 


9.8.4.1 Nilotic 


9.8.4.1.1 Northern Nilotic 


Also called Western Nilotic in the literature, this 
includes Anywa, Mabaan and Nuer. Anywa (own lan- 
guage name d’a anwa, ‘mouth of Anywa’, also called 
Anuak, Anwak, etc.) has about 45,600 speakers in 
Ethiopia plus a further 14,700 in the Sudan. Shilluk 
(do fools ‘mouth of Shilluk’) is probably mutually 
intelligible with Anywa. Nuer (own language name 
tog nddd ‘mouth of Nuer’) has 65,000 speakers in 
Ethiopia (1998 census). 


9.8.4.1.2 Central Nilotic 


This is also called Eastern Nilotic in the literature. 

Turkana (own name niturkwana, and in Ethiopia 
panatoom, hence the variant name Nyangatom also 
found in the literature — the Ethiopian Nyangatom 
are an offshoot of the Toposa tribe) belongs to 
Central Nilotic. The majority of the Turkana, who are 
nomadic pastoralists, are to be found in Kenya. 
Turkana is one of the better-known Nilotic languages, 
thanks to a number of recent studies. The total num- 
ber of Turkana speakers has been estimated as 
250,000, of which 14,000 were recorded in Ethiopia 
in the 1998 census. Turkana is a member of the 
Teso-Turkana cluster of Central Nilotic languages, 
the other members of which are spread across the 
southern Sudan, Kenya, north-eastern Uganda and 
northern Tanzania. 


9.8.4.1.3 Surma 


Surma includes several little-understood languages. 
Majang (own language name ato majay or ato 
majayeropk ‘mouth of Majang’, also called 
Masongo) has some 15,000 speakers (1998 census). 
The little-known language Mikeyir (more properly 
Shabo) used to be classified with Majang, but is 
now thought to be a mixed language with both 
Majang and Omotic input. Me’en (own language 
name mersnen, ethnic name meren, as well as surma; 
also called Megan, Mekan or Bodi, the latter being 
the name of one of the Me’en tribes) has some 
56,000 speakers (1998 census). Dialects include 
Bodi. Murle (own name murlén; also called Olam — 
not to be confused with Zilmamu-Olam) has 200 
speakers in Ethiopia (1975), but 60,000 in the 
Sudan (1982). 

Mursi (own name mun), with about 3,000 
speakers (1998 census), is part of the Suri cluster 
(including Mursi, Tirma and Chai), with about 
27,000 speakers in Ethiopia (1998 census) and several 
thousand more in the Sudan. The own name of Tirma
is surma (hence the name of the whole group); the
name Tirma is properly a place name, from which
derives the Mursi name tirmaga.

Zilmamu-Olam (the Zilmamus’ own name is 
maji ‘person’; also known as Bale, Tsilmamo, Mashi; 
the Olam call themselves alam) is now better known
as Kacipo-Balesi, of which Zilmamu is one dialect.
With about 4,000 speakers (1998 census), it is another 
of the least-known languages of Ethiopia. 


9.8.4.1.4 Nara 


Nara (own ethnic name nara, language name nara
bana, ‘Nara talk’, the same as Nera; also called Barea
or Barya from the Amharic term for ‘slave’) is
classified as a separate branch of the Nilotic family and
has about 80,000 speakers (2001). The main dialects are
Higir and Mogareb, with two smaller sections, Koyta 
and Santora. 


9.8.4.2 Komuz 


The Komuz family consists of two branches, 
Gumuz and Koman (of which one member, Komo, 
is spoken in Ethiopia). Gumuz is classed as a dia- 
lect cluster, while Koman consists perhaps of six 
branches. 

Gumuz (own language name ingofi baxa, ‘lan- 
guage of people’) is spoken by the people of that 
name, also called Beni Shangul, though this name 
more properly applies to one of the Berta groups. 
Other names appearing in the literature are 
Gumis and Ganza, though again the latter is also 
applied to another people, the Omotic-speaking 
Hozo-Sezo. There are just over 120,000 speakers in 
Ethiopia (1998 census), plus a further 40,000 in the 
Sudan. 

Komo (own language name fa komo, ethnic 
name go-kwom) has perhaps 1,500 speakers in Ethio- 
pia with a further 10,000 in the Sudan. 


9.8.4.3 Berta 


This forms a discrete branch of the Nilo-Saharan 
phylum. The Berta (own name bara, language name 
ndu baru) are also known in the literature by the 
terms Beni Shangul, Shangilla, Gamila, Watawit, 
Gebelawin and others. There are several distinct dia- 
lects (Undu, Mayu, Fadashi, Gebeto), often grouped 
into two as Sudanese and Ethiopian Berta, though the 
Berta themselves regard their language as a single 
unit. The number of speakers is 125,000 in Ethiopia, 
with a further 22,000 in the Sudan. 


9.8.4.4 Kunama 


This also forms a discrete branch of the Nilo-Saharan 
phylum. The Kunama are known by various names 
(Baza or Bazen in Arabic, and Barya in Amharic — as 
are the Nara), their self-name being kunaama. There 
are numerous dialects, including Marda, Barka,
Aymasa, Tika and Ilit. The Marda dialect has attained
a certain degree of prestige as it is the dialect of the 
high chiefs of the Kunama and was also used by the 
Swedish Mission in its Bible translations and other 
publications. Since 1984, Kunama has also been one 
of the languages of literacy in Ethiopia, the only Nilo- 
Saharan language to achieve this status; now, along 
with Nara, it is a national language of Eritrea. Some 
groups (especially the Bitaama) also speak Tigre. In 
2001 the official population figure for the Kunama 
was 107,000. 


9.8.5 Bantu languages 


Swahili and Mushungulu are both Bantu languages 
(see Section 10.2.1). Mushungulu (also called 
Kimushungulu) had between 20,000 and 50,000 
speakers in 1992. Part of the Niger-Congo phylum, 
Bantu family, North-East Coast branch, it is distinct 
from Swahili and has close relatives in Tanzania (e.g., 
Shambala/Shambaa). Most speakers are probably 
bilingual with Somali (May variety or dialect). 


9.8.6 Birale (unclassified) 


The existence of this language, the classification of 
which is still under discussion, was only confirmed in 
the 1980s. First reported in 1945, it was not until 1981 
that the first few pieces of data were collected and 
subsequently published in 1983. More recently (since 
2000), an extensive word list and a small number of 
studies have been published. These would seem to 
suggest that Birale (own name jongotd), previously 
unclassified, may perhaps be an Afro-Asiatic language.
However, classification is made difficult by the
fact that the Birale lexicon contains borrowings
from all adjacent languages, especially from East
Cushitic Tsamako (Dullay group), amongst whom the 
Birale live and by whom they are now rapidly being 
absorbed. Its morphology, however, appears to be 
quite different. The number of speakers of Birale was
recorded in 2001 as 19.


Map 72 SUDAN and the HORN of AFRICA


10.1 Introduction


Sub-Saharan Africa is the world’s most complex 
linguistic area. The complexity does not seem to be 
different in kind from that occurring in other areas of 
the world, but is primarily a consequence of the size 
of the area over which great linguistic and social 
diversity extends. There are few, if any, phenomena 
contributing to linguistic complexity elsewhere in the 
world that are not also exhibited somewhere in 
sub-Saharan Africa. These include large areas of con- 
tinuous diversity among relatively similar languages 
(language continua and dialect chains), areas in 
which very different languages occur in close proxim- 
ity to each other, a high degree of multilingualism in 
many areas, the continuing spread of some languages 
at the expense of others (lingua francas and language 
shift), and populations with very different lifestyles, 
for example, both sedentary (farming, merchant) and 
migratory (hunter-gatherer, herder) populations. 

As a result, there are numerous problems that 
should be borne in mind when using the information 
presented in this part of the atlas. Foremost is that, 
though there is continual progress in research on the 
language situation in sub-Saharan Africa, resources
for research have not yet been equal to the enormous
task of uncovering all the languages of the region,
not to mention describing and classifying
them, or of keeping track of the size of the popula- 
tions speaking many of those languages that are 
already known. In addition, given the problems of 
managing the great amount of information already 
available, some errors are inevitable. Naturally, care 
has been taken to minimize these errors. 

Some discussion is reserved for the areas 
represented on individual maps. However, general 
problems are sufficiently widespread to warrant 
introductory discussion. They include: 


1 accurate location of languages; 

2 recognizing the same language with different 
names; 

3 recognizing different languages with the same 
name; 

4 multilingualism and language shift; 

5 language and dialect; 

6 classification of languages. 


These problems — discussed in Sections 10.1.1 to 
10.1.6 — interact with each other, so that discussion 
of one problem often requires cross-reference to 
discussion of another problem. 


10.1.1 Accurate location of languages 


The problem of accurately locating languages is pri- 
marily a boundary problem. Geographical boundaries 
are mapping conventions used to indicate the limits 
of a majority or concentration of speakers of a 
particular language. It should be understood that 
languages usually extend beyond the boundaries on a 
given map, so that overlap is usual around the 
boundaries of adjacent sedentary groups, even at the 
village level. Overlap is also common among non- 
adjacent groups, especially in relatively urbanized 
areas. For example, relatively large groups of Hausa
speakers are located in some cities of Chad and even
Equatorial Guinea, beyond the areas in which they are 
represented on the maps. An unusual case of extreme 
extension is the Ful(ani)-speaking population. This 
group, originating in Senegal, has spread over several 
centuries across northern West Africa into the Sudan, 
and forms sizeable minorities, even a majority, in 
some areas far from Senegal, particularly in northern 
Nigeria and Cameroon. 

With seasonally or generally migratory groups, 
map boundaries generally indicate the limits of their 
migratory patterns. This applies to many of the 
remaining Khoisan populations of southern Africa 
(Namibia, Botswana and Angola), to some of the
Saharan populations, for example the Teda and Daza, 
and, to some extent, to some of the Nilotic popula- 
tions of Kenya, Sudan and Ethiopia. In the case of 
some populations, such as the Maasai of Kenya, 
reserves have been established in which they can 
maintain their traditional herding lifestyle. Of course, 
Maasai may migrate to other areas, but they are then 
constrained to assimilate to more sedentary lifestyles. 
In sum, the relative sizes of the areas represented for 
any two languages cannot be taken as an indication of 
the relative sizes of the populations speaking the two 
languages. 

Apart from the boundary problem, for some 
smaller languages there is simply a lack of informa- 
tion about the precise area in which their speakers are 
concentrated. Finally, owing to problems of access — 
some mountainous or remote regions are without 
roads — there may be languages still unknown to the 
outside world even by reputation. Given the length of
time in which languages have been spoken in sub- 
Saharan Africa, which is probably at least as long as 
they have been spoken elsewhere in the world, it is 
possible that languages quite different from nearby 
languages remain to be uncovered. 


10.1.2 Recognizing the same language with different names


The use of heteronyms, different names for the same 
language, is a potential source of confusion for 
readers trying to use this (or any other) atlas to locate 
a particular language. There are various reasons for 
the heteronymy. In the simplest cases, the difference is 
only in variant spellings of the same name, based on 
different spelling conventions used by researchers of 
different language backgrounds. For
example, while English and French researchers have 
come to agree on Swahili as the designation for a 
particular East African language, a German tradition 
usually gives the spelling as Suaheli. On the other 
hand, French researchers usually use the spelling ou 
to represent the sound represented by u in English, so 
that the Ivory Coast language Tura (in this volume) is 
spelled Toura in French. Some English researchers 
maintain the French spelling. Another example of 
English/French orthographic variants is Anyi/Agni for 
a major Kwa language of Ivory Coast. 

A similar type of labelling variation is that 
different forms of the same name sometimes derive 
from dialectal differences within the language, or 
between the language and closely related languages. 
For example, the variants Swati and Swazi to designate
the same South African language derive from
the pronunciation of the name by Swati and Zulu 
speakers respectively, where ti in Swati generally 
corresponds to zi in Zulu. A mixture of dialect differ- 
ences and different spelling conventions results in the 
variants Guere and Wee. The Gu is a French conven- 
tion for W, while the presence or absence of r 
indicates a dialect difference within this Ivory Coast 
language. This language is usually spelled in English 
according to the French convention. 

Among the sources of variant spellings is the way 
that the name of the language sounded to different 
researchers working independently of each other, 
sometimes further compounded by dialect differences 
within the language. The letters i and e, on the one
hand, and u and o, on the other, are rather commonly
interchanged for some languages. The Ugandan
language Nyuli or Nyole exemplifies the variation.
Similarly, l and r are interchanged in many African
languages (though by no means all). The East African
Bantu language Luguru is sometimes spelled Ruguru
or Lugulu for this reason. Luguru is the Swahili name
for this language, where Swahili does distinguish [l]
and [r]. Of a similar source are the variants Kalanga
and Karanga for certain varieties of Shona in south- 
east Africa. However, the reader will find that in this 
case Karanga has been reserved for one set of varieties 
and Kalanga for a distinct set, so that the variant 
labels no longer apply to the same speech varieties. 
Yet another case, commonly encountered in West 
Africa, is variation between spelling with a labiovelar 
and a simple labial, for example the Chadic language 
Gbwata or Bata, and the Gur language Kpalaga or 
Palaka. Awareness of such spelling variation may help 
the reader match a language referred to in another 
source with a language represented in this section, in 
case a variant spelling is not listed here. The problem 
is that additional information is necessary to distin- 
guish different spellings of one name from a very 
different language with a similar name, such as the 
Ubangi language Sango and the several Bantu lan- 
guages called Sangu. 

Somewhat more difficult are cases in which variant
morphologies are used to indicate the same language.
The East Bantu language Chikunda or Kunda
exemplifies this. The root is Kunda. Chi is a prefix used in
the language to indicate the language of this Kunda 
community. The use of the label Chikunda has the 
virtue of distinguishing this language from various 
other Bantu languages also called Kunda. However, 
one cannot count on the exclusion of Kunda in other 
literature to designate this language. In some cases 
variant morphologies further obscure the language 
name, for example the Central Togo language Likpe 
or Sekpela. In this case the root is kpe. The Li is a 
prefix similar in function to the Chi in Chikunda. 
In Sekpela, there is both a prefix Se and a suffix la 
serving a similar function. Clearly in such cases the 
reader cannot recognize that these labels refer to 
the same language without knowing more about the 
languages from which the labels are derived. 

Finally, there are a great many cases in which dif- 
ferent labels for the same language are genuinely 
different. One of the major reasons is that names 
of different dialects of a language have been extended 
by different people to the language as a whole. This
is discussed further in the section on problems of
distinguishing ‘language’ and ‘dialect’ in the sub- 
Saharan African context. Apart from this, a major 
source of heteronymy is the adoption of labels for a 
people and their language from a neighbouring 
group. In this case, what usually happened was that a 
researcher came to know about a language from 
speakers of other languages first, and adopted the 
label they used. The name became embedded in the 
literature on that language, but is not the autonym, 
the name used by speakers of the designated lan- 
guage. Unfortunately, in many cases the label is 
derogatory, reflecting a conventional attitude of 
speakers of the source language to speakers of the 
labelled language. For example, the label Nyika was 
once used by the urban Swahili to designate the rural 
Miji Kenda and their languages. The label refers to 
‘bush’, implies lack of ‘civilization’ and is resented by 
most Miji Kenda. As such cases come to light, pejora- 
tive labels are avoided in the further literature, but 
still remain in the older literature as a source of con- 
fusion to the unaware, and irritation to those who are 
aware. Variant names of African languages are by no 
means purged of all such cases. In some cases I have 
had to mention a derogatory label for a language, 
because a more appropriate label has not yet become 
established in the literature. I apologize to the speakers 
of the language in such cases. See appendix on p. 333 
for discussion of the term ‘Bantu’ in South Africa. 


10.1.3 Recognizing different languages with the same name


This problem is more common among sub-Saharan 
African languages than might be imagined. In some 
cases the similarity is purely coincidental. For example, 
there are at least three totally distinct and widely separ- 
ated sub-Saharan languages called Lele: one is a variety 
of the Zairean Bantu language Kuba, another an East 
Chadic language, and yet a third a variety of the Mande 
language Kuranko. (The 1978 Ethnologue overlooked 
Lele as a dialect of Kuba, but listed yet a fourth Lele, a 
variant of the Gur language Lyele, so this looks like a 
case of different names for the same language.) 
Naturally, the more syllables the name has, the less 
likely a chance similarity. The problem persists, how- 
ever, for social reasons. Especially where different lan- 
guages have been in contact, speakers of one language 
may adopt the ethnic label of speakers of another 
language without also adopting their language. This 
seems to be particularly common in many areas of 
Bantu-speaking Africa. Thus, there is an Mbukushu 
dialect of Mashi in Zambia, geographically close to 
the Mbukushu variety of Kwangari, a language cluster 
principally spoken in adjacent Angola. 

Particularly complicated is the use of the label 
Malinke among various Manding-speaking groups of 
Mali and adjacent coastal West African countries. 
Manding itself is a set of very similar North-West 
Mande languages, including Mandinka (Gambia), 
Maninka (Guinea) and Dyula (Ivory Coast). Some 
communities speaking each of these languages desig- 
nate themselves and their language as Malinke. Each 
such variety has more in common with the other 
Manding languages in its area than with the other 
varieties of ‘Malinke’. A similar situation exists
among the Ngoni in different areas of East Africa. 
The label Ngoni descends from Neguni, adopted by 
linguists to designate a group of South-East Coastal 
Bantu languages including Zulu. Ngoni ethnicity 
arose as a result of northern conquests by groups led 
by successors to the Zulu commander Shaka. How- 
ever, Tanzanian Ngoni is so similar to the adjacent 
southern Tanzanian Bantu languages that it is classi- 
fied as belonging to the same group as Yao and 
Matengo. On the other hand, southern Malawian and 
adjacent Zambian Ngoni is more similar to the Sena- 
Senga group of Bantu languages spoken in that area. 
A few surviving words of distinctly Nguni (Zulu) 
origin across groups indicate a common source for 
their shared ethnicity, but the shared ethnicity does 
not correspond to a distinctively unitary language. 

In sum, there is considerable difference between 
ethnicity and language in many parts of Africa, as will 
be further evident in the discussion of multilingual- 
ism and language shift. Although ethnicity is of at 
least as great importance to the fabric of sub-Saharan 
African society as it is elsewhere, there is a consider- 
able amount of fluidity to ethnic self-designation in 
many areas. Fusions of smaller segments of distinct 
ethnic groups have long led to the formation of new 
ethnic groups and/or subgroups. The linguistic con- 
sequences of these fusions vary from case to case. 
Classification (along with variant labels for the same 
language) is particularly important for recognizing 
different languages with the same name. 
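
The role of classification in telling homonymous labels apart can be sketched in a few lines of Python. This is only an illustration: the records restate the Lele example given above, while the function names and the record layout are assumptions made for the sketch and do not reflect how the atlas itself is compiled.

```python
from collections import defaultdict

# (label, affiliation, note) triples restating the Lele example in the text.
RECORDS = [
    ("Lele", "Bantu", "variety of Kuba"),
    ("Lele", "East Chadic", "distinct language"),
    ("Lele", "Mande", "variety of Kuranko"),
    ("Lele", "Gur", "variant label for Lyele"),
]


def index_by_label(records):
    """Group records by bare label so that homonymous labels become visible."""
    index = defaultdict(list)
    for label, affiliation, note in records:
        index[label].append((affiliation, note))
    return dict(index)


def ambiguous_labels(index):
    """Return the labels that are attached to more than one affiliation."""
    return sorted(
        label
        for label, entries in index.items()
        if len({affiliation for affiliation, _ in entries}) > 1
    )


def qualified_name(label, affiliation):
    """Pair a label with its classification to obtain an unambiguous name."""
    return f"{label} ({affiliation})"


index = index_by_label(RECORDS)
for affiliation, note in index["Lele"]:
    print(qualified_name("Lele", affiliation), "-", note)
```

Keying each entry on the label together with its affiliation, rather than on the label alone, is what keeps the four varieties apart.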


10.1.4 Multilingualism and language shift 


The high degree of linguistic fragmentation of most 
areas of sub-Saharan Africa has led to the widespread 
adoption of second languages for cross-group com- 
munication. Naturally, speakers of smaller, more 
exclusive languages most often adopt second and 
further languages, while maintaining their first lan- 
guage for in-group communication. However, given 
the diversity, the number of second-language speakers 
in a diverse area can add up to a majority. The second 
language adopted by speakers of smaller languages 
may have originally been the first language of a larger 
group, but this is not always the case. 

Swahili, Lingala and Sango are examples of lan- 
guages that have come to be so widespread that 
second-language (L2) speakers far outnumber first- 
language (L1) speakers. The economic networks 
underlying the formation of national states sub- 
sequent to European imperialism gave further 
impetus to the spread of second languages as lingua 
francas. As socio-economic networks have become 
more complex, lingua francas whose second-language 
speakers outnumber their first-language speakers 
have continued to spread. The Lingua Franca map 
shows the areas in which the indicated languages have 
become lingua francas. All the areas are highly diverse 
in terms of first languages. 

The main body of maps shows only the locations 
of first-language communities, and thus can be com- 
pared with the spread of African lingua francas given 
in Figure 10.1. 

Figure 10.1 Africa: lingua francas 

In addition to the expansion of lingua francas as 
second languages, urbanization and exogamy interact 
to create the conditions under which lingua francas 
expand as first languages. Accompanying the great 
amount of urbanization occurring in sub-Saharan 
Africa is a great deal of language shift to the lingua 
francas as first languages by children born in cities. 
These shifts often result in new varieties of the lingua 
franca, as in the case of Kingwana, a variety of Swahili 
evolving in the urban areas of Shaba (Zaire) since the 
end of the nineteenth century. Similarly, the dialect 
of Swahili evolving in Dar es Salaam, Tanzania, 
though less different from its early-twentieth-century 
Zanzibari source than the Kingwana variety, is recog- 
nizably different from other varieties of Swahili. 
Under conditions of language shift, the emerging dif- 
ference between new and old varieties of a language 
depends to some extent on the degree of linguistic 
diversity in an area. 

In Tanzania, where a Bantu language is spoken as 
a first language by an overwhelming majority of the 
population, new varieties of Swahili are recognizably 
but not radically different from older varieties. In 
Nairobi, Kenya, where shift is from both Bantu and 
Nilotic languages, the Swahili lingua franca had 
already pidginized to a great extent, and the newly 
evolving first-language variety of Swahili is much 
more different from older varieties. Similarly, in the 
Kongo lingua franca area of the Democratic Republic 
of Congo (formerly Zaire), a highly pidginized variety 
of Kongo called Kituba has evolved. Possibly this 
pidginization owes something to the influence of 
Ubangi as well as Bantu speakers, although there is 
relatively great diversity among the Bantu languages 
themselves in the area in which Kituba serves as a 
lingua franca. 

Section 10.1.6, on language classification, con- 
siders further the consequences of language shift, and 
suggests that current multilingualism and language 
shift continue, on a larger scale, an old and well- 
established pattern in sub-Saharan Africa, at least 
among the sedentary populations. Language shift 
does not usually result in a shift of ethnicity, though it 
may spawn a new segment of a larger ethnic group. 
Thus, an ethnic label originally associated with one 
language is sometimes transferred to the target of 
language shift, contributing to the phenomenon of 
different languages with the same name, as noted in 
the discussion in Section 10.1.3, above. In the case 
of Sere in north-western Congo (Zaire), the label is 
used here to designate the Ngbaka (Ubangi) lan- 
guage still spoken by some Sere. However, as noted 
elsewhere, many speakers of Sere ethnicity have 
shifted to Zande, the Ubangi lingua franca of the area, 
so that some references to ‘Sere’ in other literature 
may actually refer to a variety of Zande spoken by 
ethnic Sere. 

In addition to African lingua francas, the former 
European colonial languages are also widely spoken 
as lingua francas, especially but not only by people 
who have learned them in school. Thus, it still makes 
sense to refer to Anglophone (English-speaking) and 
Francophone (French-speaking) areas of Africa. 
Portuguese is the third most widely spoken former 
colonial language. Section 10.4.1 lists by nation the 
former colonial languages still maintained as national 
languages, and thus widely spoken in that nation, 
particularly in urban areas. 


10.1.5 The language/dialect problem 


This problem is related to the classificatory problem, 
but restricted to currently spoken languages, whereas 
the classificatory problem also includes a more com- 
plex hierarchical arrangement of groups of languages, 
usually with the assumption that the hierarchy 
reflects the historical series of ancestors of the current 
languages. The larger classificatory problem is dis- 
cussed in Section 10.1.6, below. 

With respect to current languages, the language/ 
dialect problem has a sociopolitical as well as a lin- 
guistic aspect. The political aspect relates to literacy 
and standardization, but involves the consideration 
that speech varieties that may be very similar — in fact, 
mutually intelligible with little or no difficulty — may 
be considered distinct languages by their speakers. 
In such cases, the speakers resist considering the 
varieties to be (merely) dialects of a single language, 
because it is perceived to threaten their ethnic and/or 
sociopolitical integrity. Literacy and standardization 
become involved because the practical consequence 
that most often stems from considering similar 
speech varieties to be dialects of a single language is 
the setting up of a standard variety of such a single 
language as a written language through which literacy 
is introduced in the educational system to speakers of 
the different varieties. 

As a matter of practicality, the fewer standard 
languages the better, since this minimizes the prob- 
lems of devising strategies for introducing literacy. A 
most extreme solution to this problem would be to 
restrict education to literacy in the European lan- 
guages of the pre-independence period, for example 
English, French and Portuguese. While these lan- 
guages are still commonly used at higher educational 
levels in most African countries, and have been 
retained as official languages, as a solution to the 
problems of the considerable diversity of indigenous 
languages, they are not known by most children at 
primary-school ages, and are therefore not practical 
as initial vehicles for literacy. The same holds true 
for the indigenous African lingua francas. In any 
case, literacy and standardization for more localized 
African languages had already begun before the 
colonial period through the efforts of Christian mis- 
sionaries, who believed that their messages could be 
most effectively disseminated through the languages 
most familiar to their audiences. They laid the foun- 
dation for the standardization of a great many local 
languages, and, in fact, have continued to work on 
this process. 

Finally, it must be noted that most local com- 
munities hold literacy in great esteem, along with 
their own community languages, so that they strongly 
support efforts to give their languages written ver- 
sions. Particularly in West Africa, there have been 
many cases of untrained speakers grasping the prin- 
ciples of writing, and then devising writing systems 
for their languages on their own initiative. One of the 
most successful cases was the indigenously created 
writing system for the Vai language in Liberia. It was 
adopted by relatively large numbers of Vai speakers. 
This writing system has never been taught in the 
schools, and is distinct from the standard Vai writing 
system, which uses the Roman alphabet in a con- 
ventional way. 

In creating standard languages from local spoken 
languages, it is often politically desirable to give the 
language a name that does not refer distinctively to 
any one of the varieties, but only to the grouping as 
a whole, and to engineer its linguistic design so that 
the standard compromises on features of any specific 
variety. This avoids the perception that it is biased 
towards a particular community. To some extent this 
has been done with many languages, so that the 
standard language has a different name from any 
of the spoken varieties, such as Yoruba, Igbo and 
Swahili, to name just a few of the larger well-known 
standard languages. Where such standards have 
existed for a while, speakers of particular varieties 
often use the standard name as a reference to their 
own spoken variety. In such cases, they may have 
come to perceive their spoken variety as a ‘dialect’. 
However, it is important to recognize that such a per- 
ception of ‘dialect’ does not have the connotation of 
inferiority to the standard written language that 
commonly occurs to speakers of European languages. 
Thus, for example, Swahili speakers in many areas 
even consider their local varieties superior to the 
standard, perceiving the standard as an artificial 
variety of Swahili designed by outsiders, excluding 
local features in which they take pride. Nevertheless, 
economic circumstances make mastery of standard 
Swahili desirable for many such speakers. The same 
holds true for most other standard African languages. 

With regard to labels for standards, Swahili, 
Yoruba and Igbo are terms that preceded standardiza- 
tion, referring to perceived ethnic affinities among 
groups whose speech varieties were later standardized 
into a single ‘language’. In other cases, however, com- 
promises have led to other labels. A common tactic is 
the use of hyphenated labels referring to different 
spoken varieties, such as Fang-Mbeti in Cameroon. 
Fang and Mbeti speakers originally perceived their 
languages to be different, though they are mutually 
intelligible. Mbeti is specific to Cameroon, while Fang 
is also spoken in some neighbouring countries — for 
example, Gabon and Equatorial Guinea, where other 
labels may also apply to Fang varieties, such as the 
term Pamwe for Fang in Equatorial Guinea. A 
hyphenated label for standards is a practical, but 
not always a completely satisfactory, compromise. 
For example, Fang-Mbeti as a standard label also 
includes Bulu and Ewondo, communities distinct 
from the Fang and Mbeti, though their languages are 
all mutually intelligible. Thus, whether to label Bulu 
and Ewondo as ‘dialects’ of Fang-Mbeti or as separate 
‘languages’ becomes arbitrary, and remains a political 
rather than a linguistic problem. Many other cases of 
labelling bias exist for standard languages. 

Feature bias is even more common, partly 
because one variety is spoken over so much larger a 
territory than another that it was practical to refrain 
from compromising the standard with words or pro- 
nunciations of a geographically less widely known 
variety — for example, introducing distinctively 
Ewondo words into standard Fang-Mbeti — and partly 
because in some cases, at the time of standardization, 
for various reasons, speakers of some varieties were 
simply not consulted. Consequently, as communities 
are newly recognized, decisions continue to have to be 
made about whether their languages are sufficiently 
similar to languages already standardized to be 
included in the standard, or whether a new standard 
has to be devised for them. Thus, attention to the 
language/dialect problem must continue for practical 
reasons. 

It is worth noting that standardization of a par- 
ticular language may be avoided in a multilingual 
community. As a matter of practicality, if a newly 
recognized community also speaks a second language 
that has already been standardized, there may be no 
need to further consider standardization of their first 
language. In this case, the language/dialect problem 
remains strictly linguistic, and is a classificatory 
problem. Deciding whether to standardize or not 
is not always easy. Often only part of a newly recog- 
nized community is bilingual in a language already 
standardized. As noted above, often initial primary- 
school-age children are still monolingual, though 
their parents and older siblings may be multilingual. 

The difference between standard and vernacular 
labels obviously contributes to the problem discussed 
above of different names for the ‘same’ language, par- 
ticularly when speakers accept the standard label as 
the appropriate one to use with outsiders in reference 
to their spoken language. A particularly complex 
example is ‘Akan’ to refer to a group of very similar 
Ghanaian languages including Asante, Fante, Abron 
(Brong) and Akwapem. Originally, the label Twi was 
intended to refer to a standard version of all these 
languages. However, the Fante objected to the label, 
not considering themselves to be ‘Twi’. The hyphenated 
label Twi-Fante sought to compromise. Finally, 
and less divisively, Akan has come to designate the 
group as a whole. However, Akan has not yet completely 
replaced ‘Twi’ as a reference to the standard 
language. Ghanaian Abron, Asante and Akwapem 
speakers — but not Fante speakers — sometimes iden- 
tify their language as Twi. When they do so, they are 
making implicit reference to an ethnic affinity 
amongst their groups that the Fante do not accept. 
Thus, some Asantes, for example, may refer to their 
first language as Twi, Akan, Asante or even some 
more local label, and Fantes to their first language as 
Fante or Akan, but not Twi. Generally, the name 
a speaker offers for his/her first language on any 
occasion depends on how specific an ethnic self- 
identification the speaker chooses to give. 


10.1.6 Classification of languages 


Classification is a continuing process, both because of 
insufficient data for some languages and because of the related 
but distinct issue of insufficient resources for valid- 
ation of genetic classification by historical recon- 
struction of the deeply related branches of each of the 
sub-Saharan stocks. Even where data are sufficient, as 
is generally the case for Bantu (among other groups 
of its time-depth, whose members are transparently 
related and for which there are highly satisfactory 
reconstructions of the proto-language), there are 
inherent problems in classification, which often yield 
different results. The method of classification con- 
sidered most reliable is a qualitative one based on the 
identification of shared innovations across two or 
more groups, taken to indicate an innovation origin- 
ating in their immediate common ancestor. However, 
identification of a shared innovation is often difficult 
to secure. In many cases, innovations — particularly 
cognate words, which were earlier thought to relate 
two groups, and distinguish them from other groups — 
were later discovered in excluded groups. Thus, it is as 
difficult to use this method for subclassification as it 
is to exhaust the relevant data. 

The most common method is a quantitative 
lexicostatistical one, which groups languages accord- 
ing to the percentage of shared cognate words. 
This method generally yields the most satisfactory 
measure of the degree of similarity among speech 
varieties, and corresponds to general impressions 
about the similarities between any pair of languages. 
It works most satisfactorily when languages are very 
similar, that is, share a high percentage of cognates. 
The problem is that there are invariably related lan- 
guages left over that share a lower percentage of 
words with languages of two clearly distinct groups — 
for instance, Pare, which lies between the Seuta and 
Taita groups of North-East Bantu. The solution might 
be to simply leave Pare as a separate group, which 
it clearly is. However, this does not satisfy those his- 
torians who insist on the inference that Pare has the 
same immediate parent as Seuta or Taita, but not 
both. In fact, the inference is probably wrong, because 
neighbouring languages often converge over time due 
to mutual borrowing resulting from bilingualism. 
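
The lexicostatistical procedure, and the way a variety such as Pare can end up between two groups, can be sketched in Python. Everything in the sketch is an assumption made for illustration: the variety names A, B, C, D and X, the five meanings and the cognate-set codes are invented placeholders, not data from any real language, and single-linkage grouping at a fixed threshold is only one of several possible grouping rules.

```python
from itertools import combinations

# Hypothetical cognate-set codes: for each meaning, two varieties that carry
# the same code are judged to have cognate words. All values are invented.
WORDLISTS = {
    "A": {"water": 1, "fire": 1, "stone": 1, "eye": 1, "tooth": 1},
    "B": {"water": 1, "fire": 1, "stone": 1, "eye": 1, "tooth": 2},
    "C": {"water": 2, "fire": 2, "stone": 2, "eye": 3, "tooth": 3},
    "D": {"water": 2, "fire": 2, "stone": 2, "eye": 3, "tooth": 4},
    "X": {"water": 1, "fire": 1, "stone": 2, "eye": 3, "tooth": 5},
}


def shared_cognate_pct(a, b):
    """Percentage of the meanings present in both lists that are cognate."""
    meanings = a.keys() & b.keys()
    if not meanings:
        return 0.0
    shared = sum(1 for m in meanings if a[m] == b[m])
    return 100.0 * shared / len(meanings)


def similarity_table(wordlists):
    """Shared-cognate percentage for every unordered pair of varieties."""
    return {
        frozenset((p, q)): shared_cognate_pct(wordlists[p], wordlists[q])
        for p, q in combinations(sorted(wordlists), 2)
    }


def group_by_threshold(wordlists, threshold):
    """Single-linkage grouping: link any pair at or above the threshold."""
    groups = {name: {name} for name in wordlists}
    for pair, pct in similarity_table(wordlists).items():
        if pct >= threshold:
            p, q = tuple(pair)
            merged = groups[p] | groups[q]
            for name in merged:
                groups[name] = merged
    unique = {frozenset(g) for g in groups.values()}
    return sorted(sorted(g) for g in unique)


print(group_by_threshold(WORDLISTS, 70))
print(group_by_threshold(WORDLISTS, 40))
```

In this invented data set X shares 40 per cent of its items with each member of both groups. At a high threshold it stands alone; at a low threshold it chains the two otherwise unrelated groups into one. Neither outcome assigns it to a single immediate parent, which is the difficulty described above.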

A case in point is the Northern Tanzanian lan- 
guage Ma’a (Mbugu), classified as Cushitic because of 
the size and nature of its Cushitic vocabulary. Ma’a 
has a great deal of distinctively Bantu grammatical 
structure, alien to other Cushitic languages. In 
addition, as a large segment of the population has 
moved from the Pare-speaking Bantu area to a 
Shambaa(Seuta)-speaking Bantu area, its Bantu 
vocabulary has shifted accordingly. It is thought that 
Ma’a was originally a Cushitic language that imported 
Bantu grammatical structure through bilingualism. 
However, more recent work has shown that currently 
Ma’a is an in-group vocabulary of Cushitic (and 
also Nilo-Saharan) origin maintained by Pare and 
Shambaa speakers of Mbugu ethnicity. This has led 
some researchers to propose that Ma’a has shifted his- 
torically from a Cushitic to a Bantu language. While 
the clear distinction between Bantu and Ma’a vocabu- 
lary makes it clear that there has been extensive 
borrowing and realignment between Bantu and non- 
Bantu languages in the history of Ma’a, such sources 
of similarity among neighbouring Bantu languages 
themselves are often much more difficult to detect, 
since the initial similarity is so great. Thus, current 
similarities do not necessarily imply an immediate 
ancestor. In this context it is worth noting that those 
varieties of Pare spoken closer to the Taita area are 
more similar to Taita, and those closer to Shambaa 
area are more similar to Shambaa, all the while 
maintaining an over-riding unity to Pare spoken in 
all areas. Examples of this type are encountered 
throughout large areas of Africa, and are typical of 
dialect continua, as well as of areal convergence of 
languages only more distantly related, or even not 
demonstrably related at all. 

The problem of hierarchical classification accord- 
ing to the classical uni-directional tree notion of 
linguistic change and diversification reproduces itself 
at higher, temporally more distant levels of relation- 
ship, and is particularly difficult for intermediate 
groupings. Many examples will be mentioned in the 
following discussion of recent rearrangements of the 
branches of the four sub-Saharan stocks. 


10.2 The four major divisions 


At least as a convenience, but also for its demon- 
strated research utility, most linguists accept the 
broad outlines of the classification established by 
Joseph Greenberg in the mid-twentieth century, 
which reduces sub-Saharan Africa to four basic 
phyla: (1) Niger-Congo, (2) Nilo-Saharan, (3) 
Khoisan and (4) Afro-Asiatic, of which Chadic is the 
only exclusively sub-Saharan branch. Based on mass 
comparison, and lacking secure proto-language 
reconstructions, many aspects of each of these phyla 
are controversial. Least controversial is Afro-Asiatic, 
of which the unity of Chadic as a branch is unchal- 
lenged. Most controversial are the Nilo-Saharan and 
Khoisan phyla as genetic units. 

It was proposed at one time, on the basis of the 
Kordofan languages of Sudan, that Niger-Congo 
and Nilo-Saharan be reduced to an older single 
phylum (*Kongo-Saharan). But subsequent scrutiny 
of ‘Kordofanian’ indicated that it was a geographical 
designation, not a genetic one, and that the Kadu 
(Kadugli-Krongo) groups were Nilo-Saharan. The 
Heiban, Talodi, Rashad and Katla groups of Kordofa- 
nian are still taken to be Niger-Congo, but the Niger- 
Congo features of the Kadu groups are assumed to be 
the result of long-standing contact with the other 
groups rather than as evidence of Niger-Congo/ 
Nilo-Saharan genetic relationship. 

Further general discussion considers each 
phylum in turn. Mention of grammatical, particularly 
syntactic, features is appropriate to an atlas because 
many of these features tend to be areal due to lan- 
guage contact, and they cut across language families 
in specific areas. This adds significance to features 
that distinguish adjacent languages of different 
families, such as the occurrence of basic VSO (Verb– 
Subject–Object) order among the Nilotic languages 
of Nilo-Saharan, but not in adjacent Bantu languages. 


10.2.1 Niger-Congo 


Niger-Congo (NC) is the dominant phylum of sub- 
Saharan Africa. It accounts for a majority of the 
speakers of sub-Saharan African languages. The 
Bantu subdivision of Niger-Congo accounts for well 
over half of its total number of speakers. At least one 
branch of Niger-Congo figures on almost every map, 
including Bantu on the first (southernmost, Map 89) 
and Atlantic on the last (northernmost, Map 113). 
Owing to its size and importance, Niger-Congo is the 
best studied of the sub-Saharan phyla. However, 
its size and diversity also ensure its need of further 
study. Thus, most of the still-undescribed languages 
of sub-Saharan Africa are probably Niger-Congo. 

Niger-Congo has been extensively classified into 
an elaborate system of branches and sub-branches, 
with continuous modifications made to the inter- 
mediate branches. The classification scheme used in 
this edition of the atlas reflects many modifications 
occurring since the last edition. 

According to the current scheme (Figure 10.2), 
the first distinctive branch to emerge from proto- 
NC, the earliest posited form of Niger-Congo, was 
Kordofanian (Map 76 in the preceding section). 


*Niger-Congo 


Kordofanian *Atlantic-Mande-Congo 


Figure 10.2 Kordofanian 


Kordofanian was once considered a superordinate 
group to Niger-Congo, giving rise to the concept of a 
larger Niger-Kordofanian phylum. However, when 
the Kadu groups were excluded from the phylum, the 
label Niger-Congo was restored. This is similar to the 
former positing of an Indo-Hittite family, super- 
ordinate to the Indo-European, where later classifica- 
tion reduced the extinct Hittite language to a branch 
of Indo-European. The difference is that recon- 
structed Indo-European is assumed to be no older 
without consideration of Hittite than it is with it, 
while Kordofanian, according to the recent proposal, 
would indeed be beyond a reconstruction of NC that 
does not consider it. This is a recent development in 
classification. 

In the first edition of this atlas, the separation of 
Kordofanian from NC was not held to be older than 
the emergence of Mande. Current classification 
(Figure 10.3) holds Mande (Maps 106-13) and 
Atlantic (Maps 111-13) to have emerged subsequent 
to Kordofanian. Given the vast time depth involved 
in a proto-NC that contains these families, it is not 
surprising that some scholars have questioned the 
membership of Mande and Atlantic, especially North 
Atlantic, in NC, to which doubts about Kordofanian 
as currently conceived can be added. 


*Atlantic-Mande-Congo 


Mande 


Atlantic *Ijo-Congo 


Figure 10.3 *Atlantic-Mande-Congo 


Perhaps more surprising is that the integrity of 
Atlantic as a distinct branch is questioned by such 
doubts. In any case, the relative times of emergence 
of Atlantic and Mande from NC are currently 
undecided, and are not a crucial problem since 
both families are at the north-western edge of NC — 
Atlantic mainly coastal, and Mande mainly interior. 
Both are in contact with each other or with other 
branches of NC. On the other hand, Kordofanian is 
confined to a north-east area where it is totally iso- 
lated from other NC languages, and does not figure in 
any of the maps of this section. The extreme geo- 
graphical distance between Kordofanian on one hand, 
and Atlantic and Mande on the other, leads one to 
wonder why the most ancient splits in NC would be 
so far apart, and what the implications are for the 
earliest history of NC. Theories that address this issue 
are most probably premature. 

It remains to be noted that Mande as a whole 
differs from both Atlantic and Kordofanian in its 
S-AUX-OV syntax. S-AUX-OV (Subject–Auxiliary 
Verb–Object–Verb) is a common pattern in NC, 
especially in the west and north, but Atlantic and 
Kordofanian are largely SVO languages, despite prox- 
imity of both to SOV languages. 

The current scheme continues with the later 
emergence of Ijoid (Figure 10.4), a small group sur- 
rounded by Benue-Congo languages (Map 102). Ijoid 
is a very distinct group. Most striking is its canonical 
SOV syntax, whereas the surrounding languages are 
of the canonical SVO type, except for some occurrences 
of S-AUX-OV, already noted elsewhere in NC. 


*Ijo-Congo 


Ijoid *Dogon-Congo 


Figure 10.4 *Ijo-Congo 


Canonically, Ijo exhibits SOV-AUX order. For a 
while, Ijoid was widely assumed to retain this order 
from proto-NC syntax, a notion which enhanced its 
status as one of the older branches of NC. However, 
whether or not proto-NC had canonical SOV syntax 
is currently moot. Despite its SOV syntax, Ijoid has 
the serial verb strategies more widely spread in the 
surrounding Benue-Congo languages, as in ‘knife- 
take meat-cut’, the OV (Object–Verb) version of 
serial ‘cut the meat with a knife’, as in Yoruba ‘take- 
knife cut-meat’. It is thus evident that simple clause 
word order and serializing order are independent 
syntactic strategies. 

The next branch to emerge in NC is Dogon 
(Figure 10.5). Dogon was once classified as a member 
of the Gur family, probably because it is situated geo- 
graphically among Gur and Mande languages (Map 
109), but it is more similar to Gur, as the current scheme 
still implies. The classificatory separation of Dogon 
from Gur, and positing of its prior emergence as a 
distinct branch, presages the more general current 
deconstruction of Gur, soon to be discussed. 


*Dogon-Congo 


Dogon *Volta-Congo 


Figure 10.5 *Dogon-Congo 


The remaining NC branch is generally called 
Volta-Congo (VC). Though its diversification is 
presumably more recent than the branches of NC 
already discussed, its unity is still of great time depth; 
its diversification scheme is intricate and has under- 
gone much recent revision, especially for its more 
northerly members. The primary split in Volta- 
Congo (Figure 10.6) is North/South (sometimes 
labelled West/East respectively, because of the assign- 
ment of Kru to the North branch). 


*Volta-Congo 


*West/North VC *East/South VC = *Benue-Kwa 


Figure 10.6 *Volta-Congo 


The first split in North VC is between Kru and 
the others (Figure 10.7). The Kru group is relatively 
compact and unproblematically unitary, yet there is 
no agreement on its origins in time or place. It is the 
only coastal and relatively southern group in this 
branch, west but not south of South Volta-Congo. 


*West/North VC 


Kru *non-Kru West/North VC 


Figure 10.7 Kru and *non-Kru 
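
The succession of splits shown in Figures 10.2 to 10.7 can be restated as a small Python data structure. This is a sketch only: the node labels follow the figures, with starred nodes standing for posited intermediate groupings, but the dictionary layout and the function names are assumptions made for the example, and the three-way split under *Atlantic-Mande-Congo simply mirrors the statement that the relative order of Mande and Atlantic is undecided.

```python
# Each posited grouping maps to the branches shown in Figures 10.2-10.7.
SPLITS = {
    "*Niger-Congo": ["Kordofanian", "*Atlantic-Mande-Congo"],
    "*Atlantic-Mande-Congo": ["Mande", "Atlantic", "*Ijo-Congo"],
    "*Ijo-Congo": ["Ijoid", "*Dogon-Congo"],
    "*Dogon-Congo": ["Dogon", "*Volta-Congo"],
    "*Volta-Congo": ["*West/North VC", "*East/South VC"],
    "*West/North VC": ["Kru", "*non-Kru West/North VC"],
}


def path_to(target, root="*Niger-Congo", splits=SPLITS):
    """Return the chain of groupings from the root down to the target.

    An empty list means the target does not occur in the scheme.
    """
    if root == target:
        return [root]
    for child in splits.get(root, []):
        sub = path_to(target, child, splits)
        if sub:
            return [root] + sub
    return []


def depth(target):
    """Number of splits between the root and the target, or None if absent."""
    path = path_to(target)
    return len(path) - 1 if path else None


print(" > ".join(path_to("Kru")))
print(depth("Kordofanian"), depth("Dogon"), depth("Kru"))
```

Depth here counts nodes in the diagram, not time: the scheme orders the splits relative to one another but says nothing about how long ago each one occurred.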


Although the Kru family is quite distinct, it also 
seems to be the western end of a continuum with the 
coastal South VC (Benue-Kwa) languages. However, 
recent classification associates it with the more 
northerly groups, including Gur, north of Kwa, and 
Adamawa-Ubangi, north of Benue-Congo. By this 
account, Seme (Maps 104-5) — an isolated Kru lan- 
guage in Burkina Faso, far north of the rest of Kru 
(Maps 105-6) — reflects a Kru faction that failed to 
go south rather than one that somehow migrated 
northward into an area dominated by Gur. 

Figure 10.8, which is not a tree diagram, illustrates 
these uncertainties. Kru may turn out to be a significant 
marker, in some way a turning-point, on an 
ancient VC continuum uniting the North and South 
VC languages. This would have interesting implications 
for their relationships. 


Gur ------ (Old) Adamawa ------ Ubangi 


Kru-(New) Kwa -- West Benue-Congo -- East Benue-Congo 


Figure 10.8 Alternative scheme for Volta-Congo as a 
continuum 


*Gur-Adamawa 


various other/Old Gur    Bariba    Central Gur 


Mbum-Day    Leko-Nimbari    Fali 


various other/Old Adamawa 


Ubangi 


Figure 10.10 *Gur-Adamawa 

Fieldwork so far suggests that the further east one 
goes in North Volta-Congo, the more distantly related 
a language is to the geographically adjacent South VC, 
so that in the extreme east the distinction between 
Bantu (South VC) and Ubangi (North VC) is among 
the sharpest between any two NC groups (Maps 87, 
90, 92, 96). It presumably follows that Gur and 
adjacent Kru are more closely related than Bantu and 
Ubangi are. 

Currently it is tentatively suggested (Figure 10.9) 
that after Kru, Senufo emerged as a separate branch 
of North VC (Maps 104-5). This is part of the further 
deconstruction of Gur, following Dogon. Senufo is a 
relatively closely related group of languages with dis- 
tinct characteristics, including OV syntactic charac- 
teristics similar to Dogon and Mande, in its wider 
vicinity. It used to be considered the West branch of 
Gur. Now it is assumed to be no closer to the 
rest of Gur (Maps 102-5) than it is to Adamawa 
(Maps 89-91, 98, 103) or Ubangi. 


*non-Kru West/North Volta-Congo 


Senufo *Gur-Adamawa 


Figure 10.9 A possible origin for Senufo 


The further deconstruction of Gur also includes 
the former Adamawa-Ubangi group, so that all the 
languages in these groups are now seen as forming 
a northern continuum in North VC called Gur- 
Adamawa. Gur-Adamawa consists of the elaborate 
but unproblematic Central Gur unit, the similarly 
elaborate and unproblematic Ubangi unit, and 
various parallel branches that used to be assigned to 
either Adamawa or Gur, but are now viewed (Figure 
10.10) as independent units in Gur-Adamawa, pend- 
ing further research. 

In essence, then, Adamawa and Gur have been 
reduced to geographic rather than genetic units, and 
the trend in classification has been toward splitting, 
rather than lumping. The implication is that the 
standards for lumping should be raised, and the proliferation
of parallel branches defines the problems
for further classificatory research.


Figure 10.11 *Benue-Kwa [tree diagram; nodes: *Benue-Kwa (S/E Volta-Congo); *Kwa; *West Benue-Congo; *East Benue-Congo]

South VC is more commonly called Benue-Kwa, 
a merger of two groups that used to be presented 
as distinct but now are seen as a continuum. The 
first stage in the merger was the reassignment of the 
western Nigerian languages from East Kwa to West 
Benue-Congo (BC). More recently this reassignment 
has been viewed as arbitrary. That is, West BC is no 
more distant from (New) Kwa than it is from East BC 
(Figure 10.11). The former labels remain, however, 
as geographically distinct groups, according to the 
current scheme. 

Ivorian Kwa remains fragmented into numerous 
parallel groups (Map 105; Figure 10.12). Ghanaian 
Kwa has been more intricately arranged, particularly 
the Potu-Tano sub-branch, which has remained
much the same since the last edition of the atlas (Map
107). The languages once called ‘Togo Remnant’, and
later ‘Central Togo’, are currently organized into two 
large groups, Na-Togo (e.g., Lelemi) and Ka-Togo 
(e.g., Avatime) (Map 102), with no special relation- 
ship to each other within Kwa. 

The Benue-Congo languages, which could 
also be called East Benue-Kwa (where ‘Kwa’ is West 
Benue-Kwa), have been subject to some interesting 
rearrangements (Figure 10.13). Current West BC 
consists of the more southerly West Nigerian lan- 
guages (mainly Map 101), while East BC contains the 
northern and eastern Nigerian languages, and all the 
more eastern and southern BC languages. A novel 
feature of West BC is the assignment of Oko to a 
particular branch that also contains Nupoid and 
Idomoid. Oko was previously considered an isolate in 
BC. It is evident that more has been learned about 
Oko in the interim. 

In East Benue-Congo, former Platoid has been 
further deconstructed, following the separation of 
Kainji from an older concept of Platoid, and the 
subsequent lumping of Tarokoid and Jukunoid with 
Platoid. The label ‘Platoid’ has now been changed to 
‘Central Nigerian’ (Figure 10.14), the branch of 
East BC directly north of West BC. The former 
North and West Plateau languages remain as a single 
unit (North-West), but former South (or Beromic), 
Central and South-East Plateau are now conceived as 
no closer to North-West Plateau than to Kainji or 
Jukunoid. 

Following a proposal of long standing, Cross (Maps 96-7,
101) and Bantoid are now presented as a unit within 
East BC. However, within Cross, Bendi is now sus- 
pected to be an initial split rather than parallel to 
its other branches. Among the re-arrangements of 
Bantoid (Figure 10.15), South Bantoid remains as 
previously, but the former North Bantoid groups are 
now conceived as independent Bantoid groups, no 
closer to each other than to South Bantoid. In add- 
ition, Tikar has been taken out of the South Bantoid 
branch and given independent status along with the 
former North Bantoid groups Mambiloid and Dakoid
(Maps 96-7, 98). Particularly interesting in South
Bantoid is that (Narrow) Bantu is now conceived as a
primary split in South Bantoid, parallel to Tivoid,
Beboid, Jarawan and (especially) Wide Grassfields,
with which it was formerly associated as an intermediate
‘Wide(r) Bantu’ subgroup of South Bantoid.
As a result, ‘Bantu’ rather than ‘Narrow Bantu’ is sufficient
designation for this extensive group, for the
first time since Bantu was assigned membership in
the much wider Niger-Congo phylum.


Figure 10.12 *Kwa (*West Benue-Kwa) [tree diagram; labels: various Ivorian; *Potu-Tano; *Na-Togo; *Ka-Togo; *Ga-Dangme; *Gbe; Krobu; West; Central; South]

Figure 10.13 *Benue-Congo (*East Benue-Kwa) [tree diagram; top level: West Benue-Congo, East Benue-Congo; second level: *YEAI, Akpes, *NOI, *Central Nigerian, ?Ukaan, *Bantoid-Cross; third level: Yoruboid, Edoid, Igboid, Nupoid, Oko, Idomoid, Cross, Bantoid]

Figure 10.14 *Central Nigerian [tree diagram; *Central Nigerian (former Kainji + Platoid): Kainji, North-West (Old Plateau), Beromic (Old Plateau), Central (Old Plateau), South-East (Old Plateau), Tarokoid, Jukunoid]

Figure 10.15 *Bantoid [tree diagram; *Bantoid: Dakoid, Mambiloid, Tikar, South; South: Bantu, Jarawan, Tivoid, Beboid, Wide Grass(fields), Ekoid-Nyang; Wide Grass(fields): Narrow Grass(fields), Menchum-Momo; Narrow Grass(fields): Ring, East (= Mbam-Nkam), Ndemli, Mundani; Menchum-Momo: Menchum, Momo]

Other matters of terminological reform include 
the use of the term ‘Mundani’ for the Narrow Grass- 
fields group that used to be called ‘Momo’, and the 
appropriation of the term ‘Momo’ for the Wide 
Grassfields branch that used to be called ‘West
Momo’. Of greater substance is the assignment of 
Ndemli to Narrow Grassfields (Maps 92-3). Ndemli 
used to be considered an isolate in South Bantoid. It 
is evident that more has been learned about Ndemli 
in the interim. 

Despite extensive reconstruction, Bantu has 
resisted an unambiguous elaborate subclassification, 
for reasons discussed earlier (Maps 74-89, 92-5). 
However, it has long been recognized that a primary 
split in Bantu is between the North-West Bantu 
languages and the others. The north-west languages 
are more diverse and have undergone more radical 
changes than Bantu in other areas. Thus, North-West 
Bantu emerged as a distinctive branch earlier than the 
other branches, and has had greater time to diversify. 
This corresponds to the origin of Bantu-speaking 
communities in the north-west of their current territory,
particularly southern Cameroon, before they
spread to more southern and eastern areas of sub-Saharan
Africa. Most probably, Bantu speakers spread
from the north-west areas in various waves in various
directions during a relatively long period of time. 
This must have contributed somewhat to the devel- 
opment of distinctive, areally based patterns outside 
the north-west. However, subsequent encounters and 
amalgamations of different Bantu-speaking groups 
outside the north-west greatly obscure the relation- 
ship between any particular north-western area and 
any other particular area. Instead, continuous vari- 
ation from any area to any other area is commonly 
observed in all directions. 

For practical purposes it is convenient to divide 
the Bantu continuum into geographical zones and 
use the Guthrie numbers as simplified by the Ethno- 
logue for more specific groups. However, where I 
know such geographically based classifications to be 
genetically misleading, I have avoided them in favour 
of more accurate genetic classifications, for example, 
the assignment of Swahili (G40) to Sabaki, along 
with Mijikenda and Pokomo (E40), rather than to 
the more distantly related Ruvu languages, such as 
Zaramo (G30). The genetic misidentification of 
Swahili stems from Guthrie’s selection of Swahili 
in coastal North Tanzania (Zone G) rather than in 
coastal Kenya (Zone E) as the representative token. It 
is clear that Swahili is most closely related to the other 
languages of the Kenyan coast and originated in that 
area before spreading southward to Tanzania and 
off-shore to Zanzibar and Pemba. 

The most prominent feature of the Niger-Congo 
(NC) phylum as a whole is an intricate system 
of noun classification, which shows up most product- 
ively in nouns derived from verbs. Where the system 
remains intact (as in much of Benue-Congo, especially
Bantu, most of Gur and Atlantic, and part of
Kwa, Adamawa and Ubangi) the general feature is
the intricate classification of non-humans into pairs
of formally unrelated singulars and plurals. The classification
is marked by prefixes in the Benue-Kwa
continuum, and prefixing recurs among northern Atlantic
languages such as Diola. Elsewhere suffixes are more
common, including among northern Atlantic lan- 
guages such as Fulani. Unlike most suffixing lan- 
guages, Fulani shows evidence of former prefixing by 
the correlation of noun-class suffixes with alterations 
of the initial consonant of the root noun. Thus, the 
English ethnonym ‘Fulani’ derives from the Fulani 
plural self-designation, while the French ethnonym 
‘Peul’ derives from the same root as it occurs in the 
singular. 

Within NC, the most extensive areal division 
is between languages that use the genitive order 
possessed—possessor, as in ‘children-our’, and those 
that use the reverse order, as in ‘our-children’. The 
possessor—possessed order is characteristic of most of 
the western NC area, including Southern Atlantic, 
Mande, Dogon, Gur, Kru and Kwa. Elsewhere, 
possessed—possessor order is typical, though there are 
exceptions. 

As mentioned above, canonical OV syntax is 
restricted to Ijoid, with some direct influence on the 
Ogoni group of Cross languages. Elsewhere, OV is 
largely restricted to a subset of the languages that 
feature possessor—possessed order, and is syntactic- 
ally restricted to S-AUX-OV constructions. This fea- 
ture is generally western, but it also occurs in parts 
of Cross and Cameroonian Bantoid. Bantu has 
morphologized OV order with simple object pro- 
nouns, but otherwise VO order is basic. The same 
feature recurs in some Central Nigerian groups of 
East BC — for instance, Duka (Kainji) and Kaje 
(former Central Platoid) — and may have been a 
feature of proto-BC or Benue-Kwa before phono- 
logically induced reduction of morphological com- 
plexity affected verb marking in other Benue-Kwa 
languages, or perhaps even more widely in Volta- 
Congo. At present it is difficult to ascertain how much 
of the morphological complexity of Bantu — apart 
from the noun-class system, and most probably the 
suffixal verb extensional system — preserves features 
from the period of Niger-Congo unity. 


10.2.2 Nilo-Saharan 


The Nilo-Saharan phylum is areally more limited 
than Niger-Congo, but is linguistically even more 
diverse. If its unity reflects historical reality, it is no 
doubt older than Niger-Congo. For the most part it 
stretches across sub-Saharan Africa north of Niger- 
Congo. Unlike Niger-Congo, its major branches are 
not related in a language continuum, but are quite 
discrete, and are separated from each other by inter- 
vening Niger-Congo and/or Chadic languages. 

Classificatory schemes of Nilo-Saharan are quite 
elaborate, especially for the East Sudanic branch. 
There have been many changes in the details of 
the scheme since Greenberg originally posited the 
phylum. There is much less agreement about 
the Nilo-Saharan scheme than about Niger-Congo. 
The inclusion of Songhai is particularly controversial 
(parallel to Mande in Niger-Congo). Some observers 
are more impressed, for example, by formal simi- 
larities between Songhai and Afro-Asiatic vocabulary 
than between that of Songhai and other putative 
Nilo-Saharan languages. 

The scheme followed in the atlas is adopted from 
the one developed by Lionel Bender. A radically 
different scheme proposed by Christopher Ehret 
will also be mentioned, to indicate how unsettled the 
internal relations among the Nilo-Saharan branches 
are. Bender’s scheme (Figure 10.16) recognizes a 
primary split into four parallel families: Songhai, 
Saharan, Kuliak and a fourth, much more complex 
family, abstractly called ‘Core-Satellite’. Songhai 
(Maps 102, 104) is the most questioned member of
the phylum, and is associated with Mande by some
who also question the membership of Mande in
Niger-Congo. Saharan (Maps 99-100, 104) is a distinct
branch whose most prominent member is Kanuri.
Kuliak is a small group in Uganda (Map 86) formerly
classified among the East Sudanic groups. The elevation
of Kuliak to a separate group of Nilo-Saharan is
the most radical revision in Bender’s scheme.


Figure 10.16 *Nilo-Saharan (after Bender)

The Core-Satellite (CS) branch posited by 
Bender is bifurcated (Figure 10.17) into a ‘Core’ 
branch which includes East Sudanic and the Kadu 
group of Kordofanian (once thought to be Niger- 
Congo/Kordofanian), and a ‘Satellite’ branch with 
various parallel sub-branches, including Central 
Sudanic, Maban and Fur. 

Note, then, that according to this scheme East 
Sudanic and Central Sudanic do not have an exclusive 
immediate ancestor within CS. Some of the sub- 
branches of Core-Satellite occur too far north to be 
represented in this section. They are presented and 
discussed in more detail in Section 9.7. 

Central Sudanic consists of eight compact 
groups, each covering a relatively small area. Among 
recent rearrangements of these groups is the separ- 
ation of Bongo and of various languages called Kara 
from the languages of the Sara-Bagirmi (formerly 
Bongo-Bagirmi) group. Bongo is currently not con- 
sidered any more closely related to Sara-Bagirmi than 
to Moru-Madi, Mangbetu-Asua, Mangbutu-Efe and 
Lendu, among other groups of Central Sudanic. 

East Sudanic is the major Core sub-branch. Its 
Surmic and Daju sub-branches have some representa- 
tives in the sub-Saharan maps, for example, Didinga 
(Map 86) and East Daju (Maps 89-90), but its most 
widespread and elaborate sub-branch, Nilotic, is most 
amply represented among the maps of this section. 

The major branches of Nilotic are generally 
called West (Maps 79-86), East (Maps 85-6) and South 
Nilotic (Map 86). West Nilotic has a Northern group, 
only partially represented in this section, and a 
Southern group totally represented. The East Nilotic 
branch is bifurcated into a Bari and a Teso-Maa
sub-branch. Teso-Maa is bifurcated in turn into
Lotuko-Maa, including Maasai among its languages, 
and Teso-Turkana, named after two of its constituent 
languages. The Southern Nilotic languages include 
Datoga and the Kalenjin group. 

Ehret’s scheme of Nilo-Saharan (Figure 10.18) is 
quite different in a number of respects. He posits a 
succession of binary splits from which Central 
Sudanic emerges as a distinct group earlier than 
the differentiation of a North Sudanic branch into 
a Saharan-Sahelian sub-branch. Sahelian lumps together
Songhai in the extreme west and East Sudanic in the
extreme east of the Nilo-Saharan area. The implica- 
tion is that Sahelian spread in a wide belt north of 
Saharan (and Central Sudanic). 

In one respect, Ehret’s scheme is more conserva- 
tive than Bender’s. He maintains Kuliak as an East 
Sudanic group, hence more closely related to Nilotic 
than to Kanuri. Thus, Ehret, in contrast to Bender, 
views similarities between Kuliak and other East 
Sudanic languages, particularly adjacent Nilotic, as 
more often matters of ancient inheritance than of
later borrowing. 

It is interesting to note the great difference 
between Ehret’s and Bender’s views of Nilo-Saharan 
internal relationships, which stands in marked 
contrast to Ehret’s high degree of agreement with 
Niger-Congo specialists on the relationships within 
Niger-Congo, as presented in Section 10.2.1 above. 

Amidst its diversity there is little that gives a 
distinctive grammatical unity to Nilo-Saharan. The 
feature that comes closest is the formation of plural 
pronouns from singular pronouns by the addition of 
a pluralizing affix. Although some Niger-Congo 
(e.g., Guang) and Chadic languages show a similarity
between first-person singular and plural
pronouns, it is not as morphologically regular as
in much of Nilo-Saharan. Generally, first-person
singulars and plurals are formally unrelated in Niger-Congo.
Chadic first-person plurals are discussed in
Section 10.2.4.

Some Sudanic languages have a masculine/ 
feminine gender distinction, for instance Maba in 
Chad and Maasai in East Nilotic. Possibly this feature 
developed due to contact with Chadic and Cushitic, 
respectively, where the latter are both branches of 
Afro-Asiatic, a family for which the feature is characteristic.
However, unlike in other sub-Saharan families
with this gender distinction, in these languages it marks nouns but
not pronouns, so that, as in Niger-Congo (with very
rare innovative exceptions), ‘he’ and ‘she’ are not 
distinguished. 

Case marking, most strikingly of the nominative/ 
accusative type, occurs in some Nilo-Saharan 
groups such as East Nilotic and Saharan. In Kanuri 
(Saharan), the accusative marker is a clitic that 
follows the entire object noun phrase, not just its 
head word. Kanuri is interesting in this respect 
because, though it is an OV language, it employs
possessed—possessor word order, an areal feature of 
the adjacent Chadic languages (VO) and a much 
more extensive VO area of Volta-Congo. Thus, Kanuri 
exhibits structures like [child woman-’s]acc ‘the 
woman’s child’, where a more canonical OV language 
would attach the accusative particle to the head noun, 
as in [woman’s child]acc. The East Sudanic languages
also employ possessed—possessor order, whereas
such Central Sudanic languages as Lugbara use this
order for alienable possession, as in ‘my pot’, but the
reverse order for inalienable possession, as in ‘my
hand’.

Even more striking than case-marking as such is 
the occurrence of ergative case marking among some 
East Nilotic languages. Thus, in Bari an agent follow- 
ing the verb is marked with an ergative suffix. This fits 
in with more widespread verb morphology in East 
Nilotic, by which a verb is marked with a special 
extension if it is transitive but no object is expressed. 
Among these languages, the subject can be omitted 
and the object preposed without this extension 
(since the object is expressed). Bari extends this con- 
struction further so that the object is preposed to the 
unextended verb and the subject can be expressed 
postverbally with the ergative marker. Ergativity is 
unknown elsewhere in sub-Saharan Africa. 


10.2.3 Khoisan 


In terms of number of speakers, Khoisan is by far the 
smallest of the sub-Saharan phyla. Most of its remain- 
ing communities were until relatively recently herders 
in the west (part of the Khoe area) and hunter- 
gatherers elsewhere. Nama-Damara, principally in 
Namibia, is exceptional, spoken by more than 10 per 
cent of the population. It is officially recognized, has
been standardized, and has carried the standardized label
Khoekhoe since 1990. Its chances for survival are
much greater than those of most other Khoisan lan- 
guages. In southern Africa, where the Khoisan lan- 
guages are most widespread and diverse, intermar- 
riage and/or adaptation to the lifestyles of sedentary 
Bantu and Indo-European speakers over the last 
several centuries has been accompanied by a great 
amount of language shift to Afrikaans and/or Bantu, 
so that a few recently spoken Khoisan languages have 
become extinct, and some are on the verge of extinc- 
tion. The inset on Map 74 shows the former location 
of some of these Khoisan languages, some of which 
were still spoken in the mid-twentieth century and 
have been described, for instance, !ora (Korana). 

The Southern African branches of Khoisan 
are !wi-Taa, Khoe and Zhu (Ju), sometimes called 
Southern, Central and Northern, respectively. The 
geographical terms for these branches became 
potentially confusing when the isolated Northern 
Tanzanian languages Sandawe and Hadza were added 
to Khoisan. Although the Khoisan phylum is small 
and relatively simple compared to the elaborate 
schemes discussed in the preceding sections, there has 
been some deconstruction of the former classificatory 
scheme. Thus, ǂHõã, previously grouped with the
!wi-Taa languages, has been withdrawn and elevated 
to an isolate fourth branch of Southern African 
Khoisan, parallel to the other three branches (Map 
74). Kwadi, previously considered a coastal Angolan 
outlier of the Khoe family, is now considered undeter- 
mined for further classification within Khoisan (Map 
84). Ehret has proposed a classification, based on 
successive splits, which has Hadza emerge as the first 
distinctive branch of Khoisan, and then Sandawe 
(Map 86). The former three Southern African 
branches are not further arranged in his basic 
scheme. 

Despite the striking typological resemblances of 
Hadza and Sandawe to the geographically distant 
southern African Khoisan languages, the consensus of 
Khoisan specialists is to consider presumptions of the 
genetic unity of Khoisan to be highly premature, even 
for the southern African branches, let alone their 
relationship to either of the Tanzanian isolates. Thus, 
the recent deconstruction of Khoisan implies a
raising of standards for positing genetic relationships.
It is not doubted that there is a genetic relationship 
among at least some of the Khoisan families, but 
much descriptive work remains to be done before 
genetic relationships can be demonstrated. Clearly 
the Khoisan specialists are putting a higher value on 
accomplishing the descriptive work than on using the 
mass comparison short-cut to positing genetic rela- 
tionships. A similar trend has already been observed 
for aspects of Niger-Congo and Nilo-Saharan, but is 
much more pressing for Khoisan because there are so 
few Khoisan languages left to begin with, and most of 
them continue to be endangered. 

The best-known feature of Khoisan is the inclu- 
sion of various click consonants in their phonological 
inventories, reflecting articulatory mechanisms not 
exploited by languages elsewhere in the world for 
making words (except for the artificially constructed 
Australian ritual language, Damin, and despite the
common use elsewhere of some of the sounds them- 
selves for expressiveness, e.g., English tsk tsk, an 
apical click, to express disapproval, and wide Niger- 
Congo use of the apical or lateral click to express 
denial). The phonological similarities of Sandawe and 
Hadza to the southern African Khoisan families have 
suggested to scholars that the Khoisan languages, 
or at least typologically similar languages, once char- 
acterized a very large area of sub-equatorial Africa 
that is now Bantu-speaking. If so, the current re- 
cession of the Khoisan languages continues an older 
pattern of assimilation and language shift already 
complete in the area between northern Tanzania and 
the southern African Khoisan area. The existence of 
words with click phonemes in adjacent southern 
Bantu languages is an indication of relatively recent 
language shift to Bantu by speakers of some Khoisan 
languages no longer spoken — for example, there are 
Xhosa (Bantu) words traceable to extinct Khoe lan- 
guages. Kwadi, spoken in a small area of coastal 
Angola, may have resulted from an unusual reverse 
shift of Bantu speakers to a Khoisan language. Kwadi 
appears to have become extinct in the last several 
decades. 

The Khoisan languages vary in grammatical 
complexity. Common to most is some kind of noun- 
class system. In this case, Zhu and !wi-Taa are dis- 
tinguished from the others by having a class system 
that does not include a sex-based masculine/feminine 
distinction. In Khoe, Kwadi, Sandawe and Hadza such 
a sex-based system is central to the noun classes. 
Khoe is more elaborate in also having a third gender 
called ‘common’, used when sex identification is 
undetermined or irrelevant. The Khoe gender 
system also has productive non-sexual semantic uses — 
for instance, written.thing-FEM ‘letter, book’, but 
written.thing-MASC ‘newspaper, official document’.

Hadza is unusual among the Khoisan languages 
in having a basic VSO (Verb—Subject—Object) order, 
though it also uses SVO. SVO is also a basic feature of 
Zhu and !wi-Taa, while Khoe, Kwadi and Sandawe are 
SOV languages. Hadza is typologically even more dis- 
tinct in featuring possessed—possessor word order, 
while all the other Khoisan languages feature the 
reverse order, regardless of their basic clause order. 

The typological distinctions among the Khoisan 
languages carry no weight for genetic classification. 
At the same time, the agreement of the Khoisan 
outlier Sandawe with Kwadi and Khoe with respect to 
OV and sexually based nominal and pronominal 
marking is striking, since these are not features of 
adjacent Niger-Congo languages. Since Cushitic has 
these features, it may be plausible that it has influ- 
enced Sandawe, but that cannot account for Kwadi or 
Khoe. It remains to be determined to what extent 
Hadza’s grammar has been influenced by language 
contact, perhaps with Nilotic in the case of VSO 
order, and Bantu in the case of SVO and possessed— 
possessor order. Similarly, it remains undetermined to 
what extent SVO order (as in Bantu and Afrikaans) 
and the loss of inflection in some South African 
Khoisan languages have been influenced by bilingualism
and language shift among their speakers, but time
is running out for some of these languages. 


10.2.4 Chadic 


Chadic is one of the major branches of Afro-Asiatic 
and the only one limited to an area of sub-Saharan 
Africa. The other branches are Semitic, Berber, 
Cushitic, Omotic and the now-extinct Egyptian 
branch (except for ritual liturgical use of Coptic). 
Although one or more languages from each of the 
other surviving branches of Afro-Asiatic are repre- 
sented among the maps of this section of the 
atlas, their main domain is further north. They are 
discussed more fully in Section 9. 

Most Afro-Asiatic specialists do not arrange the 
major branches of Afro-Asiatic with respect to each 
other, so that Chadic and the other branches are all 
presented as parallel. However, Ehret plausibly repre- 
sents Chadic as closer to his northern Afro-Asiatic 
branches (Semitic, Berber and Egyptian) than to his 
southern branches (Omotic and Cushitic). 

The unity of Chadic is unproblematic. Its four 
main branches are called West, Biu-Mandara (or 
Central), Masa and East. Masa is thought by some to 
be a sub-group of adjacent Biu-Mandara (Central) 
rather than an independent branch. The major Masa 
languages are contiguous, and not further arranged, 
giving the appearance of a dialect continuum (Maps 
90, 91). East Chadic is a more elaborate branch than Masa
(Maps 89-90). Although it is the major Chadic
branch in Chad, it is less elaborate than the Biu-Mandara
and West branches, and its communities are generally
much smaller. Biu-Mandara (Maps
90-1, 95-6, 99), centring on the Mandara area of 
northern Cameroon, and West Chadic (mainly Maps 
97-8, 100), almost exclusively Nigerian except for 
Hausa, are the most complex branches of Chadic 
and contain a number of languages with hundreds of 
thousands of speakers. Hausa rivals Swahili in the 
number of second-language speakers and far sur- 
passes it in the number of first-language speakers. 
Sizeable Hausa-speaking communities, mainly urban, 
extend much further east and west across northern 
sub-Saharan Africa than does the Chadic family as a 
whole. 


Figure 10.19 Chadic [tree diagram; example languages shown include Hausa, Bole, Angas, Tera, Bura, Kamwe, Sukur, Tumak, Nancere, Kera and Dangleat]


The Chadic languages share a number of gram- 
matical characteristics with other Afro-Asiatic lan- 
guages, including the masculine/feminine gram- 
matical distinction among nouns and pronouns. In 
contrast to the Nilo-Saharan gender languages, 
Chadic gender marking distinguishes pronouns even 
where it has been lost as an inflection on nouns, as in 
Gu'de (Biu-Mandara: Gbwata). That is, ‘he’ and ‘she’ 
are distinguished in most Chadic languages. Gender 
distinctions include second as well as third persons, 
a common feature of Afro-Asiatic. In some West 
Chadic languages, the gender distinction has been 
lost in the third person, perhaps owing to contact 
with Benue-Congo, but remains in the second person 
singular, for instance in Guruntum. 


Grammatical processes that many Chadic lan- 
guages share in particular with the Northern (Semitic 
and Berber) branches of Afro-Asiatic, but which are 
non-existent in other sub-Saharan families, include 
patterns of morphological modification of a root 
vowel (ablaut) to change the number of a noun or 
aspect of a verb. The morphological subclassification 
of verbs in many Chadic languages according to 
the vowel suffixed to the verb root to form the per- 
fective aspect is also reminiscent of Semitic verb sub- 
classification, though Semitic verbs are subclassified 
according to the perfective (and imperfective) vowel 
internal to the root verb. 

Like most sub-Saharan languages, and most 
Afro-Asiatic languages (outside Ethiopia), Chadic 
languages have the basic order possessed—possessor. 
However, many differ from other sub-Saharan lan- 
guages in using the same set of suffixes as definite 
markers and possessive linkers, as in teacher-DEF 
boy ‘the boy’s teacher’. Some observers have com- 
pared this to the Semitic construct state, a possessed— 
possessor construction which is treated grammatically
as definite in Semitic (as in ‘teacher DEF-boy’).
Note that the definite marker occurs in the
same position in the Chadic and Semitic construc- 
tions, but is prefixed to the following possessor in 
Semitic instead of being suffixed to the preceding 
possessed (head) noun. In both cases the unmarked 
noun can only be interpreted as definite in this 
construction. 

A distinctive feature among languages of all 
major branches of Chadic is the modification of the 
transitive verb according to the nature of a following 
object, usually whether it is a noun or pronoun. In 
some languages the change is only tonal, but in others 
it includes a change in the vowel of the verb suffix. 
Possibly this common Chadic feature stems from 
a proto-Chadic stage in which an earlier set of 
pronominal object suffixes affected the final vowel 
marking the verb’s aspect before being replaced by a 
newer non-suffixed set of object pronouns. 

Many Chadic languages in the various branches, 
but not Hausa, make an inclusive (‘you and me’)/ 
exclusive (‘me and them — but not you’) distinction 
for the first person plural. In many of these languages 
a formal relationship is discernible between the first 
singular and the inclusive plural, but it lacks the 
morphological transparency of Nilo-Saharan first- 
person singular/plural pairs (discussed in section 
10.2.2 above). The distinction is rare, but not 
unknown, elsewhere in sub-Saharan Africa. South of 
Chadic it occurs sporadically in Cameroonian and 
Nigerian Bantoid (e.g., Aghem) and Adamawan (e.g., 
Longuda), where it appears to be more recent than in 
Chadic. Influence of Chadic on this distinction 
further south is unclear. The occurrence of the dis- 
tinction in many Atlantic (Niger-Congo) languages, 
such as Diola and Fulani, is clearly independent of 
Chadic. 

All Chadic languages have basic SVO word 
order, except for a Central Chadic area of Nigeria 
and adjacent Cameroon in which VSO order is basic, 
including Ga’anda, Lamang and Glavda. Some 
scholars think that VSO was the original Chadic 
word order, just as it is attested in most of the 
ancient Semitic languages. The relic status of a 
feature of such time depth would be better supported 
by isolated pockets of VSO in various Chadic 
branches rather than in a single sub-branch (Biu- 
Mandara A). 


10.2.5 Indigenous Indo-European languages


Some of the maps in this section feature certain 
speech varieties of West European Indo-European 
origin that have become indigenous to various 
communities in sub-Saharan Africa within the 
course of the last half-millennium. Among them are 
certain varieties of English spoken as first as well 
as second languages; Krio and Liberian Settler 
English (Map 106) are prominent and quite distinct 
examples. Krio is an 
English-based creole, closely related to other 
Atlantic-area English-based creoles formed partly 
from English pidgins spoken on parts of the West 
African coast for many centuries, and partly from 
Caribbean creolized forms of English. In a fluid situ- 
ation — with trade on a large scale, including slavery, 
liberation from slavery and resettlement of freed 
captives from more eastern parts of West Africa — all 
these varieties, spread over a large Atlantic area, 
came into contact and influenced each other. Once 
developed, Krio was one of the most influential 
creoles in further shaping English pidgins in coastal 
Nigeria and Cameroon. Its first-language speakers 
are largely descendants of people from these various 
areas who settled in Freetown and amalgamated into a 
specific ethnic Krio (creole) community, in other towns 
as well as in the vicinity of Freetown. Large numbers of 
Temne and Mende speakers are among its second- 
language speakers. 

Liberian Settler English is a phonologically and 
grammatically distinct variety of English, originating 
in the English spoken by African-Americans on the 
mid-Atlantic coast of the USA. Following mid- 
nineteenth-century emancipation, thousands of freed 
slaves from this and a wider area were persuaded to 
emigrate from the USA and form colonies in Liberia, 
in both urban and agricultural areas. These immi- 
grants coalesced into an ethnic Settler group. Apart 
from Liberian Settler English, which remains an 
ethnic form of English, Liberian Pidgin or Vernacular 
English developed as a second language among 
speakers of African languages. It is phonologically 
very distinct from other West African Pidgin English 
varieties, particularly nearby Krio, so that there 
are individuals in the general vicinity of Krio and 
Liberian English who speak both varieties as second 
languages and treat them as distinct languages, which 
they are to a far greater extent than most non-African 
varieties of English. 

South African English is a much more trad- 
itional type of English, with millions of first- 
language speakers, largely of European and East 
Indian descent. Almost the entire large East Indian 
community of South Africa, mostly concentrated in 
Durban on the east coast, speaks English as a first 
language. The European and East Indian varieties of 
South African English are largely identical, and are 
similar to other Southern Hemisphere varieties of 
English, such as Australian, which derive from the 
Southern British English of the Home Counties 
around London as spoken from the late eighteenth 
to early twentieth century. Naturally, the South Afri- 
can varieties are closest to Zimbabwean varieties 
spoken by a much smaller but still substantial Euro- 
pean settler group there. As a first language (L1), 
South African English is phonologically very distinct 
from the second-language (L2) English varieties 
widely spoken by L1 speakers of South African 
Bantu languages. It is not evident that South African 
English as spoken by Europeans and East Indians is 
the target for Bantu speakers; rather, the latter appear 
to have developed their own English speech standards. 
With the end of Apartheid, it remains to be 
seen to what extent the two varieties will converge. 
Meanwhile, it is clear that the non-Bantu variety is 
the target of L1 speakers of Afrikaans, of whom 
about one million speak English and Afrikaans with 
equal facility. There has been some phonological 
convergence between Afrikaans and South African 
English. 

In many ways Afrikaans is much more indigen- 
ous to South Africa than South African English, 
and much more distinct from its European relatives. 
Originally descended from non-standard varieties of 
seventeenth-century Southern Dutch (and Flemish), 
it increasingly came to diverge from metropolitan 
varieties, most strikingly in its grammar. Its develop- 
ment was influenced not only by the influx of various 
West Europeans, but also by the Coloured ethnic 
group, which arose from people of mixed ancestry: 
European, Khoisan, West African (mainly slaves 
imported to South Africa during the seventeenth and 
eighteenth centuries) and Malay (imported for labour 
from South-East Asian Dutch colonies). Afrikaans 
developed through this contact in the West Cape area, 
and continues to be the dominant language of the 
West and North Cape areas of South Africa. These are 
the only large areas of South Africa where Afrikaans 
is spoken as a first language by a larger population 
than any Bantu language. The large majority of 
the Afrikaans speakers in these areas are ethnic 
Coloureds. 

Portuguese-based creoles — such as those spoken 
in Principe and São Tomé off the Gabon/Equatorial 
Guinea coast (Map 88), and Crioulo in Guinea-Bissau 
(Map 108) — were once more widely spoken along the 
coast of West Africa, as elsewhere in the Atlantic 
dominion of colonial Portugal. The Portuguese 
creoles bear a relationship to the European language 
similar to Krio’s relationship to European varieties of 
English. In both cases there is much basic restructur- 
ing of the European grammar away from suffixal 
inflection toward the typologically isolating West 
Benue-Kwa grammar. For example, Indo-European 
morphological complexity, such as past-tense suffix- 
ation (or internal modification) of verbs and plural 
suffixation of nouns, becomes optional, and there is a 
preference for past tense-marking by an invariant 
preverbal auxiliary and plural marking by a plural 
demonstrative (or third person) pronoun pre- or 
postposed to the noun. 


10.3 Sub-Saharan Africa as a 
linguistic area 


Amidst the linguistic diversity of the sub-Saharan 
African families, one areal feature that distinguishes 
the area as a whole from adjacent areas is lexical tone. 
In almost all the languages of this area, each word has 
its own inherent tonal pattern, so that many words 
are distinguished by relative pitch alone: for example, 
Hausa (West Chadic) fari (HH) ‘white’: fari (HL) 
‘drought’: fari (LH) ‘rolling of the eyes’. It is particu- 
larly striking that among the branches of Afro-Asiatic 
only Chadic and Cushitic have lexical tone, delimiting 
the northern boundaries of this sub-Saharan areal 
feature. The other families are exclusively sub- 
Saharan and characteristically tonal. 
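
The Hausa example above can be pictured as a small lookup table in which the tone pattern is part of the key. This is only an illustrative sketch: the names LEXICON and glosses_for are invented for the example, and the three entries are the ones cited in the text.

```python
# Illustrative only: the three Hausa words cited in the text, keyed by
# segmental form plus tone pattern (H = high, L = low).
LEXICON = {
    ("fari", "HH"): "white",
    ("fari", "HL"): "drought",
    ("fari", "LH"): "rolling of the eyes",
}


def glosses_for(form):
    """Return every tone pattern and gloss recorded for one segmental form."""
    return {tone: gloss for (f, tone), gloss in LEXICON.items() if f == form}


# The same consonants and vowels yield three distinct words.
for tone, gloss in sorted(glosses_for("fari").items()):
    print(tone, gloss)
```

Dropping the tone from the key would collapse the three entries into one, which is the sense in which the words are distinguished by relative pitch alone.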

Inevitably there are a few sub-Saharan languages 
that have lost lexical tone. Most notable are 
the Niger-Congo languages Ful(ani), representative 
of northern Atlantic, and Swahili, unusual, though 
not unique, among the Bantu languages. Although 
sub-Saharan tonal languages vary in their depend- 
ence on lexical tone for intelligibility, most are less 
dependent than the tonal languages of South-East 
Asia. Most sub-Saharan languages make no more 
than three tonal distinctions among words, of which 
one, the falling, is a hybrid of successive high and 
low tones. Tonal distinctions naturally also affect the 
grammars of African languages, because grammat- 
ical as well as lexical morphemes can be distin- 
guished by tone. For example, in many Bantu 
languages some verb tenses are distinguished by 
tone alone. Loosely speaking, tonal complexity 
varies inversely to morphological complexity in 
sub-Saharan Africa. For example, the southern 
Cameroons is a transitional area between the tonally 
more complex but morphologically simpler systems 
of the West African coast and the reverse complexity 
of sub-equatorial Bantu. The trend there is towards 
greater tonal complexity at the expense of morpho- 
logical complexity, so that much of Cameroonian 
Bantoid is coming increasingly to resemble its west- 
ern Benue-Congo neighbours. Elsewhere, the atonal 
northern Atlantic languages are morphologically 
more complex than the adjacent, highly tonal 
Mande languages, though it should be noted that 
the southern Mande languages manifest some 
morphological complexity by means of tone. Most 
standardized orthographies of African languages do 
not indicate tone. Therefore, the absence of tonal 
marking in orthography should not be taken as an 
indication that the language is atonal. 


Other phonological features of wide distribution 
are forms of vowel and nasal harmony, prenasalized 
stops (sometimes analysed as nasal+stop clusters) and 
labiovelar stops. Of these, vowel harmony is charac- 
teristic of most Niger-Congo and much of Nilo- 
Saharan. In most of the affected languages, this vowel 
harmony distinguishes two sets of vowels according 
to glottal features of their articulation, and is a pros- 
odic feature different from the types of vowel har- 
mony reported for the other areas of the world. 
Labiovelar stops are characteristic of most of the West 
African area, but recur in some Central Sudanic lan- 
guages and then the Miji Kenda languages of coastal 
North-East Bantu, where they are a recent and 
independent development. 

The prominence of nasal consonants across 
African languages can be noticed even in the index by 
the high relative frequency of African language names 
beginning with a nasal consonant, orthographically m 
and n. Prenasalized consonants are so common that 
they might be considered stereotypical of African 
languages, but are also common in the Pacific area 
among some Polynesian, New Guinea and Australian 
languages. Most African languages have relatively 
simple syllable structure, with words ending in vowels 
and few, if any, consonant clusters. However, there are 
many exceptions to this generalization, particularly 
among the Chadic and adjacent Niger-Congo lan- 
guages of Cameroon and adjacent Nigeria. Many of 
the Nilotic languages of East Africa are noteworthy as 
exceptions to the generalization by having a relatively 
large range of word-final consonants. 


10.4 Population statistics 


Where statistics were available, an attempt has been 
made to normalize the number of speakers to the year 
2000 in order to facilitate cross-language comparison 
for the decade of publication. Normalization was not 
applied to languages last counted at 1,000 speakers or 
fewer, and/or where other information indicates that 
the number of speakers has been declining as the 
result of language shift. For such languages, normal 
increase cannot be assumed. Where ‘disappearing’ 
substitutes for number of speakers, the language is 
spoken by several hundred speakers at most, none 
of them children. Such languages are endangered. 
Some languages are specifically marked as ‘probably 
extinct’. These had only a few elderly speakers left at 
last count, at least ten years ago. 
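
The normalization described above can be sketched roughly as follows. The atlas does not state the growth model or the rate that was applied, so the compound-growth formula, the default annual rate of 2.5 per cent and the function name are all assumptions made for illustration; only the two exclusions come from the text.

```python
def normalize_to_2000(count, census_year, annual_growth=0.025, declining=False):
    """Project a speaker count to the year 2000.

    ASSUMPTIONS (not from the atlas): compound annual growth and a
    default rate of 2.5 per cent per year.
    """
    # Exclusions stated in the text: languages last counted at 1,000
    # speakers or fewer, and languages known to be losing speakers
    # through language shift, are left as counted.
    if count <= 1000 or declining:
        return count
    years = 2000 - census_year
    return round(count * (1 + annual_growth) ** years)


# A count of 10,000 taken in 1990, projected at an assumed 2 per cent.
print(normalize_to_2000(10000, 1990, annual_growth=0.02))
```

A small or declining language passes through unchanged, which mirrors the statement that normal increase cannot be assumed for such languages.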

Setting aside expanding lingua francas, most lan- 
guages for which there are statistics for more than 
one point of time are consistent with an assumption 
of normal population increase. However, in quite a 
few cases there are problems. Naturally, the greatest 
problem exists when there is no reasonably recent 
census. Because of large-scale social disruptions in 
various nations of West and Central Africa during the 
1990s, this is a relatively common problem, most 
prominently for DRC (Democratic Republic of 
Congo, formerly Zaire), one of the largest and lin- 
guistically most complex African nations. Even when 
there is a recent census, or some other kind of 
estimate, problems persist. A common one is the 
difficulty of disentangling ethnic from mother- 
tongue counts. With a few exceptions, statistics do 
not distinguish ethnicity from mother tongue, so that 
available counts refer to an ethnic group rather than 
to the numbers who speak the ethnic language as a 
mother tongue — or, for adults, continue to speak it, 
despite having acquired other languages. 

This problem of overestimating mother-tongue 
speakers on the basis of ethnicity is related to urban- 
ization and language shift. Rapid urbanization has 
been taking place in most parts of sub-Saharan Africa 
for well over two generations. With urbanization, 
regardless of inter-ethnic marriage, there is a marked 
tendency for the next generation to shift to the dom- 
inant lingua franca of the urban area. Thus, for 
example, urban Dar es Salaam in Tanzania is in a 
traditional Zaramo area, but probably a majority of 
the ethnic Zaramos speak Swahili as well as, or more 
fluently than, Zaramo. 

Most dramatically, the 1996 Ethnologue, often a 
good source for language statistics, cites for South 
Africa over 890,000 speakers of Hindi and other 
South Asian languages (as of 1986). However, this 
turns out to be a vast over-estimate, based on ethnic 
count. According to the 1996 government census of 
South Africa, there were then over one million people 
in South Africa of South Asian descent. Of those, 
94.4 per cent reported English as their mother tongue 
(and indeed most have lived in South Africa for many 
generations), so that far fewer than 60,000 are likely to 
have an Indic or Dravidian language as a mother 
tongue. At its best, the Ethnologue (2004) reports 500 
speakers of the South Atlantic language, Krim, ‘out of 
an ethnic group of 10,000’ (in 1990). Thus, current 
attention to endangered languages has increased 
awareness of the difference between ethnicity and 
mother tongue. But most often, available sources of 
information do not allow the researcher to make such 
crucial distinctions. 
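
The arithmetic behind the South African example can be sketched as below. The census figure is given in the text only as 'over one million', so the round figure of 1,000,000 is an assumption, as is the function name; the result is an upper bound, since not everyone outside the English-speaking 94.4 per cent need have an Indic or Dravidian mother tongue.

```python
def non_shifted_upper_bound(ethnic_count, shifted_per_mille):
    """Upper bound on mother-tongue speakers of the ethnic language(s).

    shifted_per_mille is the share (per thousand) of the ethnic group
    reporting another language as mother tongue. Integer arithmetic
    avoids floating-point rounding.
    """
    return ethnic_count * (1000 - shifted_per_mille) // 1000


# ASSUMPTION: 1,000,000 stands in for the census's 'over one million'.
remaining = non_shifted_upper_bound(1_000_000, 944)
print(remaining)
```

Set against the 890,000 cited from the ethnic count, a bound of this order shows how far an ethnicity-based figure can overstate mother-tongue speakers.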

It is clear that ethnic counts can also under- 
estimate the number of speakers of a language, since 
the ethnic language may also function as the lingua 
franca of a particular area, in which case it may be 
more widely spoken as L2 than as a mother tongue. 
This has already been discussed in Section 10.1.4. 
Underestimation is obviously a less common prob- 
lem than overestimation, given the naturally small 
number of lingua francas compared to the number of 
mother tongues. Still, there may be many cases in 
which a substantial number of L2 speakers of a 
particular language remain unknown. 

Finally, a great deal of difficulty in interpreting 
the population statistics for many languages and the 
discrepancies between different sources stems from 
variability in the labelling of some languages, as in the 
ethnic self-identification of their speakers. The 1988 
Ethnologue contains an extreme example. It reports 
that Welmers (1971) estimated the Mongo-speaking 
population of DRC (called Zaire at the time) at 
216,000, but that Hayes (1977) gave the figure of 
3,200,000 for Mongo. In this case it seems that 
Hayes’s count refers to the whole Mongo group of 
languages, including languages not normally called 
Mongo by their speakers. Mongo or Mongo-Nkundo 
refers to a group of relatively closely related Bantu 
languages, spoken in the Equateur region of DRC by 
millions of people. It is not clear which varieties of 
Mongo were counted by Welmers, but it is likely that 
they also had distinct names, just like the varieties 
of Mongo that he did not count. In any case, the 
problem lies in which speech varieties have been 
counted as Mongo at different times and/or by differ- 
ent observers, not in an unusual growth and spread of 
Mongo in the 1970s. 

While less obvious than the case of Mongo, the 
Congolese Bantu language Nande shows such fluctu- 
ation. The 1978 Ethnologue gives 480,000 speakers, 
while that of 1987 almost doubles the count to 
903,000 speakers (a figure unchanged in the 1993 
edition). It is not clear what accounts for the fluctu- 
ation in Nande, but the change is much greater than 
what would be predicted by the normal growth rate 
for its area of DRC during that period. 


10.4.1 Notes on the lists of African 
languages 


In view of the above problems, the statistics given for 
the languages are subject to the following annotations 
and explanations. 

/d = discrepancy indicates that there is a large 
discrepancy between the counts of a language at two 
points in time, reflected in the difference between the 
estimated counts in the first and second editions of 
this atlas, as well as in the sources on which the 
counts were based. For instance, the Fante-speaking 
population was estimated at under 2 million for 1990, 
but at over 5 million for 2000. At the same time, the 
other Akan languages of Ghana show a decline. There 
are many possible a priori explanations. Most 
unlikely is natural growth/decline. The annotation 
simply indicates the problem for research beyond the 
scope of this atlas. 

/o = old data indicates that the normalization is 
based on data more than twenty years old. There are 
few languages so marked, but their normalized esti- 
mates range from less than 1,000 (Dulbu) to 442,000 
(Ubu-Mbokomu-Moshi-Kirua-Kilema-Mwika — a set 
of Central Chagga varieties). Given the various possi- 
bilities other than normal growth, estimates of lan- 
guages that have not been counted for such a long 
period of time are considered questionable here, and 
thus annotated. 

Unless otherwise noted, the statistics are 
assumed to be for L1 (first language) speakers. For 
some languages that have lingua franca status, either 
on a local or wider level, the identification L2 is 
given before the figure that indicates second- 
language speakers. For example, the estimate for 
Swahili, the dominant East African lingua franca, is 
well over 78 million speakers in all countries, includ- 
ing L2 speakers. Note that, according to the esti- 
mates given here, while Swahili has more L2 
speakers than Hausa, the major lingua franca of 
northern Nigeria and Niger, it has far fewer L1 
speakers than Hausa. 

When a language is principally spoken in no 
more than two countries, the total number of (L1) 
speakers is generally followed by a figure for one 
of the countries in which it is spoken. For example, 
the total number of Maasai speakers is given as 
1,068,000; then the number of Maasai speakers in 
Tanzania is given as 548,000. The maps (76, 85, 86) 
show that virtually all other Maasai speakers reside in 
Kenya, so it can be assumed that the Kenya figure 
is the difference between the total and the Tanzania 
figures. A separate set of statistics (section 10.4.3) is 
given for the more complicated cases of languages 
with large numbers of speakers in more than two 
countries. 
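
The two-country convention can be illustrated with the Maasai figures quoted above (in thousands). The function name is invented for this sketch, and the calculation assumes, as the text does, that virtually all remaining speakers live in the second country.

```python
def second_country_figure(total, listed_country):
    """Derive the unlisted country's figure for a two-country language."""
    if listed_country > total:
        raise ValueError("country figure cannot exceed the total")
    return total - listed_country


# Maasai: total 1,068 thousand, Tanzania 548 thousand.
kenya_maasai = second_country_figure(1068, 548)
print(kenya_maasai)
```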

Section 10.4.2 below gives the major languages 
for each sub-Saharan nation. What counts as a major 
language naturally depends on the total population 
of the nation. For larger nations, the language must 
be spoken by millions, but for some smaller nations 
several hundred thousand speakers are sufficient. The 
statistics by nation also show that the official lan- 
guages of most African nations continue to be the 
former colonial languages. These are the languages 
used for matters involving the national government, 
and usually also for higher education and for com- 
munication among the educated elite. Tanzania 
remains unique in having chosen an African language 
(Swahili) to the exclusion of the former colonial lan- 
guage (English) as its official language. An increasing 
number of nations, but still a minority, have more 
than one official language, the former colonial lan- 
guage and a national lingua franca of African origin, 
such as English and Swahili in Kenya, or French and 
Rundi in Burundi. 

In contrast to the category ‘official language’, the 
major African languages of a nation usually fit into a 
separate category of ‘national language’. The con- 
sequences of this status vary from one nation to 
another. Minimally, a national language has special 
status as a language in which a certain amount of 
media broadcasting is scheduled on a regular basis. 
National languages are usually standardized with 
respect to vocabulary and grammar, but often not 
with respect to the details of pronunciation. Thus, for 
many national languages, a broadcaster’s accent 
reflects their region of origin. 


10.4.2 Major African languages, by nation 
(in thousands) 


Angola [National/Official = Portuguese] 

Umbundu [NC Bantu] 5,200 
Kimbundu [NC Bantu] 3,300 
Kongo [NC Bantu] 1,484 


Benin [Official: French] 

Fon [NC Gbe] 1,694 
Yoruba [NC Benue-Kwa] 563 
Bariba [NC Gur-Bariba] 529 
Aja [NC Gbe] 436 
Gun [NC Gbe] 387 


Botswana [Official/National = English, Tswana] 

Tswana [NC Bantu] 1,295 
Kalanga [NC Bantu/Shona] 194 


Burkina Faso [Official/National = French, Moore, Fulani, Djula] 

Moore [NC Gur] 5,300 
Djula [NC Mande] 1,300 L2 5,200 
Fulani [NC North Atlantic] 755 
Gurma [NC Gur] 


Burundi [Official/National = French, Rundi] 

Rundi [NC Bantu] 4,738 
Swahili [NC Bantu] L2 3,500 


Cameroon [Official/National = French, English] 

Fulani [NC North Atlantic] 1,011 L2 7,000 
Bamileke [NC Bantoid] 3,822 
Fang-Beti [NC Bantu] 2,600 
Pidgin English [IE English/Weskos] L2 2,600 
Duala [NC Bantu] 52 L2 1,703 


Central African Republic [Official = French] 

Gbaya-Manza-Ngbaka [NC Ubangi] 1,202 
Sango [NC Ubangi] 368 L2 1,050 
Banda [NC Ubangi] 692 
Sara-Bagirmi [NS Central] 196 
Fulani [NC North Atlantic] 162 
Ngbandi [NC Ubangi] 159 


Chad [Official/National = French, Arabic] 

Shuwa [AA Arabic] 914 L2 4,235 
Sara-Bagirmi [NS Central] 2,016 
Kanembu [NS Saharan] 471 
Maba [NS Maban] 258 
Fulani [NC North Atlantic] 184 


Comoros [Official = French] 

Comorian [NC Bantu/Swahili] 552 


Congo [Official = French] 

Munukutuba [NC Bantu/Kongo] 1,307 
Teke [NC Bantu] 510 
Lingala [NC Bantu] L2 321 


Cote d'Ivoire [Official/National = French] 

Baule [NC Kwa] 2,279 
Senufo [NC Senufo-Gur] 1,338 
Dan [NC Mande] 856 
Anyi [NC Kwa] 653 
Bete [NC Kru] 559 
Djula [NC Mande] 513 
Wee [NC Kru] 483 


Democratic Republic of Congo [Official/National = French, Luba, Lingala, Kongo, Swahili-Ngwana] 

Swahili [NC Bantu] L2 10,738 
Luba [NC Bantu] 9,204 
Lingala [NC Bantu] 350 L2 8,260 
Kongo [NC Bantu] 6,500 L2 960 
Mongo [NC Bantu] 5,472 
Songye [NC Bantu] 1,200 
Nande [NC Bantu] 1,147 
Ngbaka [NC Ubangi] 1,020 


Equatorial Guinea [Official/National = Spanish] 

Fang [NC Bantu] 385 
Bubi [NC Bantu] 50 


Gabon [Official/National = French] 

Fang [NC Bantu] 517 
Shira-Punu [NC Bantu] 385 
Mbere [NC Bantu] 133 


Gambia [Official/National = English] 

Mandinka [NC Mande] 427 
Fulani [NC North Atlantic] 247 
Wolof [NC North Atlantic] 156 
Diola [NC North Atlantic] 56 


Ghana [Official/National = English] 

Akan [NC Kwa] 7,400 
Ewe [NC Gbe] 1,761 
English [IE] L2 1,200 
Ga-Dangme [NC Kwa] 1,149 
Dagaare [NC Gur] 1,020 
Abron [NC Kwa] 661 
Kusaal [NC Gur] 575 
Konkomba [NC Gur] 460 


Figure 10.21 Africa: official languages of non-African origin 


Guinea [Official/National = Futa Jalon [v. Fulani], French] 

Futa Jalon [NC North Atlantic/Fulani] 2,700 
Maninka Kankan [NC Mande] 2,684 
Susu [NC Mande] 968 


Guinea Bissau [Official/National = Portuguese] 

Crioulo [IE] 34 L2 625 
Balanta [NC Atlantic] 333 
Pulaar [NC North Atlantic/Fulani] 222 
Manjaku [NC Atlantic] 154 
Mandinka [NC Mande] 140 
Papel [NC Atlantic] 114 


Kenya [Official/National = English, Swahili] 

Swahili [NC Bantu] 155 L2 27,900 
Gikuyu [NC Bantu] 6,309 
Luhya [NC Bantu] 4,546 
Luo [NS Nilotic] 3,758 
Kalenjin [NS Nilotic] 3,269 
Kamba [NC Bantu] 3,256 
Gusii [NC Bantu] 1,867 
Meru [NC Bantu] 1,540 
Miji Kenda [NC Bantu] 1,425 


Liberia [Official/National = English] 

English [IE] 83 L2 2,000 
Kpelle [NC Mande] 619 
Bassa [NC Kru] 411 
Grebo [NC Kru] 270 
Klao [NC Kru] 229 
Dan [NC Mande] 228 
Mano [NC Mande] 204 
Loma [NC Mande] 168 


Malawi [Official/National = English, Nyanja, 
Tumbuka] 


Nyanja-Chewa [NC Bantu] 4,100 L2 2,736 
Lomwe [NC Bantu] 1,767 
Yao [NC Bantu] 1,143 
Tumbuka [NC Bantu] 755 


Mali [Official/National = French, Bambara, 
Mamara and Senari [v. Senufo], Fulani, 
Maninka, Soninke, Songhai, Dogon, Tamasheq, 
Bozo, Xasonke, Hassaniya Arabic, Bwamu] 


Bambara [NC Mande] 3,105 
Senufo [NC Senufo-Gur] 1,780 
Fulani [NC North Atlantic] 1,443 
Maninka [NC Mande/Malinke] 992 
Soninke [NC Mande] 847 
Songhai [NS Songhay] 696 
Tamasheq [AA Berber] 584 
Dogon [NC Dogon] 531 
Bozo [NC Mande] 286 


Mozambique [Official/National = Portuguese] 


Makua-Lomwe [NC Bantu] 4,450 
Tsonga-Ronga-Tshwa [NC Bantu] 4,320 


Sena [NC Bantu] 1,820 
Chwabo [NC Bantu] 1,064 
Shona [NC Bantu] 778 
Nyanja-Chewa [NC Bantu] 512 


Namibia [Official/National = English] 
Ovambo-Kwanyama [NC Bantu] 843 


Nama-Damara [Khoisan] 204 
Afrikaans [IE] 169 
Herero [NC Bantu] 140 


Niger [Official = French /National = Hausa, 
Zarma, Fulani, Tamasheq, Kanuri, Tubu, 
Arabic, Gurma] 


Hausa [AA Chadic] 5,300  L2 2,700 
Zarma [NS/Songhay] 2,226 
Fulani [NC North Atlantic] 901 
Tamasheq [AA Berber] 848 
Kanuri [NS Saharan] 424 
Tubu [NS Saharan/Teda-Daza] 42 


Nigeria [Official/National = English, Yoruba, 
Hausa, Igbo, Fulani, Efik, Kanuri, Edo, Idoma] 


Hausa [AA Chadic] 23,527 
Yoruba [NC Benue-Kwa] 22,809 
Igbo [NC Benue-Kwa] 18,540 
Fulani [NC North Atlantic] 9,670 
Kanuri [NS Saharan] 4,160 
Nupe-Gbagyi [NC Benue-Kwa] 3,620 
Efik-Ibibio [NC Benue-Kwa] 3,400 L2 2,120 
Tiv [NC Bantoid] 2,810 
Ijo [NC Ijoid] 2,690 
Edo [NC Benue-Kwa] 1,400 
Izi [NC Benue-Kwa/Igboid] 1,157 
Idoma [NC/Benue-Kwa] 1,143 
Igala [NC/Benue-Kwa] 968 


Republic of South Africa [Official/National = 
Zulu, Xhosa, Afrikaans, English, Tswana, 
Sotho, Tsonga, Venda, Ndebele] 


Zulu [NC Bantu] 9,940 
Xhosa [NC Bantu] 7,780 
Afrikaans [IE] 6,280 
Pedi [NC Bantu/South Sotho] 3,990 
English [IE] 3,872 
Tswana [NC Bantu] 3,560 
Sotho [NC Bantu] 3,244 
Tsonga [NC Bantu] 1,896 
Swati [NC Bantu] 1,094 


Venda [NC Bantu] 946 
Ndebele [NC Bantu] 634 


Rwanda [Official/National = French, English, 
Rwanda] 


Rwanda [NC Bantu] 6,882 


Senegal [Official = French/National = Wolof, 
Fulani, Mandinka, Malinka, Serer-Sin, 
Soninke, Diola-Fogny] 


Wolof [NC North Atlantic] 3,360 L2 2,253 
Fulani [NC North Atlantic] 2,377 
Serer [NC North Atlantic] 1,088 
Mandinka [NC Mande] 571 
Malinke [NC Mande] 360 
Diola-Fogny [NC Atlantic] 276 
Soninke [NC Mande] 180 


Sierra Leone [Official/National = English] 


Krio [IE/English] 539 L2 4,600 
Mende [NC Mande] 1,840 
Temne [NC Atlantic] 1,464 


Swaziland [Official/National = English, Swati] 
Swati [NC Bantu] 787 


Tanzania [Official/National = Swahili] 


Swahili [NC Bantu] 3,264 L2 29,376 


Sukuma-Nyamwezi [NC Bantu] 7,170 
Gogo [NC Bantu] 1,612 
Bena-Hehe [NC Bantu] 1,595 
Haya [NC Bantu] 1,524 
Tumbuka [NC Bantu] 1,210 
Chagga [NC Bantu] 1,150 
Makonde [NC Bantu] 1,090 
Ha [NC Bantu] 968 
Nyakyusa [NC Bantu] 930 


Togo [Official/National = French, Ewe, Kabre] 


Ewe [NC Gbe] 1,095 
Kabiye [NC/Gur/Kabre] 742 


Uganda [Official/National = English] 


Swahili [NC Bantu] L2 7,000 
Nyankore-Kiga [NC Bantu] 3,854 
Luganda [NC Bantu] 3,479 L2 8,900 
Lango-Acholi [NS Nilotic] 2,190 
Soga [NC Bantu] 1,747 
Teso [NS Nilotic] 1,270 


Zambia [Official/National = English] 


Bemba [NC Bantu] 2,420 
Tonga [NC Bantu] 1,200 
Nyanja-Chewa [NC Bantu] 1,197 


Zimbabwe [Official/National = English] 


Shona [NC Bantu] 6,910 
Ndebele [NC Bantu] 1,600 


10.4.3 Major transnational languages 


Swahili [NC Bantu] 

Tanzania 3,264 L2 29,376 
Kenya 155 L2 27,900 
Democratic Republic of Congo L2 10,738 
Uganda L2 7,000 
Mozambique 2,000 

Rwanda L2 2,000 
Burundi L2 1,500 
Comoros 552 

Swahili total 78,971 5,971 L2 73,000 


Hausa [AA Chadic] 


Nigeria 23,527 L2 16,000 
Niger 5,300 L2 2,700 
Elsewhere 4,290 L2 100 
Hausa total 51,917 33,117 L2 18,800 
Fulani [NC North Atlantic] 

Nigeria 9,670 
Cameroon 1,011 L2 7,000 
Guinea 2,700 
Senegal 2,377 
Mali 1,443 
Niger 901 
Burkina Faso 755 
Gambia 247 
Guinea Bissau 222 
Chad 184 
Central African Republic 162 
Sudan 122 
Fulani total 26,794 19,794 L2 7,000 


Manding [NC Mande] 

Mali 4,097 
Guinea 2,684 
Burkina Faso 1,300 L2 5,200 
Senegal 951 
Cote d'Ivoire 513 
Gambia 427 
Guinea Bissau 140 
Manding total 15,312 10,112 L2 5,200 


Kanuri-Kanembu [NS Saharan] 


Nigeria 4,160 
Chad 578 
Niger 424 
Sudan 222 
Kanuri-Kanembu total 5,384 
Songhai [NS Songhay] 

Niger 2,261 
Mali 696 
Burkina Faso 129 
Elsewhere 90 
Songhai total 3,176 


10.4.4 Other sub-Saharan languages 


See Section 10.4.1 for notes on interpreting the stat- 
istics given below. In column 1, languages are listed 
with their common alternative names. Column 2 
gives the relevant map number/language number — 
so, for example, !'o-!khung appears on Map 74 as lan- 
guage no. 60; it is also found on Map 84 (language 
no. 49). Some languages of the same name are iden- 
tified by their Guthrie numbers; Guthrie classified all 
Bantu languages by zone (letter) and group (num- 
ber): Bati C50 is thus distinguished from Bati A40. 
In some cases, the number of speakers (given in 
thousands) of a language is not known, but a refer- 
ence is given to a larger group of which the language 
is a member, where an estimate for the number 
of speakers of the larger group is available — for 
example, Akutu as a member of the Mongo group, or 
Bieno as a variety of Umbundu. For different lan- 
guages with the same name, classificatory informa- 
tion is given to distinguish the languages, e.g., Lamba 
the Bantu language and Lamba the Gur language. 
Otherwise, refer to the maps to obtain classificatory 
information. 
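
For readers who wish to process the list mechanically, the speaker-count column can be read along the following lines. This is a hedged sketch only: the class and function names are invented here, the /d and /o flags follow Section 10.4.1, and entries with a parenthetical addition (such as an L2 figure or a per-country figure) are deliberately left unparsed and returned as a note.

```python
import re
from typing import NamedTuple, Optional


class SpeakerCount(NamedTuple):
    thousands: Optional[int]  # None when no figure is given
    upper_bound: bool         # True for entries such as "<1"
    flag: Optional[str]       # "d" (discrepancy), "o" (old data) or None
    note: Optional[str]       # non-numeric content, e.g. a group reference


# Optional "<", a number with optional thousands separators,
# then an optional "/d" or "/o" annotation.
_PATTERN = re.compile(r"^(<)?(\d{1,3}(?:,\d{3})*)(?:/([do]))?$")


def parse_speaker_field(field: str) -> SpeakerCount:
    """Parse one entry of the 'Number of speakers' column."""
    text = field.strip()
    if text == "-":  # figure not known
        return SpeakerCount(None, False, None, None)
    match = _PATTERN.match(text)
    if match is None:  # group reference, status note, or unhandled form
        return SpeakerCount(None, False, None, text)
    less_than, digits, flag = match.groups()
    return SpeakerCount(int(digits.replace(",", "")), less_than is not None, flag, None)


for sample in ["975/d", "11/o", "<1/d", "-", "Mongo", "probably extinct"]:
    print(sample, parse_speaker_field(sample))
```

Keeping the annotation separate from the number makes it straightforward to exclude flagged estimates from any totals.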
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Language Number of speakers Language Number of speakers 
[alternative name] Map no./language no. (in thousands) [alternative name] Map no./language no. (in thousands) 
!'o-!khung [(!'o-)!xung] 74/60, 84/49 12 Anyama 101/10c - 
'khong [!/X6o] 74/56 5 Anyi 103/05, 105/14 975/d 
‘khung [(!'o-)!xung] 74/59 6 Aowin 103/06b, 105/15 - 
Apakibeti 79/32, 82/35b 33 
"‘Dongo 79/51 6 Arbore 86/98 - 
"‘Dugwor 94/37 - Arusha 86/4 127 
Asala 90/73d, 99/17 7 
Ani 74/53 2 Asante 102/28, 103/01, 105/19 1,416/d 
IIGana [Gllana] 74/51 1 Asom 87/48b, 91/57 - 
Gui [G/ui] 74/50 <1 Asua 79/61 - 
WKh'au-//'e [IIX'au-Il'e] 74/58, 77/75 Atakat 96/46d, 97/22 = 
Aten 96/48b, 97/14 54/d 
Abayongo 101/29e - Atie 105/06 476/d 
Abdalla 85/A = Atong 93/33, 95/40a - 
Abdwak 85/B - Atsi 83/69, 88/43f - 
Abey 105/05 191/d Aulihan 85/H - 
Abiji 105/03 62 Auna 101/39 56 
Abini 101/29c - Aushi 77107 176 
Abinsi 96/53, 95/80 3 Avatime [Siya] 103/30 - 
Abron 103/04, 105/18 937 Avatime-Yangbo 102/06 15 
Absa 96/08, 101/42 195 Avikam 105/02 25 
Abu-Chesu [Arum-Tesu] 96/57 5 Avukaya 79/65d 53/d 
Abua 101/08a 33 Awak [Yebu] 98/02 11/o 
Abua 79/61f - Awing 93/54, 95/57 - 
Abure 105/10 67/d Awutu 102/18a, 103/20 136 
Achipa 100/17a, 101/47b 29/d Ayu 96/40, 97/13a 6 
Acholi 86/62, 79/76 978/d Aza 99/26 = 
Adangme [Dangme] 102/13, 103/36 1,049 
Adele [Sedere] 102/07b, 103/22 24 Babale-Kunda 82/32, 87/24 = 
Adhola 86/65 352 Babale-Lenge 82/36, 87/29 = 
Adie 92/37a = Babanki 93/40, 95/42c = 
Adim 101/29d - Babende 87/57d - 
Adja [Aja] 102/02b 575 Babenzele 87/35 - 
Afa'da [Afa'de] 90/67, 99/05 26 Babole 87/34 5 
Afrikaans 74/62 7,018 (L2 11,000) Bachama 91/75, 98/108a 186/d 
Agara'iwa 100/20, 101/39 - Bada 96/60, 97/58, 98/48 13 
Agbo 101/82a Igbo Badara 96/59a, 97/59 = 
Aghem 95/41 30 Bade 100/08 265/d 
Agoi 93/12, 95/74 16 Bafek 93/98c = 
Agwagune 93/16, 101/29a 39 Bafia (Total) 93/77-81 131 
Ahanta 103/08 127 Bafut 93/50, 95/55 68 
Ahlo(ng) 102/03, 103/29a 8 Baga Binari 107/18 4 
Aizi 105/21 19 Baga Koba 107/20 6 
Aja 79/59, 89/79 - Baga Ma(n)duri 107/16 5 
Ajukru 105/04 106 Baga Mboteni 107/06 4 
Ajuran 85/43e 33 Baga Sitemu 107/10 37/d 
Aka 82/22, 87/23 - Baga Sobane 107/19 probably extinct 
Aka [Mbosi] 87/41 34 Bageto 87/53g 57 
Aka-Ngombe 89/59 - Bago 102/48 8 
Ake 96/37 - Bagyeli 88/34, 92/32 <1/d 
Akoko 101/87 - Bailundu 84/26d Umbundu 
Akoose 92/28, 93/84 74 Bajun [Tikuu] 85/1a 78 
Akpa 101/58a 30 Bajwe 87/56a - 
Akpafu-Lolobi [Siwu] 102/10, 103/25 20 Bajwo 95/35b - 
Akpes 101/77 14 Baka 79/64 32 
Akposo 102/04, 103/29 127 Baki 87/50, 91/58 - 
Akutu 79/41d, 80/04h, 81/01v Mongo Bakoko 92/37 65 
Akwa 81/18, 83/38, 87/42 - Bakole 92/19, 93/89 13 
Akwapem 102/29, 103/03 278/d Bakoni 93/30 = 
Aladian 105/01 28 Bakosongo 83/10 = 
Alago 95/93, 96/67c 69 Bakpinka 93/10a probably extinct 
Alagwa 76/69, 86/91 18 Bakundu 92/26b, 93/83c 100 
Alege 95/73a 1 Bakwe 105/27 12 
Ali 87/81, 89/37b 38 Bala 84/18 Kimbundu 
Alur 86/61, 79/74 1,168 (DRC 660) Balanta 108/21 347 
Amadi 79/50 6 Baldamu 94/31 - 
Amba 79/09 26/d Bale(dha) 79/60a - 
Ambam 88/43c, 92/40d - Bali (NC/Gur-Adamawa/ 98/33 3 
Ambele 93/31, 95/40g - Yendang) 
Ambo 75/34, 77/36 3 Bali (NC/North-West 
Amgbe 101/86 - Bantu/Ngombe) (C50) 79/28, 83/67 = 
Amo 97/24 5 Bali Nyonga (NC/ 
Ampari 104/04b Dogon Mbam-Nkam/Nun) 93/46a, 95/52 74 
Amu 85/1d 18 Balong 92/30, 93/87 = 
Ana 102/32a 91 Bambama 83/29a, 88/47 16 
Anang [Anaang] 101/19 1,300/d Bambara [Bamankan] 104/73, 107/28a, 108/35a 3,527 
Angas 96/83, 97/73a, 98/84 424/d Bambili 93/52, 95/56 14 
Angba 79/29 - Bambomba 87/33a - 
Animere 103/33 3 Bambuka 98/19 20 
Language [alternative name]   Map no./language no.   Number of speakers (in thousands)

Bamnyo 

Bamun [Shipamon] 
Bamunka 

Bana 

Banda 

Banda Mid-Southern 
Banda South-Central 
Banda-Banda 
Banda-Mbres 
Banda-Ndele 
Bandawa-Minda 
Bandi 


Banga (NC/Gur-Adamawa/ 
Waja/Mboi) 

Banga (NC/West Kainji/ 
Duka) 

Bangandu 

Bangari Me 

Bangba-Mayogo 

Bangem 

Bangkeng 

Bango 

Bangu-Bangu 

Bangwinji 

Bankal 

Banma Tegu 

Banoo 

Banyun 

Bapakum 

Bapuko [Bapuku] 

Barambo 

Barani Ful 

Barein 

Bari 

Bariba [Baatonum] 

Benin 

Barma 

Barwe [Wesa] 

Basa Kaduna 

Basaa 

Basaa ba Duala 

Basari 

Basari-Tobote 

Bashar 

Basho 

Basila 

Bassa 

Basum 

Batanga 

Bati (NC/North-West 
Bantu/Ngombe) (C50) 

Bati (NC/Bantu/Mbam/A40) 

Batomo 

Batu 

Baule-Ando 

Baushi 

Bay 

Baya 

Bayot-Essin 

Bazanche 

Be(e)ten 

Beba' 

Bebe 

Bedik 

Beeke 

Beekol 

Befang 

Bekware [Bekwarra] 

Bekwil 

Bekwil Peda 

Bele 

Belingo 

Bemba 

Bembe (NC/North-East 
Bantu/Shi-Havu) (J50) 

Bembe (NC/Central-West 
Bantu/Kongo) (H10) 

Ben [Gan] 


91/43d, 95/70 

95/51, 93/47 

93/43, 95/44a 

94/15 

79/55a, 82/56, 87/95, 90/21 


98/60 
106/27 


98/12b 


100/26, 102/43 
87/87 

104/04f 

79/47 

93/84a 

93/88k 

79/26, 82/30 
78/27, 80/61 

98/05 

96/62, 97/54, 98/50 
104/041 

92/23 

108/27 

93/46b 

88/01a, 92/24a 
79/44 

104/B 

89/87 

86/67, 79/69 
101/07, 102/44, 104/67 


90/44f, 99/31 
74/18, 75/42c 
101/41 

92/35, 93/95 
92/35b 

107/12, 108/22 
102/54, 103/59, 104/30 
98/64a 

95/35a 
102/07a, 104/69 
106/14 

91/36a 

92/24 


79/31, 82/37 
92/10, 93/74 
95/07 

95/15 
105/13 
101/45 
81/09 

95/98 
108/09 
102/16, 104/71 
87/49, 91/59 
95/55a 
95/18 
108/23 
79/21 
87/57e 
95/39b 
95/73c 
87/531 
87/53m 
91/37a 
89/33 

76/60, 77/01, 78/29 


80/48 
83/13, 88/54 


105/72 


- Bena 
300 Bende 
26 Bendeghe 
59/d Benga 
735 (CAR 692) Benge 
108 Bereby Kru 
162 Besep 
110 Besleri 
48 Besum 
47 Bete 
15 Bete-Bendi 
84/d Biafada 
(refug. GU 59) Bieno 
Bieri 
- Bijago 
Biugi 
10 Bilala 
4 Bile 
Dogon Bima 
13 Bindi 
- Binji [Binja] 
- Binza 
- Bira (NC/North-East Bantu/ 
197 Bira-Huku) (D30) 
7 Bira (NC/North-East Bantu/ 
51 Nyoro-Ganda) (J10) 
Dogon Birawa 
- Birgit 
15 Birifor [Birifur/Lobr] 
- Birom 
9 Bisa (NC/East Mande) 
33/d Bisa (NC/Central Bantu/ 
- Bisa-Lamba) (M50) 
5 Biseni-Oko(r)dia 
475 (Sudan 375) Bisoo 
644 Bitare 
529 Bitieku 
54 Bo 
29 Bobangi 
- Bobo [Sya] 
390 Bodede 
- Bodikiri 
18 (Guinea 10) Bodiman 
177/d Bodo 
<1 Bodongo 
- Bofi 
13 Boghom 
448 Boguru 
- Bok(y)i 
16 Bokare 
Bokare-Mbakolo 
6 Bokas [Bokkos] 
9 Bokote 
- Bokpan 
44 Bole 
2,447 Bolgo 
27 Bolia 
- Bolo 
- Boloki 
15/d Bolon 
- Bom-Mani 
- Boma 
Bombali 
- Bomboma 
4 Bombwanja 
- Bomudi 
16 Bomvana 
- Bomwali 
133 Bon Gula 
20 Bondanga-Lingonda 
= Bonde 
- Bondei 
Linda Bondo 
2,662 Bondum Dom 
Bong'omok 
335 Bongo 
320/d Bongo (and dialects) 
Boni 
97 (Mali 78) Bonjo 


76/42 

76/23, 78/42 
93/24, 95/33 
88/01 

79/35, 82/39 
105/28b 
87/57b 

94/46 

94/06b 

105/24 

95/73d 

108/28 

84/26e 

102/72, 103/52, 104/35 
107/09, 108/31 
87/53c 

99/29, 90/44c 
98/44 

93/82b 

78/21, 80/66, 81/69 
80/64 

79/25, 82/29 


79/11 


80/19a 

96/03, 97/34 

89/83b 

103/41, 104/17, 105/34a 
96/48a, 97/15 

102/78, 103/83, 104/84 


75/35, 76/68, 77/34 
101/06 
92/35c, 93/95a 
95/14 

93/28a 
80/34b 
81/04, 82/01, 83/62, 87/01 
104/78 
88/03c 

90/30 

92/14c, 93/90 
79/20, 89/57 
82/27, 87/06 
87/82 
96/100, 97/95, 98/94 
79/19 

93/18, 95/73e 
87/83 

87/84 

96/87 

81/010, 82/47 
87/80b 
98/77, 100/04 
89/49, 90/05 
81/06 

84/17 

82/04, 87/11 
104/74 
106/07 

81/30, 83/59 
106/08b 
82/18, 87/21 
81/02 

88/03a 
74/04d 
87/52a 

89/51 

82/16, 87/19 
82/47d 
76/04, 85/15 
84/21 
104/04g 
86/82 

82/59, 89/22 
79/58 

85/40 

87/77 


687 
29 


559/d 
73 
27 
Umbundu 
84 


114 
363 
380 (BF 543) 


91 
278 (BF 196) 


almost extinct 


Xhosa 
Booli 

Borana 

Borobo 

Bosaka 

Bosho 

Bosoko 

Bowiri 

Bowol [Bwol] 

Bozo 

Bria 

Bu-za-ngem 

Bua 

Bubalia [Beraku] 

Bubi 

Bubia 

Buda-Mono 

Budu 

Buduma 

Budya 

Budza 

Buga 

Buja 

Bujeba 

Buji 

Bukusu 

Bulba [Nootre] 

Buli (NC/Central Gur/ 
Oti-Volta/Buli-Konni) 

Buli (NC/Ubangi/South- 
West Gbaya-Manza- 
Ngbaka) 

Bulu 

Bum 

Bumaji 

Bunda 

Bura 

Burak 

Burhinyi 

Burji 

Burunge 

Busa 

Busam 

Bushong [Kuba] 

Butu-Ningi 

Buu-1 

Buu-2 

Buu-3 

Buwal 

Buyu 

Bveri 

Bviri 

Bwa 

Bwamu 

Bwari 

Bwende 

Bweru 

Bwiko 

Bwile 

Bwisi (NC/North-East 
Bantu/Nyoro-Ganda) 
(J10) 

Bwisi (NC/North-west 


Bantu/Shira-Punu) (B40) 


Byep 


C(h)ara 

Ca' 

Caka (NC/Central East 
Bantu/Makua) (P30) 

Caka (NC/South Bantoid/ 
Tivoid) 

Central Gbaya 

Central Idoma 

Central Igbo 

Central Ijo 

Central Karanga 

Central Nupe 

Central Yoruba 

Chag(g)a 


80/04d, 81/01e 
85/43b, 86/96b 
105/28e 
80/04k, 81/01y 
86/6h 

91/39c 

102/4a, 103/28 
96/78d 

104/80 

89/25 

95/24 

90/01 

90/44d, 99/30 
92/20 

92/16 

87/92 

79/15 

99/12 

78/20b 

82/28b 

74/54, 77/77 
79/27, 82/23 
88/35, 92/33 
97/46b 

86/35a 

102/63 


103/54, 104/39a 


91/39b 

92/40f 

95/42a 

95/73h 

84/01g 

98/104, 94/09, 100/11 
98/16 

80/46e 

85/42, 86/95 
76/70, 86/92 
100/27, 101/94, 102/76 
93/32, 95/40h 
81/49 

97/30 

85/3k 

85/3h 

85/3m 

94/43 

78/26, 80/59 
91/36c, 94/06c 
79/67, 89/77 
79/30 

104/40a, 105/35 
80/49 

83/03 

80/23a 

88/03d 

78/31a 


79/05 


83/23, 88/16 
87/57a 


96/44, 97/16a 
93/64f 


75/19 


95/12 

87/80 

101/60b 

101/83 

101/05, 102/75 
74/16, 75/42h 

97/65, 101/57, 102/35 
101/88, 102/30 
85/28, 86/6 


Mongo 


108/d 


Mongo 


Chagga 


Banda-Banda 


14 
302/d 
6 
7/d 


probably extinct 


Kuba 


46 


42 
21 (L2 22) 


Pokomo 
Pokomo 
Pokomo 


Bamileke 


Gbaya 


31 
236/d 
488 (BE 287) 


13 


7 


Idoma 


Igbo 
Ijo 


2,565/d 
975 
Yoruba 


1,150/d 


Chakali 

Chakfam-Moshere 

Chakosi 

Chala 

Cham-Mwana 

Chamba Daka 

Chamus 

Chavikwa 

Chawai 

Chedepo 

Chembara 

Cherepong 

Chewa 

Chewele 

Chikunda [Kunda] 

Chimba [Zemba] (NC/ 
South-West Bantu/ 
Hoanib) (R30) 

Chimba-Herero 

Chip 

Chokobo 

Chokwe 

Chomo(-Karim) 

Chonyi 

Chopi 

Chuka 

Chumbuli 

Chumburung 

Chwabo 

Chwaka 

Ci-Banda 

Ci-Mongo 

Ciinaluul 

Ciley 

Cinda 

Common Somali 

Comorian (total) (NC/ 
North-East Bantu/ 
Swahili-Oid) 


Crioulo 


Cuvok 
Cwa 


Daba 

Dabida 
Dafo-Batura 
Daga-Nyonga 
Dagaari 
Dagana 
Dagbani 
Dahalo 
Dahomey Ful 
Dai 

Daju 

Dakpa 

Daloa 

Dama 

Dan Muure 
Dan [Gio] 


Dangla 

Das(s) 

Dasenech 

Datoga 

De(Woin) 

Debaba 

Defaka-Nkoro 

Deg(ha)/Mo 

Degema 

Degodia 

Delo [Ntrubo] 

Delta Igbo 

Dendi (NS/Songhay) 

Dendi (NC/Ubangi/ 
Ngbandi) 

Dengfap 


103/71, 104/53 
96/73 

102/27, 103/09, 104/70 
102/51, 103/64 
98/06 

91/47, 95/72a, 98/51 
86/76 

84/33 

96/06, 97/25b 
106/12a 

105/59c 

102/26, 103/19 
75/22, 76/59a, 77/23 
85/2a 

74/21, 75/31, 77/21 


74/44a, 84/35 
96/75 

97/32 

77/68, 84/41, 81/77 
95/85, 96/54, 98/61 
85/6 

75/44 

85/33, 86/14 
102/15 

102/21, 103/13 
75/29 

85/12a 

83/06b 

83/06a 

78/24c 

78/24b 

100/17, 101/47a 
85/41a 


75/2-5 


108/C 


94/40 
80/65 


94/45 

85/25 

96/92 

93/100, 95/96a 

103/40, 104/16, 105/34a 
99/14 

102/68, 103/48, 104/23 
85/38 

102/4 

89/55, 90/08 

89/61, 90/40 

89/08 

105/24b 

91/24 

91/05a, 95/97b 

105/66, 106/3, D 


90/57 

96/96, 97/87, 98/98 
86/97 

86/89 

106/16 

90/73a 

101/01 

103/74, 105/37 
101/61b 

85/D 

102/49, 103/63 
101/85 

102/81 


82/46, 89/41 
93/64c 


42 
Grebo 
1,043/d 
70 
6,776 


288/d 


1,300 
143/d 
920 
117 

2 
50/d 
1,064/d 


Kongo 
Kongo 
Songye 
Songye 

35 
12,300 


760 
(Comoros 597) 
405 
(Guinea Bissau) 
178 (L2 762) 


1,582/d 


621 


1,476 
(Côte d'Ivoire 856) 

48 

11 


12 (Ghana 8) 
Igbo 
36 


14 (CAR 12) 
Bamileke 
Denje [NA] 
Denkwop 
Denya 

Dera 

Dera [Kanakuru] 
Dghwe'de 
Dhopaluo [Chopi] 
Dia 

Dibo 
Dibumum 
Dida 
Didinga 

Digo 

Dii 

Dimba 
Dimbambang 
Dimbong 
Ding 
Diola-Fogny 
Dirim 

Dirya 

Disa 

Divenie 
Dodos [Dodoth] 
Doe 

Doemak 
Dogo Dum 
Dogon 
Dogos(y)e 
Doka 

Doko 
Doko-Yumba 
Dombondola 


Dondo 

Dong 

Donno So 
Dooyaa(ng)yo 
Dotanga 
Duala 


Dubulu 
Dugbo 
Duguri 
Duguza 
Duka 
Dulbu 
Duleri Dom 
Duli (disappearing) 
Duma 
Dumbule 
Dungi 
Durbeye 
Duruma 
Duupa 
Duwai 
Dwentza Ful 
Dyakanka 
Dyan 

Dye 
Dyimini 
Dyula 
Dzodinka 
Dzunza 


East Bisa 

East Daju 

East Ejagham 

East Kare 

East Kesherda Daza 
East Massina Ful 
East Mbum 

East Nyala 

East Oroko 

East Teke 


Eastern Huang [+H64] 


Ebira 
Ebkuo [Kuo] 
Eboi 


89/73, 90/41h 
93/62e 
93/28, 95/35 
98/71 

91/78, 98/83 
94/23 

86/60, 79/73 
81/08 
101/50 
93/94 
105/22 
86/59 

76/02, 85/12 
90/20, 91/01 
84/32 
92/39, 93/93 
93/79 

81/39 
108/12 
91/46, 95/72b 
97/77, 98/88 
89/65 
83/22a 
86/72 

76/06, 85/17 
96/78b 
104/04j 
104/04 
104/59, 105/46 
96/22, 97/04 
93/11 
82/34c 
84/36f 


83/11 

98/57 
104/04n 
91/08, 98/26c 
93/82c 

92/14 


83/22c 
105/28a 
96/59, 98/47 
96/10, 97/49 
100/24, 102/41 
97/57, 98/46 
104/04e 

91/15 

83/42, 88/44 
92/06, 93/69b 
97/39 

91/36b, 94/06a 
85/11 

91/02 

100/09 

104/D 

106/18f 


103/77, 104/62b, 105/44 


102/58 

105/56 

104/77, 105/62d 
95/46 

85/31 


77/34b 
79/68 
93/22a 
90/12a 
99/23 

104/E 
90/09, 91/27 
86/35d 
92/26, 93/83 
81/25, 83/30 
74/61 
101/51 
95/42d 
83/34a 


Sara 
Bamileke 
20 


124/d 

236 
90/0 

320 (Kenya 256) 
66 


Dogon 
672 (Mali 517) 
38 
<1 
<1 
Ovambo- 
Kwanyama 
6 
Dogon 
26 


52/d 
(L2 1,703) 


24 

23 

3 
101/d 
<1/o 

Dogon 

probably extinct 

12/d 

6 

<1 

291 

7 


3,202 
15 
Mi-Gangam 
116/d 
1,250/d 
4 
Pokomo 


Bisa 

Daju 

Ejagham 
104 

Daza 

Massina Ful 


Ebrie 
Echie 
Ediamat 
Edo [Bini] 
Efe 

Efik 


Efium 
Efutop 
Efutu 
Ega 
Egan 
Egbema 
Eggon 
Ehabo 
Ehanda 


Ehow 

Ejagham 
Ekajuk 

Eki 

Ekit 

Eko 

Ekombe 

Ekota 

Ekpanu [Logba] 
Ekpeye 


Elem (NC/East Benue- 
Congo/Ogoni Cross) 


Elem-Ogoi 
Elmolo 
Eloyi 
Elung 
Eman 
Embu 
Enenga 
Enga 
Engeni [Egene] 
English 
Enow 
Enya 
Enyele 
Enyom Igbo 
Eotile 
Epie-Atisa 
Erei 

Eruwa 
Esan 

Esel 
Esimbi 
Esinga 


Eton 

Etono 

Etsako [Yekhee] 
Etulo 

Etung 
Eunda-Kolonkodhi 


Evale 


Evand 
Ewaku 
Ewe 
Ewondo 
Eza 


Fa' 
Fali 
Fam 


Fanakalo (NC/South-East 


Bantu Pidgin) 
Fang(-Beti) 
Fanian 
Fanian (and Mana?) 
Fante 
Fe'Fe' 
Fer 
Fipa 


105/09 

101/83c 

108/15 

101/63 

79/63b 

92/01, 93/01, 101/17 


95/78c, 101/26b 
93/26, 95/27 
103/21 

105/07 

101/52 

101/81a 

95/86, 96/38 
93/881 

84/36d 


93/88d 
93/22, 95/34 
95/31 


87/70c, 91/56b, 92/40i, 93/98 


93/02, 101/20 
101/90a, 102/31c 
92/26a, 93/83a 


79/41c, 80/04), 81/01x 


102/12, 103/27 
101/85a 


101/15 

86/E 

95/88, 96/67b 
93/84b 
95/11 

85/32, 86/8 
88/04 

82/03, 87/10 
101/61a 
74/E 

93/88e 
79/23a 
82/28, 87/07 
101/84 
105/11 
101/61c 
101/29b 
101/62a 
101/64 
87/53a 
95/09 
84/36g 


92/40h 
101/29f 
101/65 
95/92 
93/23, 95/26 
84/36h 


84/36e 


95/05 

82/19 

102/02, 103/37 
92/40g 
101/78c 


93/63b 
94/05 
95/71 


98 
Igbo 
22 
1,390 
25 
424/d 
(L2 2,544) 


Igbo 

177/d 
Ovambo- 
Kwanyama 


33 

6/d 
probably extinct 

29 


<1 


390 
1 
33/d 
Ovambo- 
Kwanyama 
88 
315 
11 
Ovambo- 
Kwanyama 
Ovambo- 
Kwanyama 
11 
3,150 (L2 3,810) 
1,622 
299 


Bamileke 
93/d 
2 
1,202 


87/70, 88/43, 91/56a, 92/40+40a 4,730 


90/06 

89/47 

102/29a, 103/02 
93/63 

89/74 

76/25, 77/73c, 78/39 


2 


5,203/d 
166 


Fon 102/01 1,738 Giziga North 94/38a 31 
(Benin 1,694) Giziga South 94/38b 78 
Frafra-Nankani [Gurenne] 103/38 700 Glebo 105/28d Grebo 
Fu'da 93/62b Bamileke Glio-Ubi 105/29 8 
Fu'sap 93/621 Bamileke Goba [Mozambican 
Fuliro 80/25+25a 309 Korekore] 77/57a 111/d 
Fumu 81/27, 83/26 - Godie 105/25 31 
Fumuwumbu 81/62 - Goemai 96/83, 98/86 242/d 
Fundi [Shirazi] 85/1g 17 Gogo 76/16 1,612 
Fur [For] 89/80 3 Gokana 101/16b 157 
Furi 103/70, 104/45 - Gola 106/02, G 136 
Furu 82/79, 89/78 15 Golo 79/34, 82/38 - 
Futa-Jalon Ful 107/01b 3,683 Gonge 90/18 - 
(Guinea 3,239) Gonja 102/20, 103/11, 104/72, 105/20 280/d 
Fwe 74/24, 77/26 9 Goyi 80/45a - 
Fyam [Fyem] 96/50a, 97/51 27 Grebo 105/28, 106/12 331 
Fyer 96/86a, 97/74 4 Gu'de 94/48 10 
Gubi 97/56, 98/45 1 
Ga 102/14, 103/35 363 Gudu 91/76, 98/107 7 
Ga'an(d)a 91/70, 98/102 53/d Guduf 94/22 42 
Gabra 85/43a, 86/96a 51/d Gulai 90/44a 192/d 
Gabri 90/52 30/d Gulu [Gele/Fongoro] 89/76 2 
Gadang 90/50 3 Gungu 79/03a 32 
Gade 101/53 49/d Gure-Kahugu 97/43 10 
Galambi 97/72, 98/76 24 Gurma [Nchema] 102/53, 103/58, 104/29 824 (BF 619) 
Galwa 88/06 13 Gurmana 101/46 5 
Gamargu 94/19a, 99/02a 12 Guro 105/68 402 
Gamba 90/41f - Guruntum(-Mbaaru) 96/93, 97/85, 98/90 5 
Gan 104/57, 105/46a = Gusii 86/53 1,867 
Ganda 84/26c Umbundu Gusilay 108/13 21 
Gangam 102/59 = Gusu 96/12, 97/46c 16/d 
Gangela 77/71a - Gwa (NC/South Bantoid/ 
Ganja [Ja] 107/11, 108/21a = Jarawan) 97/55 1 
Ganyoa 105/24d 216 Gwa (NC/Kwa/South 
Garreh 85/43d 113 Guang) 102/24, 103/18 39 
Gavar 94/44 7/d Gwa (NC/Central-West 
Gaya 86/30c - Bantu/Bushong) (C90) 81/51 - 
Gbagyi 96/68, 101/54 881/d Gwandara-Gari 95/100, 96/71a, 101/96 59 
Gbaka-Manza 89/37 31 Gwano 85/3b 3 
Gban [Gagu] 105/70 45 Gwari-Yama 97/66, 101/55 381 
Gbanu 87/80a, 89/38, 90/34 106 Gwe 86/40 51 
Gbanzili [Buraka] 82/74, 87/73, 89/03 20/d Gwembe 74/26, 77/24d - 
Gbaya 87/88, 91/40a NW 310, SW 183 Gweno 85/23, 86/3a = 
Gbaya (total) (NC/Ubangi/ Gwere 86/38 351 
Central Gbaya-Manza- Gwom(um) 98/18 - 
Ngbaka) 1,654 Gworam 96/78e 
Gbaya Kara 90/36, 91/38b - Gyando 82/13, 87/16 - 
Gbaya-Biyanda 87/90, 91/39d - Gyem 97/28 1 
Gbaya-Bokoto 90/33 190 
Gbaya-Ngombe 87/93, 91/40b 2 H(y)am [Jaba] 96/24, 97/07 118 
Gbaya-Toongo 90/37, 91/39a - Ha 78/45, 86/54, 80/34 968 
Gbaya-Yangere 87/91, 91/39e - Hadad 99/23a - 
Gbede 101/89b Yoruba Hadza [Tindiga] 86/101 <1 
Gbee [Gbii] 106/15 8 Hai-//'om [San] 74/46a, 84/53 18/d 
Gbende 82/61, 89/31 5 Hamba Sambala 80/71b Tetela 
Gbete 87/72, 91/31 - Hamer 86/99 - 
Gbeya 89/39, 90/29 - Handa (NC/Gur-Adamawa/ 
Gbwata [Bata] 91/66, 94/55, 98/108 176/d Waja-Jen/Waja/Mboi) 98/12c - 
Gcaleka 74/04b Xhosa Handa (NC/South-West 
Gciriku [Dciriku] 84/25b 35 Bantu/Umbundu) (R10) = 84/29b Umbundu 
Ge [Mina] 102/02a 415 (Togo 255) Hanga [Yaka] 80/41 10 
Geji 97/88, 98/97 7 Hanga-Kamara 103/51, 104/26 7 
Gelvaxdaxa 94/20 39 Hangaza 80/36 39 
Geme 89/05 <1 Hanya 84/26h Umbundu 
Gengle 98/31 - Havu 80/45 511 
Gera 97/69, 98/73 230/d Haya 86/26, 80/43 1,524 
Gerka [Yiwom] 96/85, 98/87 1 Hehe 76/43 885 
Geruma 97/71, 98/75 7 Hemba 77/47, 78/14, 80/54 146 
Gey (disappearing) 91/34, 94/03 probably extinct Her 108/17 12 
Ghomala 93/62 390 Herero 74/44 157 
Ghotuo 101/67 11 Higi [Kamwe] 94/13 372 
Gichode 102/23, 103/16 9 Hijuk 92/13, 93/76 <1 
Gichugu 86/9a - Himba (NC/South-West 
Gikuyu 85/37, 86/9 6,309 Bantu/Hoanib) (R30) 84/34 - 
Gimine [Gimnime] 91/10, 98/25f 4 Himba (NC/North-West 
Gimme 91/09, 98/25e - Bantu/Tsogo) (B30) 83/56, 88/20 4 
Gingwak 96/61, 97/61, 98/49 38 Hinga 84/29g = 
Giriama 85/4 585 Hlengwe 74/10, 75/46 6 
Gisu 86/37 Masaba Holoholo 78/35, 80/17 25 
Holu 

Hom 

Homa 

Horom 
Humbe 
Humbu 
Humburi Senni 
Hunde 
Hungan(a) 
Hungu 
Hurutshe 
Hwana [Hona] 
Hya 

Hyanzi 


Iambi 

Iba 

Ibami 

Ibani 

Ibeto 

Ibibio 

Ibie 

Iboko 

Ibuno 

Ibuti 

Icen 

Iceve 

Idaitsa 

Idakho 

Ido(ng) 

Idoma 

Ife 

Igala 

Igbo Savannah Cluster 

Igede 

Igembe 

Igoji 

Ihabe 

Ihatum 

Te 

Ijesha 

Tuma 

Ik [Teuso] 

Ika 

Ikale 

Ikizu 

Ikogo 

Ikom 

Ikom-Olulumo 

Ikoma 

Ikpeshi 

Iku-Gore-Ankwa 

Ikulu 

Ikwere 

Ila 

Ilande 

Ilwana 

Imenti 

Imoma 

Imoma-Mpongo 

Inamwanga 

Ipulo 

Iraqw [Mbulu] 

Irigwe 

Iro Gula [Kulaal] 

Isanzu 

Isoko (NC/West 
Benue-Congo/South 
Edoid) 

Isoko (NC/North-West 
Bantu/Mongo-Nkundo) 
(C70) 

Isu 

Isukha [Kakamega] 

Itsa 

Itsekiri 

Iwa 

Iyive 

Iyoniyong 


84/10 
93/62g 
79/18, 89/58 
96/50 
84/29e 
81/21, 83/52 
104/1d 
80/47 

81/63 

84/08 
74/07d 


91/71, 98/103, 94/08 


94/14 
80/19b 


86/19 

87/53k 
93/10b 
101/02b 
100/19a, 101/38b 
101/18 
101/70 
82/09, 87/13 
93/03, 101/22 
79/11a 
95/82c 

95/03 
102/31a 
86/35k 
96/23, 97/03a 
95/94, 101/60 
101/88a 
96/69, 101/93 
95/95 

95/89 

85/34a 
85/35, 86/11 
93/88j 

95/10 
102/31b 
101/88b 
101/89c 
86/55 
101/82b 
101/91b, 102/33a 
86/48 
80/04g, 81/01u 
93/14 

95/77 

86/49 

101/72 
96/11, 97/105 
96/21, 97/06 
101/85b 
77/29 

78/19 

85/2 

85/34c 
81/01k 
80/04e, 81/01b 


76/52, 77/11, 78/32 


95/13 
86/90 
96/43, 97/16b 
89/52 
86/18 


101/62b 


81/01f 

92/15 

86/351 

102/32b 
101/91a, 102/33b 


75/23, 76/57, 77/12 


95/04 
93/10, 101/25 


52 
Bamileke 
probably extinct 
2 
168/d 


Songhay 
300 
<1 
202 
Tswana 
40 


72 


91 

<1 

1,143 
202 (Togo 97) 

968 

18,540 

318/d 
31 


25 
Yoruba 
Yoruba 

7 
Igbo 

39 

Mongo 


Izarek [Zarek] 
Izi 


Jaku 

Jalkuna 
Jamsay Tegu 
Jan(jo) 

Jara [Jera] 
Jegu 

Jelgolgi Ful 
Jera 

Jere 

Jetko 

Jeto 

Jibana [Dzihana] 
Jibu 
Jidda-Abu 
Jie 

Jieng [Dinka] 
Jiji 

Jimbin [Zumbun] 
Jimi 
Jimjimen 
Jina 

Jinja [Zinza] 
Jipal 

Jiru [Kir] 

Jita 

Jluko 

Jogo [Ligbi] 
Jomvu 
Jonkor 
Jorto(1) 

Ju 

Juang 
Junguru 

Jur Lwo [Lwoo] 


Ka'da 

Kaado 

Kaang 

Kaba 

Kabalai 
Kabare 
Kabende 
Kabras 

Kabre [Kaliye] 
Kacicere [Kuchichere] 
Kadara 
Kafanchan 
Kafima 


Kagbo 

Kagoma [Gyong] 

Kagoro 

Kaguru 

Kahe-Okuma 

Kajakse 

Kaje [Jju] 

Kakaa [Kuri] 

Kakanda 

Kakihum 

Kako 

Kakolo 

Kakongo 

Kakwa 

Kala 

Kalabari 

Kalanga (NC/South-East 
Bantu/(S10) Shona) 

Kalanga (NC/North-East 
Bantu/Lega-Kalanga) 
(D20) 

Kali 

Kalindi 

Kam 

Kamazung 

Kamba (NC/North-East 
Bantu/Thagicu) 


96/42, 97/16c 
101/78a 


96/63, 97/53, 98/39 
104/76 
104/04r 
98/14 

98/101 
89/82c 

104/J 

96/13, 97/46 
96/14, 97/46a 
99/24 

82/60, 89/35 
85/7 

91/42, 95/79b 
96/34 

86/71 

79/77 

78/44, 80/33 
97/80, 98/89 
95/101, 97/92 
94/49 

99/07 

86/25, 80/42 
96/78f 

98/62 

86/44 
105/26d 
103/84 

85/1f 

89/82b 

96/84 

97/83, 98/96 
102/19, 103/10 
79/55d, 89/18 
79/75 


91/77, 94/59 

104/1c 

91/37c 

89/72, 90/42 

90/53 

80/46a 

77/09 

86/35b 

102/50, 103/67, 104/49 
96/46b, 97/20 
96/19, 97/01, 101/33 
96/30f, 97/18 
84/36c 


105/26e 
96/25, 97/08 
96/46e, 97/23 
76/15 

86/6g 

89/83a 

96/45, 97/17 
90/70a, 99/12a 
101/56 
100/19b, 101/38c 
87/45, 91/60 
107/28f 
84/01a 
86/67b, 79/70 
81/01h 

101/03 


74/13, 77/58 
78/35a, 80/17b 
91/28 

85/3n 

98/36 

95/17 


85/29, 86/7 


61 
1,157 


55 
169 
784 (Togo 742) 
96 
Katab 
Ovambo- 
Kwanyama 
8 
12 


i. 
416 


27/d 
17/d 
94 
Manding 
Kongo 
169 
Mongo 
572 


388 


Pokomo 


3,258 
Kamba (NC/Central West 
Bantu/Kongo) (H10) 

Kambari 

Kambe 

Kami 

Kamo 

Kamuku 

Kana 

Kanam 

Kande 

Kanembu 

Kaningi 

Kanufi 

Kanyok 

Kaonde 

Kara 

Karaboro 

Karamojong [Karimojong] 

Kare 

Karekare 

Karfa 

Kari 

Kariya 

Karon 

Kasa 

Kasele 

Kasem 

Kasenga 

Kasunga 

Katab 

Kauma 

Kavango 

Kaziba 

Keaka 

Kebu 

Kel 

Kela 

Kele (NC/North-West 
Bantu/Poke-Kele) (C60) 

Kele (NC/North-West 
Bantu/Kele-Mbete) (B20) 

Kelwer 

Kendem 

Kenga 

Kenge 

Kengin Gyee 

Kenswei Nsei 

Kenyang 

Kenyang Lower 

Kenyang Upper 

Kenyi 

Kera 

Kerewe 

Kete 

Ketere 

Keyo 

Kgatla 

Khayo 

Khiri 

Khoe 

Khozam 

Khutswe 

Kiambu 

Kibalo 

Kibangu 

Kibet 

Kiga [Ciga] 

Kilba 

Kim 

Kimbu 

Kimbundu (total) (NC/ 
Central West Bantu/ 
Kimbundu) (H20) 

Kimongo 

Kinakomba 

Kinda 

Kinga 

Kingi-Ntansye 

Kintswen 


83/14 

96/16, 97/50 
85/8 

76/09 

98/03 

100/15 
101/16a 
96/65, 97/62, 98/41 
87/69, 88/21 
99/23 

83/33 

96/32c, 97/11a 
78/24 

77/53 

86/43 

104/07, 105/61 
86/70 

89/56, 90/12 
98/72, 100/02 
96/90 

79/17 

97/79 

108/10 

108/14 

102/55 
103/68, 104/43 
78/12, 80/55 
108/26 
96/34+34a 
85/5 

74/08 

80/46c 

93/22c 

102/05, 103/34 
81/9b 

80/04b, 81/01s 


79/36, 80/02 


83/43, 88/24 
81/32b 
93/29, 95/36 
90/44e 
83/13d 
81/11a 
93/42, 95/44c 
95/37 

93/29b 
93/29a 

86/33 

90/56, 94/60 
86/42 

78/25, 81/72 
81/11c 

86/87 

74/07h 
86/35n 

74/45 

84/52 
90/73c, 99/16 
74/05d 

86/9f 

97/36 

83/15b 

89/62 

80/31 

91/72, 98/106, 94/12 
90/19 

76/20 


83/09a, 88/53 
85/3c 

87/48a, 91/61 
76/46, 77/33a 
81/32c 
81/32a 


26 

81 
Kenyang 
Kenyang 

449 

54 

140 


280 
Tswana 
78 


North Sotho 


Kinuku 

Kiong 

Kipsigis 

Kir-Balar 

Kirfi 

Kiria 

Kirma 

Kisa 

Kisi (NC/South Atlantic/ 
Bulom-Kisi) 

Kisi (NC/Central East 


Bantu/Bena-Kinga) (G60) 


Kitimi [Tumi] 

Kituba 

Kitui 

Kiwo-Naruma-Vunjo 

Klao 

Ko 

Kobi 

Kobiana 

Kobo 

Kobo-Dii (NC/Gur- 
Adamawa/Leko-Nimbari/ 
Duru) 

Koenoem 

Kofa-Mogum 

Kofyar 

Kohno 

Kohumono 

Koji 

Koji-Budya 

Koke 

Kolena 

Kolo 

Kolokuma 

Kom 

Koma 

Kombe 

Kome 

Komono [Khisa] 
Kona 
Konda (NC/Central-West 
Bantu/Tetela) (C70) 
Konda (NC/North-West 
Bantu/Mongo-Nkundo) 
(C70) 

Kongo (NC/North-West 
Bantu/Mongo-Nkundo) 
(C70) 

Kongola 

Koningkom 

Konjo 

Konkomba 

Konni 

Kono (NC/East Kainji/ 
Kaura) 

Kono (NC/Central West 
Mande/Kono-Vai) 

Konongo 

Kony 

Konyagi 

Konyanka 

Koozime 

Korandebo 

Korekore 

Koring 

Koro 

Korop 

Kosa 

Koshin [Koskin] 

Kota (NC/North-West 
Bantu/Kele-Mbete) (B20) 


Kota (NC/North-West 
Bantu/Ngando) (C10) 

Koti 

Kotokoli [Tem] 

Kotokoli 

Kotopo 


97/35 

93/09, 101/32a 
86/84 

97/94, 98/96 
98/67 

94/16 

104/61a, 105/41 
86/35h 


106/03 


76/45 
97/40, 101/48 


85/29b 

86/6e 
106/13a 
104/52 

80/30 

108/25 

91/12, 98/25d 


96/81 

89/88 
96/65+65a 
106/22 
101/28 
78/20a, 80/69 
78/20 

89/48, 90/07 
91/16 
101/10a 
101/05a 
95/42e 

91/14, 98/25b 
88/03, 92/22 
86/30a 
104/60, 105/47 
98/63 


78/02a, 80/71g 


81/03 


81/01i 

78/03, 80/73 

96/32a, 97/11c 

79/06, 80/40a 

102/56, 103/57, 104/31 
103/55, 104/39b 


97/37 


106/20 

76/22 

86/80 

107/07, 108/24 
106/18d 

87/56 

104/04d 
74/14, 75/42g 
101/26 

96/15, 97/09, 101/34 
93/07 

77/56 

95/23 


81/22, 82/49, 83/46, 87/63, 
88/28 


87/31a 

75/15 

90/70e, 94/56b, 99/13 
102/47, 103/65, 104/48 
95/97a 


1 
almost extinct 

1,200 

4 

20 


96 
651 (N 421) 


21 
1 
4,620 (L2 5500) 


372 


Kobo-Dii 


Tetela 


253 (L2 40) 
70 
26 


55 
66 
65/d 
410 (Togo 208) 
Koyo [NC/Kru] 
Koyo [NC/Bantu] 
Koyra Chiini 
Koyra Boro 
Kpala 

Kpalaga [Palaka] 
Kpan 

Kpasam 

Kpatiri 

Kpelle 


Krachi 

Kranh 

Kreda Aza 

Krim 

Krio 

Krobu 

Kru 

Kuang 

Kuba (NC/North-West 
Bantu/Mbosi) (C30) 

Kuba Group (NC/Central- 
West Bantu/Bushong) 
(C90) 

Kubi 

Kuda-Chamo 

Kugama 

Kugbo 

Kuka 

Kukele [Ukele] 

Kuku 

Kukwa 

Kulango 

Kulango-Nkuareng 

Kulere 

Kulesa 

Kulung [Wurkum] 

Kumam 

Kumba 

Kumu 

Kunabeeb 

Kunda (NC/Central Bantu/ 
Bisa-Lamba) (M50) 

Kunda (NC/Central-West 
Bantu/Luba) (L30) 

Kunyi 

Kuo 

Kupto 

Kurama 

Kuranko 

Kuria 

Kurike 

Kurumba 

Kusaal 

Kushi 

Kusu 

Kutep [Kuteb] 

Kuti Ambo 

Kutin 

Kutu 

Kuturmi 

Kuumbere 

Kuvale 

Kuwaa 

Kuya 

Kuzamani 

Kwa [Ba] 

Kwa' 

Kwaatay 

Kwadi 

Kwadia 

Kwakum 

Kwala 

Kwaluudhu 


Kwambi 

Kwame 

Kwami 
Kwandi-Kwangwa 


105/26c 

81/17, 83/37 
104/1a 

104/1b 

82/64 

104/65, 105/57 
95/82a 

98/34 

82/71, 89/06 
106/25, K 


102/22, 103/14 
105/31c, 106/11 
99/27 

106/04 

106/34 

105/12 

106/13 

90/55 


81/15, 83/35 


97/70, 98/74 
97/29 

98/35 

101/09 
90/44b, 99/28 
95/78a 
86/67a, 79/71 
83/28e 


103/79, 104/64, 105/48 


103/78, 105/49 
96/91 

85/31 

96/58, 97/63, 98/42 
86/64 

98/29 

79/16, 80/11, 81/45 
87/53b 


77/38 


77/46a, 78/17 
83/15, 88/56 
90/16, 91/30 
98/70, 100/07 
96/01, 97/33 
106/19, 107/30 
86/51 

106/08d 
104/40b 


102/69, 103/47, 104/22 


98/69 
80/7le 
95/84 

80/09, 81/43 
91/06, 98/26b 
76/10 
96/18, 97/02 
91/43b 
84/31 
106/17 
105/24a 
97/31 

98/37 

93/65 
108/11 
84/51 
105/26a 
87/47, 91/62 
81/16, 83/36 
84/361 


84/36k 

80/16 

98/79, 100/06 
77/62a 


1,010 
(Liberia 619) 
77/d 
136 
<1 
550 (L2 4,600) 
12 
Klao 


105 

191 

21 
Pokomo 

30 


387 
656 (Tanz. 497) 
168 
590/d 
13 


probably extinct 
13 
4] 
Ovambo- 
Kwanyama 
59 
<1 
13 
65 


Kwandu 
Kwangalak 
Kwangali 
Kwange 
Kwanja [Konja] 
Kwanka 
Kwanyama 
Kwanyoka 
Kwasio 

Kwaya 

Kwena 

Kweso 

Kwezo 
Kwo/Meru 
Kxoe 

Kyibaku [Cibak] 


La(a)di 

Labi (secret language) 

Lagwan [Logone] 

Laka (NS/Sara) 

Laka (NC/Gur-Adamawa/ 
Mbum-Day/Mbum/ 
Central/Karang) 

Laka-Kaba 

Laka-Mbum 

Lakota 

Lala 

Lali 

Lamang North 

Lamang South 

Lamba (NC/Gur-Adamawa/ 
Gur/Central/South/Grusi/ 
East) 

Lamba (NC/Central Bantu/ 
Bisa-Lamba) (M50) 

Lambia 

Lame (NC/South Bantoid/ 
Jarawan) 

Lame (AA/Masa Chadic) 

Lamja 

Lamnso' 

Landuma 

Lang’ 

Langa 

Langbasi 

Lango 

Larteh 

Laru 

Lefa' 

Lega 

Legbo 

Lehar 

Leisan 

Leke 

Lekongo 

Leku 

Lela 

Lelau 

Lele (NC/Central West 
Mande/Manding- 
Kuranko) 

Lele (NC/Central-West 
Bantu/Bushong) (C90) 

Lele (AA/East Chadic) 

Lelemi [Bwem] 

Lemoro 

Lempu 

Lendo 

Lendu [Kebu] 

Lenge (NC/South-West 
Bantu/Umbundu) (R10) 

Lenge (NC/North-West 
Bantu/Bobangi-Ntomba) 
(C40) 

Lenge (NC/North-West 
Bantu/Maka-Njem) (A80) 

Lengola 

Lenje 


77/62e 
96/78c 
74/37, 84/39 
80/64c 
95/64 
96/39, 97/12 
84/36a 
84/30 

87/60, 88/40, 92/31q 
86/45 
74/07¢ 
87/51 

81/66 

86/5 

74/55 

94/10 


83/12 

90/09a 

90/70b, 94/56a, 99/10 
90/41a 


91/32, 98/38 
90/41b 
90/10 
105/23a 
77/35 
83/28c 
94/25a 
94/25b 


102/45, 103/66, 104/46 


77/40 
75/41, 76/66, 77/44 


97/52 

90/61a 

91/48, 98/52 
93/41, 95/43 
107/21 

93/62a 

80/08, 81/44 
79/56a, 82/68, 89/14 
86/63 

102/25, 103/17 
101/35, 102/39b 
93/78 

80/14 

95/75b, 101/27 
108/04 

85/L 

87/33 

93/88a 

82/02, 87/09 
100/23, 102/40 
98/21 


106/19a 
81/55 


90/54 

102/08, 103/23 
97/26 

84/04b 

81/011 

79/60 


84/29a 


82/28a 


88/36 
79/23b, 80/05, 81/40 
77/28 


137 
Tswana 
117 
78 
10 
121/d 


111 
63 (Nigeria 47) 
72 


255 


221 (Togo 149) 
98 (Tanz. 48) 


13 

72 

195 

18 
Bamileke 


49/d (CAR 44) 
1,242 
47/d 
Sub-Saharan Africa 



Lenyima 

Lese 

Leti 

Leya 

Leyigha 

Li-Tamare 

Li-Tamare-Niende 

Liberian Settler 
English (IE/WG/ 
Creole) 

Liberian Pidgin English 

Libo 

Libobi-Likata-Lifonga 

Lifonga 

Ligbi 

Ligri 

Likile [Likila] 

Liko 

Likoka 

Likpa [Dompago] 

Likpe [Sekpela] 

Likungu 

Lima 

Limba 

Limbum 

Limi 

Linda 

Lingala 

Lino 

Liombo 

Lionje 


Liptako Ful 
Lo(o) 

Lobir(i) 

Lobo 

Lobola 

Lodiya 
Log(h)on 

Logo 

Loi [Ngiri] 
Loka(a) [Yakur] 
Loko 

Lokoli 

Lolue 

Loma 

Lombe 

Lombi 

Lombo 
Lomoto 
Lomotwa 
Lomwe 

Londo [Borondo] 
Longo 

Longto 
Longuda 
Lonzo 

Looliya [Dadiya] 
Lopa 
Losakanyi 
Lotapa 
Lotsu-Piri 
Lotuko 

Lozi 

Lozwa 
Lu(u)mbu 
Luano 

Luba Katanga 
Luba Shankali 
Luba-Lulua 
Lubila 

Luchazi 
Ludima 
Luganda [Ganda] 
Lugbara 

Lugoli [(Ma)Ragoli] 
Luguru 

Luhya 

Luimbi 


93/15b, 95/75c 

79/63a 

92/09, 93/71 

74/27, 77/24b 

93/15c, 95/75a, 102/71 
102/71 

103/53, 104/36 


106/33 


98/13 

82/20 

82/21, 87/22 

105/63 

98/40 

79/24, 82/31 

79/22 

82/25, 87/25 

102/46, 104/47 

103/24 

82/34a 

77/41 

106/01, 107/23 

95/45 

80/24 

89/32 

82/11, 87/04 

82/50, 87/52 

79/37a 

79/41b, 80/04, 81/01w, 
82/47a 

102/L, 104/L 

98/17 

103/76, 104/62a, 105/43 

82/12, 87/15 

82/24, 87/26 

80/71a 

103/80, 104/63b, 105/51 

79/65e 

82/06, 87/03 

93/13c, 95/76a 

106/29, 107/31 

93/13b, 95/76b 

93/83d 

106/26, L 

92/38, 93/96 

79/62, 80/76 

79/39 

79/37b 

77/52, 78/08 

75/18 

92/25a 

81/01m 

91/07 

91/21, 98/09 

81/57, 83/17 

98/08 

100/22, 101/36, 102/39a 

81/07c 

80/71c 

98/07 

86/68 

77/159 

105/23c 

83/24, 88/12 

77/37 

77/48, 78/05 

78/15a 

78/13, 80/53, 81/73 

93/13a 

77/70, 84/44 

83/14a, 15a, 88/55 

86/32, 79/02, 80/20 

86/58, 79/65b 

86/35q 

76/11, 85/20 

86/35 

84/42 


152 


83 

(L2 1,980) 
13 
4 


19 


205 
(L2 7,210) 


Mongo 


3 
454 (BF 328) 


Tetela 


3,449/d 
87 
Mongo 
4 
62 


1,770 


8,450 
356/d 

Kongo 

3,479 (L2 8,900) 
663/d 
302 
629 
4546 
39 


Lulua 

Lumbu (NC/Central- 
East Bantu/(M60) 
Lenje-Tonga) 

Lumbu (NC/Central- 
West Bantu/Songye) 
(L20) 

Luna 

Lunda-Ndembu 

Lunda-Ruund 

Lundwe 

Lungu [Idun] 

Luntu 

Lunu 

Lunwinja 

Luo 


Luri 

Luunda 

Luvelu 

Luyana 

Lwel 

Lwena [Luvale] 
Lyele 


Ma'a [Mbugu] 
Ma'da 
Maasai 


Maay 

Maba 

Mabale 
Mabas 

Mabi 
Mabo-Barkul 
Machame 
Machinga 
Mada 

Madi 
Madimba 
Madingu 
Maele [Malele] 
Mafa 


Mafia 
Maha 
Mahi 
Mahongwe 
Mahwa 
Maja 
Majera 
Majingay [Sar] 
Makere 
Makina 
Makoma 
Makonda 
Makonde 


Makua 

Makunduchi 

Malaba 

Malalulu 

Malgbe 

Malila 

Malinke (total) 
(NC/Central West 
Mande/Manding) 

Mama 

Mamba 

Mambay 

Mambila 

Mambwe 

Mamenyam [Bamenyam] 

Mampruli 

Manda 

Mandinka 

Manengabo 

Manganja (Malawian 
Nyanja-Chewa) 


78/13a 


74/28, 77/32 


78/28, 80/60 

78/23, 80/68, 81/70 
77/54, 84/23 

77/55, 78/01, 84/24 
74/29, 77/30 

96/27, 97/10 
78/21a, 80/70, 81/68 
78/10, 80/57 

80/46d 

86/66 


96/103, 97/96 
77/04 

74/05b, 75/50a 
74/35, 77/62, 84/37 
81/32 

77/69, 84/45 
104/42 


86/93 
94/34 
76/71, 85/44, 86/74 


85/41b 

89/63 

82/10, 87/14 
94/24 

88/38, 92/31a 
96/49 

86/6d 

76/34 

96/30 

86/57, 79/65a 
83/09b 
83/14b 
79/61c 

94/42 


76/01f 
98/68, 100/05 
102/01a 
87/65 
89/82a, 90/58 
93/78c 

99/08 

89/69, 90/41e 
79/61d 
87/59, 88/41 
77/62d 
84/26g 
75/09, 76/38 


75/16, 76/41 
76/01c, 85/1m 
75/09a, 76/38a 
85/3f 

90/68, 99/03 
76/53, 77/18 


96/64 

78/35b, 80/17a 
91/33b, 94/02b 

95/62 

76/24, 77/73b, 78/41 
93/49, 95/53 

102/67, 103/49, 104/24 
76/64 

107/29, 108/35b 
93/88f 


75/22a 


Luba 


4,033 
(Kenya 3,758) 
62 
North Sotho 
163 
66 
433/d 
272/d 


46 

30 

1,068 
(Tanz. 548) 


189 


221/d 


Luyana 
10 
1,573 
(Tanz. 1,150) 
7,397 


52 
74 


2,362 
46 
Holoholo 
5/d 
Sub-Saharan Africa


Language 
[alternative name] 


Map no./language no. 


Number of speakers 
(in thousands) 




Mangas(-Margi) 
Mangba 
Mangbetu 


Mangbutu 
Maninka 
Maninkaxanwo 
Manjanku (?) 


Mankanya [Bola] 
Mankon 

Mano 

Manta 

Manya 


Manyanga 
Manyika 


Manza 

Maore 

Mapeo 

Mapopoi [Popoi] 

Marachi 

Marakwet 

Marba 

Mare 

Margi North 

Margi South 

Marka 

Marke 

Masa 

Masaba 

Masaka [Machakos] 

Masama 

Masenzu 

Mashi 

Maslam 

Masoanka [Sua] 

Matakam 

Matal 

Matengo 

Mathira 

Matumbi 

Mau(ka) 

Mawenzi 

Mawia 

Mayogo 

Mba (NC/Ubangi/Sere- 
Ngbaka-Mba) 

Mba (NC/Mbam-Nkam/ 
Nkambe) 

Mbagani 

Mbai 

Mbai-Diabe 

Mbala (NC/Central-West 
Bantu/Mbala) (K60) 

Mbala (NC/Ubangi/Mid- 
South Banda) 

Mbalantu 


Mbamba 
Mbangala 
Mbangi 
Mbangwe 
Mbanja 
Mbanjo 
Mbanza 
Mbanza-Manteke 
Mbata 
Mbati 
Mbato 
Mbause 
Mbe 
Mbedam 
Mbeere 
Mbeku 
Mbele 
Mbelo 


96/102, 97/93, 98/95 
82/40, 87/40 
79/61a 


79/63, 63e 
106/18e, 107/28d 
107/28e 

108/18 


108/19 
93/51, 95/58 
106/32 
93/21, 95/06 
106/18g, M 


84/01d 
74/17, 75/42e 


89/37a, 90/38 
75/04 

91/50, 98/54 
79/61e 

86/35f 

86/88 

90/64, 94/63 
88/03b 

94/11a

94/11b 

104/75 

91/08a 

90/62, 91/68, 94/65 
86/36 

85/29a 

86/6c 

83/22b 

77/64

90/70c, 99/09 
108/32 

94/32 

94/28 

75/38, 76/62 
86/9c 

76/30 

105/62c, 106/18a 
86/6k 

75/09b, 76/38b 
79/48 


79/52 


95/49 
81/52 
89/67, 90/41c 
90/32 


81/67, 83/18 


89/23 
84/361 


81/23, 83/32 
84/11 

79/54, 89/40 
83/45, 88/27 
84/36m 
92/02b, 93/19d 
82/70, 87/96, 89/36 
84/02, 83/16 
84/04c 

87/32 

105/08 

81/11b 

95/38 

94/47 

85/31 

84/04d 

89/09, 90/23 
81/01q 


<1
975 
(DRC 914) 
17 
1,228 
Manding 
268 
(Guinea Bissau 
160) 
65 


87 
(Liberia 59) 
Kongo 

924 
(Zimb. 712) 
246 


330 


Ovambo- 
Kwanyama 
98 


Banda-Mbres 
Mongo 


Mbembe (NC/East Benue- 
Congo/Upper Cross) 
Mbembe (NC/East Benue- 


Congo/Central Jukunoid) 


Mbere 

Mbesa 

Mbeti 

Mbi 

Mbilia-Nkamba 

Mbo (NC/North-East 


Bantu/Bira-Huku) (D30) 


Mbo (NC/North-West 
Bantu/Lundu-Balong) 
(A10) 

Mbodomo 

Mbogno 

Mboi 

Mboka 

Mboko 

Mbole 

Mbombeleng 

Mbombo 

Mbong'a 

Mbonge 

Mbonji 

Mbonzi 

Mbonzo 

Mboshi 

Mbowe-Milonga-Liyuwa 

Mbu(u)n 

Mbugwe 

Mbui 

Mbuko 

Mbukushu 

Mbula-Bwaza 

Mbulu 

Mbulungish 

Mbunda 

Mbunga 

Mbwale 

Mbwas 

Mbwela [Lukolwe] (NC/ 
South-Central Bantu/ 
Nkoya) (L50) 

Mbwela [Mashi] 
(NC/South-Central 
Bantu/Kwangwa) (K40) 

Mebeza 

Medumba 

Mefele 

Megaka 

Meje 

Meka(a) 

Mekambo 

Meke 

Mekwob 

Mende 

Menka 

Merey 

Mern(y)ang 

Meru 

Mese 

Mesme 

Meta’ 

Mezime 

Mfinu 

Mfumte 

Mgbo 

Mi-Gangam 

Mi-Gangam-Dye 

Migili [Jijili/Lijili] 

Mikaya 

Miltu 

Mini 

Minianka 

Minungu 

Mirembe 

Mis(s)ong 

Mituku 


93/15a, 95/75d 


95/81 

81/24, 83/31, 87/67 
79/40, 82/43 
92/40b 

89/13 

81/01) 


79/13 


92/29, 93/88 
87/89 

95/65 
98/12a, 94/04 
83/08 
83/39a, 87/43 
79/23c, 80/06, 81/41 
87/85 

83/13c 

91/52 

93/83b 
82/05, 87/02 
83/34d 
82/26, 87/05 
81/14, 83/34 
77/62b 

81/38 

86/23 

84/27 

94/26 
77/64a, 84/40 
91/55, 98/43 
82/63, 89/28 
107/05 
77/71, 84/46 
76/31 

93/881 
87/57c 


77/165 


84/38 
87/53n, 88/39 
93/48 

94/41 

93/57 

79/61b 

87/57, 91/64 
87/54 

87/70a, 88/43e 
87/53} 

106/28 

93/38, 95/40e 
94/36 

96/77 

85/34, 86/10 
93/66b 

90/66, 94/61 
93/36, 95/40b 
87/53h 
81/36, 83/58 
95/47 
101/78b 
102/57, 103/60 
104/32 

95/87, 96/51 
87/55 

89/81b, 90/47 
101/14 
104/05, 107/34 
84/41a 
78/24a 

95/25 

80/07, 81/42 


143 


77 (Cam. 50) 
133 


2,300 
Banda-Mbres 
Mongo 


99 


208/d 


Miutini 
Miya 
Miyengen 
Miyobe 
Mlomp 
Mmaala 
Mmani 
Mmem 
Mmetho 
Mo(o)re 
Moba [Bimoba] 
Mober 
Modele 
Mofu North 
Moghamo [Widekum] 
Mokiba 
Mokpwe 
Mokulu 
Moliba 
Molokwo 
Mombasa 
Mombo 
Mongo 


Moni [Mwa(n)] 
Monja 
Mono (NC/Ubangi/Mid- 
South Banda) 
Mono (NC/Gur-Adamawa/ 
Mbum-Day/Mbum/ 
South) 
Monogoy 
Montol 
Monzombo 
Moru 
Moru-Madi 
Moruba 
Mosa 
Mosi-Gurma Ful 
Moso 
Motembo 
Mowea 
Moy 
Moyoko 
Mpada 
Mpama 
Mpee 
Mpese 
Mpiem(o) 
Mpo 
Mpoman 
Mpondo 
Mpongwe 
Mpopon 
Mpoto 
Mputu 
Mser 
Mtang’ata 
Mujum 
Mukulu 
Mumbake 
Mumoni 
Mumuye 
Mundang 
Mundani 
Mundat 
Mundu 
Mundum 
Munga 
Mungaka 
Munjuk 
Munyo 
Murang'a 
Murulle 
Musey 
Musgu 
Muthambi 
Muungo 
Muure 


85/34d 

97/81 

93/85a 

102/61, 103/61, 104/34 

108/16 

92/05a 

106/05 

95/42b 

75/17 

102/64, 103/46, 104/14 

102/60, 103/56, 104/33 

99/22 

95/39a 

94/39a+39b 

93/37, 95/40i 

83/68, 87/38 

92/17, 93/91 

89/84, 90/59 

82/17, 87/20 

94/30 

85/1e

104/04c 

79/41, 80/04, 81/01, 82/47, 
83/70, 87/44 

105/73 

80/71f 


82/57, 89/23A 


91/23 

90/65, 94/62 
96/82, 98/85 
82/72, 87/76 
79/65c
79/65

89/12, 90/24 
85/2b 
102/M, 104/M 
80/34c 
82/33, 87/30 
82/34b 
81/13, 83/64 
83/40a 
90/69, 99/04 
81/04a 
81/10 
84/04a 
87/53d 
91/65 
87/53f
74/04a 
88/09 
87/53e 
75/37, 76/63 
78/22, 80/67, 81/71 
90/70d, 99/06 
76/01e, 85/1} 
93/62h 
77/01b 
98/23 
85/29c 
91/20, 98/27 
90/14, 91/33c, 94/02a 
93/39 

96/88 

79/49 

95/54 

98/15 

93/46, 95/50 
90/72, 94/57 
85/43f 
86/9e 

85/M 

90/63, 94/64 
90/71, 94/58 
85/36b 
92/14b 
91/05b 


35/d 
11


9 

7 

102 

1,404 

5,406 
337 (Togo 240) 

78 


78 


36 
Dogon 
5,472 


113 
Tetela 


98 


1 

29 
22 (CAR 13) 

133 


Banda-Mbres 


95/d 


Xhosa 
5 
20 

191 (Tanz. 143) 


4 
Bamileke 

14 

20 


Muyang 

Muyonzi 

Mvano 

Mvedere 

Mvele 

Mveny 

Mvuba 

Mvumbo 

Mvumbu [Wuumu] 

Mwahed 

Mwali [Mohere] 

Mwaneko 

Mwani 

Mwele 

Mwenshi 

Mwenyi-Simaa-Mbumi- 
Imilangu 

Mwera 

Mwimbi 

Mwina 

Mwini [Miini/Barawa] 
(NC/North-East Bantu/ 
Swahilioid) 


N(g)hwele 
N(h)aro(n) 
Naadh [Nuer] 
Nabit 

Nabulu [Abulu] 
Nafaanra 
Nafara 

Nagumi 
Nahara 

Naki 
Nalu 


Nama-Damara 
Nande 
Nandi 
Nandu-Tari 
Nangam 
Nankanse 
Nanuni 
Naraguta 
Nata 
Natioro 
Naudem 
Nawuri 
Ncane 
Nda'nda' 
Ndai 
Ndaka 
Ndali 
Ndam 
Ndamba 
Ndasa 


Ndebele 

Ndembu 

Ndemli 

Ndendeule 

Ndengense 
Ndengereko 

Ndera (NC/North-East 
Bantu/Pokomo) 
Ndera (NC/Gur-Adamawa/ 
Leko-Nimbari/Duru) 
Ndi 

Ndia 

Ndibu 

Ndienga 

Ndingi 

Ndo 

Ndobo 

Ndoe 

Ndokpa 

Ndolo 

Ndombe 


94/29 

83/13b 

95/66 

89/24 

87/70i, 92/40}, 93/98d 
87/70h, 88/43a 
79/63c

88/37, 92/31b 
83/29, 88/48 
93/88h 

075/5 

93/888 

75/01, 76/01g 
83/48, 88/30 
77/51, 78/07 


77/62c
75/07, 76/37 
85/36a 
85/3c


76/07, 85/18 
74/52 

79/78

102/65, 103/44, 104/20 
79/61g 
103/82, 105/58 
105/59b 

91/53 

75/13 

95/22 

107/03, 108/30 


74/46, 84/54 
79/07, 80/18 
86/85 

96/47 

104/04h 

104/15 

102/70, 103/50, 104/25 
96/09, 97/48 
86/41 

104/10 

102/62b, 103/62, 104/27 
102/18, 103/15 
95/20 

93/64 

91/25 

79/14 

76/54 

90/49 

76/32 

83/50, 88/32 
74/19, 75/42a 
93/27, 95/29+29a 
74/01, 77/60 
84/14 

92/03, 93/34 
75/08, 76/36 
80/04f, 81/01a
76/28 


85/3a 


91/11, 98/25c 
89/29 

86/9b 
84/01e
83/34 
83/07 
79/63d 
82/08, 87/12 
93/25, 95/28
82/62, 89/34 
82/14, 87/17 
84/28 


17 


Banda-Banda 
156 


130 (L2 23) 


98 

484 

117 
Pokomo 


52 

49 
probably extinct 

2,800 

6 

23 

(Guinea 18) 

247 

1,147 

332 

) 

Dogon 


14 
almost extinct 
30 


30 
Banda-Banda 


Kongo 


376/d 


323 


Ndonde [Mawanda]
Ndonga 


Ndongo-Binza 
Ndongona 

Ndoro 

Ndri 

Ndura 

Ndut 

Nduuvum 

Nee 

Nembe 

Neyo 

Ng'Wande 

Ngabu 

Ngaja 

Ngala 

Ngam(a) 

Ngamambo 

Ngambay 

Ngamo 

Ngando (NC/North-West 
Bantu/Mongo-Nkundo) 
(C70) 

Ngando (NC/North-West 
Bantu/Ngando) (C10) 
Ngangwela 

Nganjera 


Ngao 
Ngapu 
Ngare 
Ngatana 
Ngazija 

Ngbaka 
Ngbaka-Baka 
Ngbaka-Ma'bo 
Ngbaka-Manza 
Ngbaka-Minangende 
Ngbandi 

Ngbundu 

Ngele 

Ngende 

Ngie 

Ngindo 

Ngishe 

Ngiti [Dru(na) ] 
Ngizim 

Ngola 

Ngom 

Ngoma 

Ngomba 
Ngombale 

Ngombe 
Ngombe-Motembo 
Ngondi 

Ngong 

Ngongo 

Ngoni 


Ngoni-Senga

Ngoro (NC/Bantoid/
Mambiloid)

Ngoro (NC/North-West
Bantu/Lundu-Balong)
(A10)

Negika

Ngubi

Ngulu

Ngumbo

Ngungwel

Ngungwoni

Ngurimi

Ngwa

Ngwaketse

Ngwane

Negwaphyi [Nggwahyi]
Ngwato

Ngwe



75/10, 76/39 
84/36b 


82/15, 87/18 
84/29f 

95/67 

82/66, 89/30 
85/3d 

108/07 
91/43c 
93/63c 
101/04 
105/26b 
91/51, 95/16 
79/56b, 89/15 
89/26 

84/29 

89/68, 90/41d 
93/39a, 95/40} 
90/41g, 91/79 
98/78, 100/03 


79/41a, 80/04a, 81/01r 


82/42, 87/31 
74/38, 84/43 
84/36j 


89/19, 90/26 

89/27 

81/19, 83/39, 87/39 
85/3; 

75/02 

87/78 

87/75 

82/73, 87/74, 89/02 
82/76, 87/79 
82/75, 87/86 
79/53, 82/44, 87/97, 89/43 
82/69 

80/71i 

81/50 

93/35, 95/40 
75/06, 76/35 
95/40d 

79/60b 

100/10 

84/15 

83/47, 87/66 
77/01a 

93/58 

93/56 

82/34, 87/27 
79/33, 82/35, 87/28 
87/37 

91/54 

80/75, 81/46 
75/12, 76/61 


75/26, 76/58, 77/19 


92/02a, 93/19c 


93/82a 
74/04c 
88/13 
76/14, 85/21 
77/03 
81/26, 83/25 
83/59c 
86/50 
93/64d 
74/07f
91/43a 
94/10a 
74/07b 
93/61 


47 
Ovambo- 
Kwanyama 


53/d 
Banda-Banda 
3 
27 
Bamileke 
131 
11 


14 


220 
Ovambo- 
Kwanyama 
Banda-Ndele 
Banda-Banda 
Pokomo 

760 

1,050 


1,030 
267 
Banda 

Tetela 
26 


127 
97 
Kimbundu 
14/o 
69 
54 


almost extinct 
284 

(Tanz. 236) 
491/d 


Tswana 


3 
Tswana 
66 


Ngweme
Ngweshe (NC/North-East
Bantu/Shi-Havu) (J50)
Ngweshe [Gvoko] (AA/
Central Chadic)

Ngwi

Ngwo

Ngwol

Ngyeboong

Nho


Niellim 
Niende 

Nigi 
Nimbari 
Nindem 
Ninzam [Ninzo] 
Nithi 
Njanyi 
Njem 
Njikun 
Njinga 
Njinju 
Njoa-poantu 
Njoyame 
Nkamu 

Nkangala 

Nkem 

Nkim 

Nkoi 

Nkombe 

Nkomi 

Nkonya 

Nkore [Nyankore] 
Nkoya 

Nkuchu 

Nkum 

Nkundo 

Nkwen 

Nlaa 

Nle 

Nnakenyake 
Nnam 

No(o)n 

Nomaante [Numaand] 
Nonda 

Noni 

North Binja 

North Bisa 

North Etung 
North Hemba 
North Idoma 
North Ngom 
North Ntomba 
North Samo 
North Songye 
North Sotho 
North-East Badara 
North-East Duguri 
North-East Yoruba 
North-West Gbaya 
Nowola 

Noy 

Nozaman 

Nsari 

Nsele 

Nsenga 

Nsinseke 

Nsongo 

Nsose 

Nsundi 

Nta 

Ntandu 

Ntomba 

Ntsiaki 

Ntsotso 

Ntumu 
Nudoo(ng) 
Nujum 
Numana-Gwantu 


87/56b 
80/46b 


94/21 

81/33 

95/40c 
100/17b, 101/47c 
93/59 

92/27, 93/86 
89/46, 90/02 
102/71a

93/72 

91/19 

96/32b, 97/11b 
96/31 

85/36, 86/12 
91/74 

87/56d 
95/79c 

84/20 

83/59a 

93/63a 

91/45 

84/04e 

84/47 

95/32a 

95/32 

78/02b, 80/71) 
80/71d 

88/07 

102/17, 103/12 
80/28 

77/67

80/72, 81/48 
95/32b 
81/01n 

93/53, 95/59 
93/88b 
93/88c 

91/49, 98/53 
95/30 

108/06 

92/11, 93/73 
80/61a 

95/21 

80/64a 

77/34a 
93/23a 

78/14a 
96/67a, 101/60a 
87/66 

81/07a 
104/83a 
78/18a 

74/05, 75/50 
96/59c 

97/64 

101/89, 102/34 
91/38a 
105/59d 
89/53, 90/03 
87/58, 88/42 
95/19 

95/29b 

75/33, 77/20b 
83/59b 
81/01c, 82/47b 
93/85 

84/03 

95/29c 


84/04, 81/56, 83/02 


81/07, 83/63 
83/13a 

84/07 

88/43d, 92/40c 
93/19b 

93/19a 

96/33 


18 


Bamileke 


23 

35 
Tetela 
Tetela 

25 

29 

2,083 

81 

78 


92 
Ntomba 
Samo 
Songye 

4,235 
Badara 
Duguri 
Yoruba 

120 


almost extinct 
Language [alternative name]                 Map no./language no.        Number of speakers (in thousands)

Nungu                                       96/36                       52
Nungunu                                     92/12, 93/75                -
Nuni                                        104/44, 105/40              -
Nuno                                        97/38                       1
Nunu                                        81/12, 83/61                -
Nupe Tako                                   101/58                      1,040
Nutata                                      92/05b                      -
Nuyangbe                                    92/05c                      -
Nwa(n)/Wan                                  105/71                      27
Nyabo                                       105/30                      -
Nyabwa                                      105/31b                     52
Nyakyusa                                    76/55, 77/16                1,302 (Tanz. 930)
Nyala                                       86/29                       -
Nyali                                       79/12                       52
Nyambo                                      80/27                       10
Nyamwezi                                    76/19, 78/36, 86/16, 80/51  1,120
Nyan(y)eka                                  84/29d                      324/d
Nyanga (NC/North-East Bantu/Rwanda-Rundi)   80/26                       -
Nyanga (NC/Central-West Bantu/Kongo) (H10)  83/04                       49
Nyangbo [Yangbo]                            103/31                      5
Nyanja                                      75/20                       4,400
Nyarafolo(ro)                               105/59a                     48
Nyasa (Tanzanian Nyanja)                    75/21, 76/59b               70
Nyengo                                      74/36, 77/63                7
Nyep                                        93/64a                      -
Nyeri                                       86/9d                       -
Nyiha                                       75/24, 76/51, 77/14, 78/33  757
Nyindu                                      80/44                       -
Nyiramba                                    86/20                       530
Nyo'o                                       93/67                       7
Nyole [Nyuli]                               86/39                       -
Nyore                                       86/351                      290
Nyoro [Bunyoro]                             86/28, 79/03                817
Nyungwe                                     74/23, 75/30, 77/20a        419
Nzadi                                       81/65                       -
Nzakara                                     79/43                       54
Nzanda                                      83/15c                      -
Nzanyi                                      94/54                       104
Nzebi [Njabi]                               83/40, 88/45                195
Nzema                                       103/07, 105/17              448
Nzime                                       87/56c                      -
Nzwani [Njwani]                             75/03                       195

Obang                                       93/22b                      -
Obanliku                                    95/73g                      98
Oblo                                        91/35                       -
Obolo                                       101/21                      130
Obronuagum                                  101/11                      -
Obulom                                      101/13                      -
Odual                                       101/08b                     26
Odut                                        93/08, 101/32b              disappearing
Ogaden                                      85/0                        -
Ogbah                                       101/85c                     217/d
Ogbia                                       101/10                      266
Ogbogolo                                    101/12                      12
Ogoi                                        101/15                      7
Ogori-Magongo [Oko]                         101/59                      14
Okak                                        87/70e                      -
Okobo                                       93/05, 101/24               54/d
Okpahmeri                                   101/74                      59
Okpe                                        101/62e                     11/o
Okpe-Idesa-Oloma                            101/73                      -
Okrika [Kirike]                             101/02a                     285/d
Olee                                        83/34b                      -
Olobiri                                     101/10b                     -
Olulumo                                     95/77                       12
Omambala Igbo                               101/79                      Igbo
Ombo                                        80/03                       -
Omvang                                      87/70d                      -
Omyene (total?) (NC/North-West Bantu/Omyene) (B10)                      32
Ongamo                                      86/75                       4
Onicha                                      101/80                      Igbo
Ora-Emai-Ialeha                             101/69                      139
Orma                                        85/43g                      65
Oro                                         93/04, 101/23               104
Oromo                                       85/43, 86/96                -
Oru Yille                                   104/4i                      Dogon
Oshun                                       102/30a                     -
Osimili Igbo                                101/81                      Igbo
Ososo                                       101/68                      -
Otank                                       95/02                       4
Ovambo                                      74/42, 84/36                862
Owere                                       101/83b                     Igbo
Oyo                                         102/30b                     -
Oyuba                                       88/05                       3

Pa'a                                        97/78                       10
Pa'no                                       91/03                       -
Pabir                                       98/104a                     273
Padogho                                     104/58, 105/45              1
Pai                                         74/05e                      North Sotho
Pajade [Badyara]                            107/08                      7
Palor                                       108/08                      10
Pam                                         91/26                       -
Pambia                                      79/45                       3
Pana (NC/Gur/C/S/Grusi/n)                   104/41                      12 (Mali 6)
Pana (NC/Gur-Adamawa/Mbum-Day/Kim)          90/17                       71
Pande                                       87/36                       -
Pangwa                                      76/44                       260
Panyam                                      98/20                       -
Papel                                       108/20                      122 (Guinea Bissau 120)
Pare [Asu]                                  85/22, 86/3                 484
Parekwa                                     94/18                       36
Pata                                        78/04, 80/74                -
Pate                                        85/1c                       4
Pedi                                        74/05a                      3,990
Peere                                       91/05, 98/26a               24
Pelasla                                     94/27                       11
Pemba                                       76/01d, 85/1i               -
Pende                                       81/64                       533
Peri                                        79/10, 80/13                20/d
Pero                                        98/80                       29
Peve                                        91/69a                      36
Phuthi                                      74/06a                      South Sotho
Pidgin/English (IE/WG/Pidgin)                                           -
Piin                                        81/34                       -
Pimbwe                                      76/27, 78/37                39
Pinji                                       83/55, 88/22                7
Pinyin                                      93/55, 95/60                17
Piti                                        96/04, 97/25a               3
Piya [Wurkum]                               98/81                       7
Plains Igbo                                 101/82                      Igbo
Podzo                                       75/27                       104
Pogoro                                      76/33                       247
Pojulu                                      79/72                       42/0
Pok                                         86/81                       20
Poka-Kori-Wipsi-Geeri                       100/25, 102/42              49
Poke                                        79/37, 80/01, 81/20         90
Poko-Vumba                                  82/34d                      -
Pokomo                                      85/3                        74 (U 40, L 34)
Pokot                                       86/79                       312
Polci                                       96/97, 97/89, 98/97A        30
Pomo                                        87/46                       -
Pongo                                       92/14a                      -
Pongu [Ri]                                  101/43                      27/d
Pori [Pol]                                  87/48, 91/63                -
Portuguese-based Creoles                    88/57                       106 (L2 5,300)
Poto                                        82/35a                      -
Pove                                        83/54, 88/23                7
Psikye                                      94/17                       60
Puguli [Pwo]                                104/50, 105/38              -
Pulana                                      74/05                       North Sotho
Pular                                       108/01a                     3,095
Pungu                                       84/29                       -
Punu                                        83/22, 88/10                148
Putai                                       98/105, 94/11, 99/01        almost extinct
Pyapun                                      96/80                       -

Rabai                                       85/10                       85
Ragane                                      80/34a                      -
Rangi

Rendille 

Rera 

Reshe 

Ribina 

Rihe [Ribe] 

Rikpa 

Rimi [Nyaturu] 

Ripey 

Roba 

Rolong 

Rombo 

Ronde 

Ronga 

Rue 

Ruihi [Rufiji] 

Rukuba [Che] 

Ruli 

Rumaya [Mala] 

Rundi 

Runga 

Rungu (NC/North-East 
Bantu/Tongwe) (F10) 

Rungu (NC/North-West 
Bantu/Omyene) (B10) 

Rungwa 

Ruruma 

Ruto 

Ruund 

Rwanda [Nyarwanda] 

Rwo 


S(h)inyar [Shemya] 

Sa 

Saba 

Sabanga 

Safalaba 

Safen 

Safwa 

Sag(h)ala 

Sagala 

Sakaci 

Sakarda 

Sakata 

Sake 

Sakuyu 

Sala 

Salka 

Sama (NC/Central-West 
Bantu/Kimbundu) (H20) 

Sama (NC/North-west 
Bantu/Sama) 

Samba (Leko) 

Sambo 

Samburu 

Samia 

Samo 

Samogo 

Samwe 

San 

Sanda 

Sandawe 

Sanga (NC/East Kainji) 

Sanga (NC/Central-West 
Bantu/Luba) (L30) 

Sango 


Sangu (NC/Central-East/ 
Bena-Kinga) (G60) 

Sangu (NC/North-West 
Bantu/Shira-Punu) (B40) 

Santrokofi [Sele] 

Sanza 

Sapiny 

Sarwa 

Sasaru-Enwan-Igwe 

Savannah Igbo 

Saya 

Seba 


76/17, 86/22 
85/39, 86/94 
80/45b 
100/21, 101/37, 102/38 
96/05, 97/44 
85/9 

93/81 

76/18, 86/21 
93/80 

98/11 

74/07g 

86/6) 

106/06b 
74/11, 75/48 
74/22, 75/32, 77/22 
76/29 

96/29, 97/11d 
86/31 

97/42 

80/23 

89/64 


77/73a, 78/40


88/08 

76/26, 78/38 
97/41 

89/71, 90/43a 
81/75 

79/01, 80/21 
86/6a 


89/66 
93/62c 
89/86 

89/07 
103/42 
108/05 
76/49, 77/17 
85/26 

76/13 

75/14 
99/23c 
81/11, 83/65 
87/64, 88/33 
85/43c 
74/30, 77/31 
100/19, 101/38a 


84/16 


87/68, 88/50, 91/17 
91/17, 95/96, 98/22+552 
84/26f 

86/77 

86/35m 

104/83 

104/81, 105/64, 107/25 
104/11 

81/35 

106/08e 

76/72, 86/100 

97/27 


77/49, 78/06 
82/55, 89/42 


76/48 


83/20, 88/17 
102/11, 103/26 
80/40c 

86/78 

89/81a, 90/46 
101/71 

101/78 

96/94, 97/86, 98/99 
77/42 


59 
Tswana 


Chagga 


Bamileke 
5 
Banda-Mbres 
4 
105/d 
202 


69 
456 
(L2 6,000) 


104 


Sefwi 
Segeju 
Seki 


Sekpela [Likpe] 

Sele 

Sembla 

Seme 

Sena 

Senari 

Sende 

Sengele 

Seno ful 

Sere (NC/Ubangi/Sere- 
Ngbaka-Mba/Bviri) 

Sere (NC/Ubangi/Sere- 
Ngbaka-Mba) 

Serer(-sin) 

Sese 

Sha 

Shagawu 

Shall-Zwall 

Shamba(l)a 

Shambiu 

Shangaan [Tsonga-Jonga] 

Shange 

Shani 

Shanjo 

Sharwa 

Shasha 

Shatt 

Sherbro 

Shi 

Shila 

Shinga 

Shingu 

Shira 

Shiwo 

Shobio 

Sholio 

Shona 


Showa 

Shua 

Shuwa 

Sighau 

Sigu 

Siha 

Sile 

Sinji 

Siri 

Sisaala 

Siska [Tonga] 

Siti 

Siyu 

Sizaki 

So (NS/Kuliak) 

So(ko) (NC/North-West 
Bantu/Poke-Kele) (C60) 

So(o) (NC/BC/E/Bantoid/ 
S/Bantu/NW/Maka- 
Njem) (A80) 

Soga 

Sokoro 

Soli 

Solongo 

Somali 

Somrai 

Somyewe 

Sondi 

Songhai 


Songo 

Songu 

Songye 
Songye/Luba-Lulua 
Soni 

Soninke 

Sonjo 


103/06a, 105/16 
76/02a, 85/13 
88/25 


102/09 

84/26a 

104/82, 105/65 
104/68, 105/32 

75/28 

104/08, 105/59, 107/32 
84/19 

81/05, 83/66 

104/S 


79/46 


89/01 

108/03 

86/30 

96/95 

96/89 

96/41, 97/13, 98/65 
76/03, 85/14, 86/2 
84/25a 

74/12, 75/47 
106/06a 
97/47 

74/31, 77/27 
94/52 

77/66

79/68a 
106/06 

80/46 

77/10, 78/30 
77/08

93/64g 
83/19, 88/14 
87/61, 88/40a 
80/37 

96/46c, 97/21 
77/57


81/49a 

74/48, 77/76 

90/73e, 99/18 

85/27 

83/44, 88/26 

86/6b 

80/15 

84/13 

97/76 

103/69, 104/51, 105/39 
75/36, 76/65, 77/43 
103/75, 104/56, 105/36 
85/1b 

86/47 

86/56 


79/38 


87/62 

86/34 

89/85, 90/60 
74/32, 77/25 
84/02a 

85/41 

90/51 

95/63 

84/22, 81/76 
102/83, 104/01 


80/71h 

84/12 

78/18, 80/63 

81/74 

80/35 

104/79, 107/24, 108/33 
86/24 


236 

8 

19 
(Eq. Guinea 13) 


1,088 


7,980 
(Zimb. 6,910) 
10 


72 
Kongo 
450 
10/d 
<1
3,176 
(Niger 2,261) 
Tetela 
30/d 
1,200 


1,360 
24 
Sotho (total) (NC/South-
East Bantu/Sotho- 
Tswana) (S30) 

South Binja 

South Bisa 

South Etung 

South Hemba 

South Idoma 

South Kambari 

South Keaka 

South Ngom 

South Ntomba 

South Samo 

South Songye 

South Sotho 

South Tugan Ful 

South-West Badara 

South-West Duguri 

South-East Yoruba 

South-West Yoruba 

Sozo 

Suba 

Subi 

Subiya 


Subre-Kaberwa 
Suga 

Sugurti 

Suku 

Sukuma 

Suma 

Sumbe 
Sumbwa [Mweri] 
Sundi 

Sunga 

Supyire 

Sura 

Surubu 

Susu 

Swahili 

Swaka 

Swati [Swazi] 


Taari 

Taboan 

Tabwa 

Tachoni 

Tadakshak 

Tafi/Tutrugba/Tegbo 

Tafire 

Tagbana 

Tajuos 

Takum 

Tal 

Tala 

Tale (NC/Central-West 
Bantu/Songye) (L20) 

Tale (NC/Gur-Adamawa/ 
Mbum-Day/Mbum) 

Talinge 

Talni 

Tamasheq 


Tambago 
Tambas 
Tambo 
Tampulma 
Tangale 
Tapshin 
Taram 
Tarok 
Tavara [Tawara] 
Taveta 
Tawana 
Tayari 

Te’ 
Teda-Daza 
Tee 

Teere 


80/64b 
77/34c 
93/23b 
78/14b 
101/60d 
101/38, 102/36 
93/22e 

83/47, 88/29 
81/07b 
104/83b 
78/18b 

74/06 

104/T 
96/59b 

97/60 
101/91, 102/31 
101/90 

80/38 

86/52 

80/39 

74/39, 77/61a 


105/24c 

91/44, 95/69 
99/21 

84/09, 81/57a 
86/17 

90/31 

84/26b 

86/15, 80/50 
84/01f, 83/09 
86/35p 

104/06, 105/60, 107/33 
96/74, 97/73 
97/45

106/23, 107/27 
76/01, 85/1 
77/39 

74/02, 75/49 


91/69b 
93/66c 
77/06, 78/31
86/35c 
104/02 
103/32 
104/66, 105/54 
105/55 
106/13b 
95/82b 
96/79 
97/82, 98/91 


80/62 


90/11 

79/08, 80/40b 

102/66, 103/45, 104/21 
100/12, 102/80 


79/55c, 89/17, 90/27 
96/86b 

75/25, 76/56, 77/13 
102/52, 103/72, 104/54 
98/82 

96/70, 97/67 

95/72c 

96/55, 98/64 

75/42d 

85/24 

74/07a 

102/74, 104/37 
93/62d 

100/14 

83/28f 

91/08b 


10,352 
Binja 
Bisa 
Etung 
Hemba 
Idoma 
Kambari 
Keaka 
Ngom 
Ntomba 
90 
Songye 
2,974 
Fulani 
Badara 
Duguri 
Yoruba 
Yoruba 


67 
(Zambia 53) 
157 


164 

6,050 

56 

Umbundu 

252 

Kongo 

Luhya 
710 (Mali 562) 

78 

3 

968 


5,971 (L2 73,000) 


199 
1,121 


12 
3 (L2 46) 
20 
2 


3 
102/d 
102 
678 
(Burkina Faso 38) 
Banda-Ndele 
9 
9/d 
48/d 
150 


17 
Tswana 
54 
Bamileke 
469 


Tege 

Tegesye 

Teke (total) (NC/North- 
west Bantu/Teke) (B70) 

Tembo (NC/Central-West 
Bantu/Luba) (L30) 

Tembo (NC/North-East 
Bantu-/Congo/Bira- 
Huku) (D30) 

Teme 

Temne 

Tengo [Mamvu] 

Tengu Kan 

Tepo 

Tera 

Terik 

Teso 


Tetela 

Teve 

Tharaka 

Thembu 

Tiba 

Tibea 

Tiene 

Tigania 

Tige 

Tiharo 

Tikar 

Tingelin 

Tingong 

Tiriko 

Tiv 

Tlhaping 

To(o)ro 

Tocenga 

Togbo 

Togo Kan 

Toka 

Tommo So 

Tomo Kan 

Tonga (NC/Central-East 
Bantu/Lenje-Tonga) 
(M60) 

Tonga (NC/South-East 
Bantu/Chopi) (S60) 

Tongwe 

Tonjo 

Tonjo-Tukombe 

Toram 

Toro 

Toro So 

Toro Tegu 

Totela 

Ts'ao [Kua-Tsua] 

Ts'ikha [Ts'ixa] 

Tsabe 

Tsamba 

Tsangi 

Tsiege 

Tsogo 

Tsotso 

Tsuvan 

Tswa 

Tswana 

Tukele 

Tuken 

Tuki 

Tukombe 

Tula 

Tuling 

Tumak 

Tumbatu 

Tumbuka 

Tumbwe 

Tumnelia 

Tunen 

Tungoro 

Tunia 


83/28b 
103/81, 104/63a, 105/50 


77/46, 78/16 
80/10 


98/30 

106/08 

79/63f 

104/040 
105/28c 

98/100, 100/11A 
86/83 

86/69 


78/02, 80/71, 81/47 
74/20, 75/42b 
85/30, 86/13 
74/04e 

98/56 

93/77 

81/37, 83/57 
85/34b 
95/61b 
74/071 

93/20, 95/61 
91/37b 
93/78a 

86/35] 

95/01, 96/66 
74/07e 

86/27 

93/70f 

82/64, 89/16b 
104/04p 
74/33, 77/24a 
104/04k 
104/04a 


74/25, 77/24 


75/43 

78/43, 80/52 
93/70c 
93/70e 
89/83c 

79/04, 80/19 
104/04q 
104/04s 
74/40, 77/61b 
74/47 

74/49 
102/30c 
81/58 

83/41, 88/46 
83/51 

83/53, 88/18 
86/35g 

94/53 

75/45 

74/07, 84/48 
92/08 

86/86 

92/07, 93/70 
93/70b 

98/04 

92/04a, 93/66a 
89/81c, 90/48 
76/01b, 85/1k 
75/40, 76/67, 77/45 
78/34, 80/12 
99/23a 
92/04, 93/66 
93/70a 
89/54, 90/04 


26 
1,527 
(Uganda 1,270) 
953 
265 
132/d 
Xhosa 
Tuotampa

Tupuri 

Tura 

Turkana 

Turkwam [Toro] 

Tusya(ng) 

Tutsingo 

Twa (NC/Central-East 
Bantu/Bemba) (M40) 

Twa (NC/North-East 
Bantu/Rwanda-Rundi) 

Twafap 

Twumfu 

Tyap 

Tyapi 

Tyenga [Shanga] 

Tyepo 

Tyo 

Tyurama [Turka] 


Ubaghala 

Ubang 

Ubu-Mbokomu-Moshi- 
Kirua-Kilema-Mwika 

Ufia 

Ugare 

Uhami-Iyayu

Uhuum 

Ukaan 

Ukpe-Bayobiri 

Ukpet-Ehom 

Ukue-Ehwen 

Ukwani-Aboh 

Ukwo 

Ulu 

Ulukwumi 

Umbundu 

Umon 

Umvang 

Uneme 

Unga 

Unguja 

Ura 

Urhobo 

Usaghade 

Useri 

Utagwang 

Utuli 

Uvbie 

Uzekwe 


Vagla 
Vai 
Vale 
Varama 
Vata 
Venda 
Veng'o 
Vere 
Vidunda 
Viemo 
Vili 
Vimtim 
Vinza 
Vira 
Viya 
Vora 
Vuma 
Vumba 
Vungu 
Vute 


Waata 
Wada 


Waja 

Waka 

Wali 
Wama-Tankamba 


93/68 

90/13, 91/33d, 94/02c 
105/67, 106/30 

86/73 

96/56 

104/09, 105/52 
93/70d 


77/05


80/29 

93/64b 

95/61a

97/19 

107/22 

100/31, 102/77 
104/13 

83/28a 

104/61b, 105/42 


101/31 
95/73f


86/6f 

95/78d, 101/26a 
95/08 

101/75 

95/84a 

101/49 

95/731 

93/17 

101/76 
101/81b 
101/78d 
101/83a 
101/92 

74/43, 84/26 
101/30 

87/70g, 88/43b, 92/40e 
101/66 

77/02 

76/01a, 85/11
100/16, 101/44 
101/62c 

93/06 

86/61 

95/73b 

85/2c 

101/62d 
95/78b 


103/73, 104/55 
106/21, V 
89/70, 90/3b 
88/15 

105/23b 
74/09, 75/51 
93/45, 95/44d 
91/13, 98/25a 
76/12 

104/12, 105/53 
83/06, 88/52 
94/51 

78/46, 80/32 
80/25b 

88/19 

82/65, 89/16a
86/30b 

85/1h 

83/21, 88/11 
91/41 


85/43h 

79/55b, 82/67, 89/11, 
90/5 

98/01 

98/28 

103/39, 104/19 

102/73, 104/38 


47 
402 
5 

45 (N 21, S 24)


29 


3 


Bamileke 


260 
Igbo 
12 
5,200 
24 
Fang 
30 
Swahili 
2 
661 
13 
Chagga 
23 
39 
7 


Togbo-Vara 
3 


23 


8 
Banda-Mbres 


Wanda 


75/39, 76/50, 77/15 


Wandala (AA/Central Chadic) 95/102, 94/19, 99/02 
Wandala Daza (NS/Saharan) 99/25 


Wanga 
Wanji 
Warji 
Wasa 


Wase 
Wasulunka 


Watsi 

Wawa 

We 

Webo 

Wee [Guere] 


Weskos [Pidgin] 
(IE/WG/Pidgin) 

West Gbaya 

West Idoma 

West Kambari 

West Kare 

West Keaka 

West Massina Ful 

West Mbum 

West Nyala 

West Oroko 

West Teke 

West Temne 

West Yoruba 

Wi-Isu 

Wile 

Wobe 


Woj [Enekan] 
Wolof 

Wom 

Wongo 

Wor [Dugukan] 
Woyo 

Wukari 
Wum(b)vu 
Wumboko 
Wumbu 
Wumu 
Wungu 
Wushi 
Wuzlam 


Xasonke 
Xhosa 


Yaa 

Yabasi 
Yadina [Yidena] 
Yagba 
Yagima 
Yaka 
Yakoma 
Yakpa 

Yala 
Yalunka 
Yamba 
Yambasa 
Yambiang Yapeke 
Yanda Dom 
Yandang 
Yangavok 
Yangere 
Yans(i) 

Yao 
Yapoma 
Yasa 

Yasem 
Yashi 
Yasing 
Yasu(n)g 


86/35e 

76/47, 77/33b 
97/75

79/55e, 89/20 


96/52b, 98/59 
105/62e, 106/18c, 107/28b 


81/01g 

91/43e 

74/34, 77/24c 
105/28f 
105/31, 106/10 


90/35 

101/60c 

100/18, 102/37 

90/12b 

93/22d 

104/W, 107/01a, 108/01b 
91/29 

86/350 

92/25, 93/82 

81/29, 83/28, 88/49 
106/08a 

102/32 

95/41a 

103/43, 104/18, 105/34b 
105/31a 


105/62a, 106/18b 
107/02, 108/02 
91/18, 98/24 
81/54 

105/62b 

84/01c 

95/79a, 96/52a, 98/58 
83/49, 88/31 
92/18, 93/92 
81/61, 83/17a 
81/28, 83/27 
76/21, 77/74 
93/44, 95/44b 
94/33 


107/28c 
74/04 


83/28d 

92/35a, 93/97 

99/11 

101/89a 

81/01d 

84/06, 81/59, 83/17b 
82/54, 89/44 

79/55f, 82/58, 89/21, 90/28 
93/99, 95/91 

106/24, 107/26, 108/34 
95/48 

92/05, 93/69a 

92/34 

104/04m 

98/32 

93/98b 

87/94, 90/22 

81/31, 83/60 

75/11, 76/40 

92/36 

88/02, 92/21 

93/98a 

96/24 

90/15, 91/33a, 94/02d 
92/37b 


35 
66/d 


217 
81 
Banda Mid- 
Southern 
3 
191 
(Côte d'Ivoire 139)
Mongo 


598 
(Côte d'Ivoire 510)


2,670 
Gbaya 
Idoma 

195 
Kare 
Keaka 
Fulani 

79 
52 
44 

175/o 
Temne 
Yoruba 

2, 

130 

543 
(North 189, South 
354) 


3,413 (L2 7,800) 


<1
Yoruba 
Mongo 
390 
123/d 
33 
Yatye 95/90

Yauma 84/46a 

Yaure 105/69 

Yazi 78/09, 80/58 
Yebokolo 87/70b 

Yei 74/41, 77/72 
Yeke 77/50 

Yela 80/04c, 81/01t 
Yemba [Dschang] 93/60 


Yembe 


84/02b, 81/01p 


82/07, 87/08 


Yeskwa 96/28, 97/9a 
Yira 79/04a 

Yogoma 80/22 
Yom-Tanla 102/62a, 104/28 
Yombe 84/01b, 83/05, 88/51 
Yombo 81/53 

Yoni 106/08c 
Yoruma 99/23b 

Yugum 93/62f 
Yukuben 95/83 

Yulu 89/75 

Yungur 91/22, 98/10 
Zakan 93/78b 

Zalage 87/531 

Zaman 87/70f, 88/43c


10.5 The maps 


Map 74 focuses on the southernmost area of Africa, 
including South Africa, Botswana, Namibia and Zim- 
babwe. The southernmost extent of the African con- 
tinent is co-terminous with the southernmost extent 
of the Bantu languages. This area is also the exclusive 
host of most of the surviving Khoisan languages, of 
which Nama-Damara in the west is the most exten- 
sively spoken, enhancing the breadth of the area in 
which the Khoe family is spoken. The inset map 
shows the extent to which Khoisan varieties were 
spoken in previous centuries. They have left their 
mark on the Bantu languages, especially Zulu and 
Xhosa along the east and south coast, in the form of 
loan-words and the addition of clicks to the phono- 
logical inventories of these languages. Clicks occur 
exclusively in words of Khoisan origin, and are 
unknown elsewhere in Bantu. In the east, the Nguni 
group of South-East Bantu constitutes a dialect con- 
tinuum so that Xhosa and Zulu are highly mutually 
intelligible. The continuum extends northward into 
Swati and Ndebele. Neighbouring Bantu groups are 
similar in grammar but the Sotho group is quite 
different in phonology, due both to an archaic vowel 
system and a series of unusual consonantal shifts. 

To a much greater extent than elsewhere in 
Africa, South Africa in particular features a large 
number of L1 speakers of Indo-European languages, 
especially Afrikaans and English. Both languages are 
spoken by millions in this area. They were originally 
brought by waves of settlers from Holland and later 
England, but have also become the first languages 
of other groups — Afrikaans of the large Coloured 
population of mixed ancestry, and English of the 
East Indian population centred in the east around 
Durban. The North and West Cape areas of South 
Africa are majority Afrikaans-speaking (about 3 
million in 2000), with Tswana a distant second in 
the North Cape (about 170,000 in 2000), and English 
and Xhosa second most numerous in the West Cape 
(about 800,000 each). 

The Southern branch of Khoisan survives in 
small but scattered communities almost exclusively in 
Botswana, while the Northern branch survives among 
more widely scattered small bands in Botswana and 
across the border in Angola and marginally in 
Zambia. Most speakers of Khoisan are bilingual in a 
Bantu or Germanic language by adulthood. 


30 Zamba 
- Zan Gula 
32 Zanaki 
= Zande 
- Zangwal 
34 Zaramo 
- Zari 
65 Zarma 
350 Zeem 
Mongo Zela 
26 Zelgwa 
- Zezuru 
- Zhu-/Hoa(si) 
137 
Kongo Ziba 
= Zigua 
- Zime [Herde] 
- Ziziliveken 
Bamileke Zombo 
19 Zongbi 
9 Zua 
88 Zubaki 
Zula 
Bafia Zulu 
7 Zumaya 
Fang 


Map 75 focuses on Mozambique, but includes 
the Comoro Islands, each with a distinct but closely 
related language. These languages have often been 
considered by scholars as divergent forms of Swahili, 
most similar to mainland rural dialects further north, 
but speaker perception is that these are distinct lan- 
guages. They are not mutually intelligible with other 
varieties of Swahili, and should probably be con- 
sidered a separate, but closely related language group 
for sociopolitical reasons. The phonological features 
they share with rural Swahili are also characteristic of 
much more distantly related Makua, the majority 
language group of northern Mozambique. There is no 
doubt that Mwani, the divergent Swahili variety of 
the northern Mozambique coast, owes many of its
divergent features to Makua, and also to Makonde.

The entire area of Map 75 is exclusively Bantu. 
As on other Bantu maps in this edition, Guthrie’s 
classification is generally used for convenience where a
more decisive classification still remains to be done.
However, Chwabo is here classified as N40 with Sena 
et al., rather than with Makua P30 since it shares fea- 
tures with both groups, but would be highly divergent 
as a Makua language. Ngoni-Senga (N40) is very close 
to Tumbuka (N20), though classified here with Sena- 
Senga. The continuum nature of much of the area, as 
elsewhere in Bantu, makes some classification arbi- 
trary. As a clearly distinctive group on its own, Shona, 
centred in Zimbabwe, extends to the coast through 
central Mozambique, separating the Central from the 
Southern East Bantu languages centred in Mozam- 
bique. Because of its colonial history and continuing 
educational policy, Portuguese is widely spoken as an 
L2 by adults throughout Mozambique. 

Map 76 focuses on southern and coastal Tanzania. 
It continues the East Bantu continuum northwards 
from Mozambique and Malawi. There is much variety 
among these languages, but less than in many other 
Bantu areas. A source of potential confusion is the 
term Nyasa, indicated on the map as closely related to 
Nyanja. Nyasa and Nyanja are cognate words refer- 
ring to the lake (Lake Malawi). The Mpoto and 
Manda, distinct ethnic groups speaking distinct, 
though closely related Bantu languages, both some- 
times refer to themselves and their language as Nyasa, 
meaning ‘people/language of the lake area’. There- 
fore, Nyasa may refer to Mpoto or Manda, languages 
more similar to the Ruvuma group than to Nyanja/ 
Chewa. The area designated as Nyasa is a mixed area, 
where Nyanja, Mpoto and Manda are all spoken. 




Swahili is spoken as a lingua franca throughout the 
area of the map. The unique concurrence of members 
of the four African linguistic phyla, seen in the 
middle of the top of the map, will be discussed 
further under Map 86, where the entirety of the con- 
currence can be seen. 

Map 77 focuses on Zambia, an area that antici- 
pates the increasing heterogeneity of Bantu further 
north. The area includes East, West, South and 
Central Bantu groups. Bemba serves as a lingua 
franca in Northern Zambia, as does Tonga (of the 
Tonga group) in Southern Zambia. In the south-west, 
Lozi originated as a variety of Southern Sotho spoken 
south of the area, and remains distinct from the other 
languages of the area. However, it has also diverged 
considerably from all other varieties of Sotho-Tswana 
in the direction of adjacent Luyana. In Malawi, 
Chewa — differing minimally from Nyanja — is wide- 
spread as a second language. Note that Senga (Ngoni/ 
Senga on the key) is distinct from Nsenga. Senga is 
sometimes classified as a dialect of Tumbuka, while 
Nsenga is closer to Chikunda. Among the ethnic Ngoni 
there are also Nsenga speakers. Mpezeni is a variety 
of Nsenga spoken by ethnic Ngoni in Zambia. 

At least until recently, small migratory groups of 
Khoe (Central Khoisan) speakers occasionally crossed 
from Botswana and Angola to Southern Zambia, 
speaking a language sometimes called Hukwe or 
Kwengo. The name is clearly not an autonym, but 
seems to contain the root *kwe (< khoe). This lan- 
guage does not seem to be spoken in Zambia any 
more, and cannot be located on Map 77. Not repre- 
sented on the map is the Kingwana variety of Swahili 
widely spoken in Lubumbashi, and much influenced 
by Luba in its structure. 

Map 78 focuses on the Katanga area of the 
Democratic Republic of Congo (DRC) with the 
adjacent areas of northern Zambia and south-western 
Tanzania. Most of the Katanga languages are closely 
related to one of the two distinct varieties of Luba: 
Luba-Katanga (formerly Luba-Shaba), and Luba- 
Lulua further west. Each is spoken by millions of 
people. Due to ethnic variety among Luba speakers, 
there is some divergence over whether specific 
varieties are considered dialects or distinct languages. 
For example, Shankali and Zela are considered 
dialects of Luba-Katanga in some studies and independent 
languages in others. The Luba and Songye groups 
are particularly close. The Tetela group is more dis- 
tinct. It is noteworthy that Lake Tanganyika does not 
appear to be a barrier to the Bantu continuum to its 
east and west. 

Map 79 focuses on Orientale (formerly Haut-Zaïre), 
the most north-eastern area of the DRC. Orientale 
marks the eastern extent of the major Bantu 
internal diversity, increasing in a western direction 
throughout northern Bantu. The map also features 
the eastern limit of contact between Bantu and 
Ubangi, very distinct groups of Niger-Congo in the 
area. The eastern Ubangi language Zande serves as a 
lingua franca in most of the Ubangi-dominated part 
of the area, while Lingala is a widespread Bantu-based 
lingua franca in the Bantu part of the area. The shift 
of much of the Sere-speaking population to Zande 
was mentioned in the introduction (10.1.3). Some of 
the Nilo-Saharan languages of the area are spoken by 
relatively large populations as first languages, but 
largely remain ethnic languages. The eastern area 
shows the interpenetration of Ubangi- and Central 
Sudanic-speaking populations. Note the scattered 
pockets of Bantu in the Zande area, and especially the 
scattering of Bantu Boguru in both Zande and Man- 
gbetu areas. Without further information it is difficult 
to tell whether these outliers are extreme advances of 
Bantu northward in the area, or islands left from the 
southern spread of Ubangi- and Mangbetu-speaking 
peoples. The extreme southern position of Lombi 
suggests it has advanced south into Bantu-speaking 
areas, with which it co-exists. 

Map 80 focuses on the Kivu area of the eastern 
DRC, and neighbouring Rwanda and Burundi to the 
east. The North-East Bantu languages of the area are 
relatively closely related, with greater diversity among 
the North-West Bantu languages. Swahili is widely 
spoken as a lingua franca in the eastern part of the 
area, especially in Kivu. Lingala is the lingua franca of 
the western part of the area, extending along the Zaire 
River. Burundi and Rwanda are linguistically quite 
homogeneous, but ethnically diverse. These closely 
related languages are spoken by very large numbers of 
speakers, by Bantu standards, extending into adjacent 
Uganda and DRC beyond the areas of concentration 
shown on the map. Note the Holoholo-speaking 
communities on both sides of Lake Tanganyika, dem- 
onstrating that the lake is no barrier to linguistic dif- 
fusion (cf. Map 78). 

Map 81 focuses on east and west Kasai in the 
DRC. It is an area of relatively great diversity in West 
Bantu. However, large parts of the area are dominated 
by specific subgroups of closely related Bantu lan- 
guages, especially the Mongo-Nkundo group. Nkundo 
is occasionally used to refer to the dialects of Mongo 
south of the Lomela River, for example Booli and 
Mbidiankamba. It is not clear if it is also used to refer 
to a generalized Mongo lingua franca in the southern 
Mongo area. Ndengese, grouped with Mongo-Nkundo 
on this and the preceding map, is usually classified as 
a member of the Kuba group. Kituba, a pidginized 
variety of Kongo, is widely used as a lingua franca in 
the western part of the area. 

Map 82 focuses on the Equateur region of the 
DRC. It is a western area of Bantu-Ubangi contact in 
the north. It features the homelands of both the 
Bantu-based Lingala lingua franca and the Ubangi- 
based Sango lingua franca. As lingua francas, both 
these languages extend far beyond the first-language 
areas for which they are represented on the map, but 
are largely contained by Bantu-speaking populations 
for Lingala and Ubangi-speaking populations for 
Sango. Some of the southernmost Ubangi languages, 
particularly Monzombo in the west, are spoken 
by relatively small populations, which have been 
intermarrying with the larger Bantu populations and 
shifting to Bantu languages while maintaining their 
Ubangi ethnicity. Noteworthy is Furu, a southern 
outlier of the Central Sudanic group of Nilo-Saharan, 
isolated from other Central Sudanic languages. 
Language shift to neighbouring Ubangi languages is 
probably imminent. 

It is generally assumed that the small and scat- 
tered bands of pygmies in the area, now speaking 
Aka, varieties of Bantu closely related to Lingala, originally 
spoke a totally different language, about which 
nothing is known. Thus, the situation is similar to 
some of the Khoisan groups in southern Africa, but 
much older, so that language shift is complete. The 
Aka remain ethnically distinct from other Bantu 
groups by their lifestyle and also by their physical 
characteristics. Particularly in the vicinity of Ngbaka- 
Minagende there are small scattered groups of pyg- 
mies who speak 'Baka, varieties of Ngbaka (Ubangi), 
instead of or in addition to Aka (Bantu). Aka is 
usually classified as a member of the Ngundi group 
within North-West Bantu. Information is insufficient 
to ascertain whether or not Aka, as spoken in the 
eastern areas featured on this map, maintains a 
resemblance to the more westerly Ngundi languages, 
or shows greater resemblance to the Ngombe and 
Lingala lingua francas which dominate the eastern 
Aka areas. 

Map 83 focuses on Bas DRC and adjacent Congo 
to the north. Kongo is spoken by large populations as 
a first language, and its pidginized Kituba variety as a 
second language by many others. The area includes 
the southern limits of Fang, very widely spoken as a first 
language to the north of the area on the map. 

Map 84 focuses on Angola, and includes the 
Atlantic entrance to the DRC. There is considerable 
diversity in the area, but most of it is of a continuous 
nature in all directions. To some extent there is 
mutual intelligibility within — but not across — the 
large Umbundu, Kimbundu and Kongo groups. 
Lwena extends beyond its first language area as a 
lingua franca in south-eastern Angola and slightly 
into south-western Zambia. The map shows the 
north-western extent of Khoisan. Small Zhu-speaking 
communities are dispersed throughout southern 
Angola, but seem to be disappearing. They seem to 
be remnants of the former greater area covered by 
Khoisan. Kwadi, northernmost of the Khoisan lan- 
guages of the west coast, is thought by some to have 
resulted from an unusual shift of Bantu speakers to a 
Khoisan language. It appears to be of the Khoe family, 
but research is still in progress. 

Map 85 focuses on coastal Kenya and adjacent 
Tanzania. This represents the extent of Bantu- 
dominant areas in the north-east and its contact with 
Cushitic-dominant areas to its north. Small pockets 
of Cushitic (Waata) persist in the Bantu (Miji Kenda) 
coastal area further south. Otherwise the area is 
exclusively Bantu, with a slight penetration of Nilotic 
Maasai from the interior. Bantu continues northward 
along the coast from Bajuni to Mwini, too far north 
to be represented on the map. Mwini is relatively 
similar to other forms of urban Swahili spoken 
further south, but is considered a separate language 
by its speakers, most of whom use the relatively large 
Bajuni group as a reference for ‘Swahili’. The other 
Sabaki languages, most closely related to Swahili, are 
spoken on or near the Kenyan coast. In some respects 
the Kenyan and Tanzanian dialects of Swahili are 
more similar to Pokomo and Miji Kenda than they 
are to the more distant Mwini and Comorian groups. 

The older literature identifies Segeju with a 
language called Dhaiso. Dhaiso appears to have been 
a southern dialect of Kamba influenced by Digo, 
but cannot be equated with current Segeju. Current 
Segeju is clearly a somewhat conservative variety of 
Digo. The Thagicu languages are relatively similar to 
each other, but relatively distinct from the coastal and 
all other Bantu languages. They have a remote affinity 
to Chagga, spoken in the Kilimanjaro area further 
south (Map 93). Swahili is spoken as a lingua franca 
throughout the area, though often in a pidginized 
form in the Kenyan interior and by non-Bantu coastal 
speakers. In contrast to the great diversity of Bantu in 
the area, there is a great deal of mutual intelligibility 
among the East Cushitic languages of the area. 

Map 86 focuses on interior Kenya, adjacent 
northern Tanzania and Uganda. There is great diver- 
sity among the Bantu languages, but of a relatively 
continuous nature. Languages like Luhya and Chagga 
are highly fragmented into dialects, particularly on 
the basis of phonology. Lugo(o)li [(Ma)Ragoli] is 
distinct from other Luhya languages, and is sometimes 
considered a separate language. On the other 
hand, among the Ugandan varieties of Luhya, Masaba 
and Gisu are often used as synonyms. Otherwise, 
Masaba is the more generic label. The Lake languages 
are particularly continuous, so that it is difficult to 
avoid arbitrariness in deciding where one group ends 
and the next begins. The name Meru, applying to 
both some Chagga and Thagicu varieties of Bantu, is 
of Maasai (Nilotic) origin, and indicates an ethnic 
affinity to the Maasai acknowledged by speakers of 
both Meru varieties. 

Interior Kenya and adjacent Tanzania show 
the southern extent of the East Sudanic branch of 
Nilo-Saharan. Some of these languages are spoken by 
very large populations, especially among sedentary 
groups like the Luo. All the West Nilotic languages 
are relatively similar and quite distinct from the East 
and South Nilotic languages. The Kuliak and Central 
Sudanic languages are also very distinct from each 
other and from the other Nilo-Saharan groups. The 
area as a whole is one of the most heterogeneous of 
sub-Saharan Africa. 

Map 87 focuses on the south-western limit of 
Bantu-Ubangi contact between northern Congo and 
the Central African Republic, and the eastern limits 
of the Bantu area of Cameroon. There is a great deal 
of fragmentation of Bantu in the eastern part of the 
area, bounded in the west by the relatively homo- 
geneous Fang(-Mbeti) group. The small Ubangi 
populations in the Bantu part of the eastern area 
suggest ongoing language shift from Ubangi to Bantu, 
and the recession of Ubangi northwards into the 
Central African Republic. 

Map 88 focuses on the North-West Bantu- 
dominated areas of Gabon, Equatorial Guinea and 
south-western Cameroon. There are a number of dis- 
tinct Bantu groups along the coast, but most of the 
interior is dominated by varieties of Fang. Similar 
Portuguese-based creoles are the majority languages 
of the offshore islands of Principe and Sao Tomé, 
shown on the map. The former colonial language 
French is widely spoken by adults in Congo and 
Gabon, and Spanish in Equatorial Guinea. In the 
Cameroon area and across the southern border the 
English-based pidgin Weskos is commonly spoken in 
the coastal area. 

Map 89 focuses on eastern Central African 
Republic (CAR) and adjacent Chad to the north. 
The area is dominated by the Ubangi branch of 
Niger-Congo in the CAR, and the Central Sudanic 
languages of southern Chad to the north. The east- 
ernmost Adamawan (Niger-Congo) and Chadic 
(Afro-Asiatic) languages also appear in the Chad part 
of the area, as do relatively large Fulani minorities. 
Near the north-west margin of the map around the 
Chari River in Chad, the three major phyla of West 
and Central Africa come into close contact with each 
other: Niger-Congo, represented by the most south- 
eastern Adamawa languages; Nilo-Saharan, repre- 
sented by various Sara-Bagirmi languages of the 
Central Sudanic family; and Afro-Asiatic, represented 
by the most south-eastern languages of the Chadic 
family. 

Map 90 focuses on the western Central African 
Republic and adjacent Chad and Cameroon. As else- 
where, the Ubangi branch dominates the CAR area. 
The East Chadic, Adamawan and Central Sudanic 
languages are all prominent to the north of Ubangi, 
especially in southern Chad, with large areas of con- 
tact among them. Fulani-speaking groups are dis- 
persed in much of the area. Except for some further 
penetration of the large Sara group westwards into 
central eastern Cameroon, the area represents the 
limits of Central Sudanic. The other groups continue 
westwards. Noteworthy is that the separated Nilo- 
Saharan branches of Central Sudanic and Saharan 
come into relatively close proximity in the area of 
western Lake Chad, but nowhere else. The three non- 
West branches of Chadic come into contact further 
south around the Logone and Chari rivers. This may 
be somewhere near to the original area of Chadic 
before diversification took place. 

Map 91 focuses on the Adamawan area of 
Cameroon. The Adamawan branch of Niger-Congo 
takes its name from this area, and is prominent 
in much of the area, Mbum being the major represen- 
tative. To the south, the western limits of Ubangi 
occur. Thus, the map also represents the western 
limits of Ubangi-Adamawan contact to the north, 
and Ubangi-Bantu contact to the south. Intervening 
between Bantu and Adamawan are Bantoid languages 
more closely related to Bantu than to Adamawan. 
Among these the Mambiloid Bantoid language Vute 
is most prominent in the area covered by the map. 
The Fulani actually form a majority population in the 
represented area of Adamawa, and form very large 
minorities, when not majorities, throughout the rest 
of West Africa at this latitude. Ngambay, north of 
the Wina River, marks the western limit of Sara and 
the Central Sudanic family. 

Map 92 focuses on the most south-western area 
of Cameroon. It marks the northern limit of Bantu 
(formerly ‘Narrow Bantu’). Despite the diversity of 
the Bantu languages, they are still relatively similar 
compared to the Bantoid languages to the north. This 
situation suggests that Proto-Bantu spread southward 
and eastward to its current domain from somewhere 
around the area covered by this map, and that proto- 
Bantu itself arose through the diversification of South 
Bantoid languages originally spoken further north 
and east. Despite this implication of the current 
genetic model for the status of Bantu within Bantoid, 
the continuum nature of South Bantoid leaves an 
element of arbitrariness in including the Mbam lan- 
guages within Bantu, but not the more northern 
Grass(fields) languages formerly considered ‘Wide(r) 
Bantu’, as if they had an exclusive relationship with 
current Bantu not shared by other South Bantoid 
languages. The Fang group within Bantu dominates 
a large area of the interior, and Duala extends past 
its L1 area as a local lingua franca. However, the 
Cameroon English Pidgin (Weskos), similar to the 
pidgins used in other areas of the West African coast, 
especially adjacent Nigeria, is the lingua franca more 
widely used in much of the area. 

The south-easternmost Cross language, Efik, also 
makes an appearance along the west coast of this area. 
The Cross languages are very similar to Bantoid, to 
which they are related as branches of East Benue- 
Congo. Current classification favours Cross as the 
group most closely related to Bantoid. However, more 
research is necessary to secure this relationship against 
a special relationship between Bantoid and the 
Central Nigerian branch of East Benue-Congo 
further north. The continuum nature of East Benue- 
Congo may defy resolution of the branch model. 

Maps 93 and 95 focus on the areas of extreme 
fragmentation of Bantoid. Both maps show the very 
extensive and relatively continuous nature of South 
Bantoid, as against the more scattered, though often 
large, areas of other Bantoid languages. Both maps 
also indicate that South Bantoid extends far west into 
Nigeria, while other Bantoid languages are limited to 
a more eastern region, and extend northward into the 
Adamawan area of Cameroon (cf. Map 91). Map 95 
features the northern limits of Cross and the eastern 
limits of the Central Nigerian languages, represented 
by Jukunoid; Bantoid Tiv separates these other two 
East Benue-Congo branches, leaving Bantoid as the 
branch most similar to both. Tiv is a widely spoken 
South Bantoid language in the Benue and Gongola 
regions of Nigeria, and adjacent Cameroon. 

It was formerly thought that Proto-Bantu origin- 
ated somewhere in the border area between Nigeria 
and Cameroon, possibly in the area of greatest South 
Bantoid diversification on Map 95. However, now 
it seems that this is more likely the original area 
of South Bantoid, from which it spread south into 
Cameroon long before proto-Bantu emerged. In this 
case, the Jarawan languages of South Bantoid found 
further north (Map 96) should be the result of 
northward spread of South Bantoid. Given a lack of 
special relationship between the non-South Bantoid 
languages, for instance, the Mambiloid and Dakoid 
branches, proto-Bantoid itself might have emerged 
further north, possibly in the area currently occupied 
by Jukunoid in the Nigeria-Cameroon border area on 
Map 95, between current Dakoid and Mambiloid. 
Map 95 also displays the most southern outliers of 
Chadic. Otherwise the area is exclusively Benue- 
Congo. The isolated positions of Etulo (Idomoid) in 
West Benue-Congo and Baya in Ubangi are remark- 
able as respective eastern and western outliers of their 
larger groups. 

Map 94 focuses on the Chadic area of the 
Cameroons, along with adjacent Nigeria to the west 
and Chad to the east. Fulani and Adamawan mark the 
northern extent of Niger-Congo in the area. By virtue 
of its great spread in Cameroon, the area of Fulani 
north of Maroua marks a location where the three 
major African phyla come into direct contact: Niger- 
Congo represented by Fulani, Nilo-Saharan repre- 
sented by Kanuri, and Afro-Asiatic represented by 
Central Chadic languages such as Wandala (cf. Map 
89). The area is one of major fragmentation of Cen- 
tral Chadic, and of Masa, possibly its closest relative 
elsewhere in Chadic, and possibly close to the original 
area of proto-Chadic, as mentioned in conjunction 
with Map 90. 

Maps 96 and 97 focus on the Plateau area of 
Central Nigeria. East Benue-Congo is prominent in 
its Central-Nigerian and Bantoid branches. The 
Jarawan Bantoid languages are far north and west of 
all the other South Bantoid languages, well into the 
predominantly Afro-Asiatic area of Hausa and other 
Chadic languages. The occurrence of Jarawan in this 
area raises questions about either its historical rela- 
tionship to other South Bantoid languages, or the 
special relationship of (South) Bantoid to Central- 
Nigerian (as opposed to its relationship to Cross). 
Jaku is a particularly widespread Jarawan language, 
occurring in several pockets on both maps, and well 
into the predominantly Kainji area in the west. 

Map 96 shows the southern limits of the exten- 
sive West Chadic area, with a major area of fragmen- 
tation intervening between the Plateau and Jukunoid 
branches of Central-Nigerian Benue-Congo. Hausa 
serves as a lingua franca in most of the area of both 
maps. Jukunoid Wukari, itself influenced by Hausa, is 
a more local lingua franca in and around its dominant area. 
Map 97 shows most clearly the spread of Hausa far 
north and east of the area of (West) Chadic diversity, 
or of any other Chadic languages. Smaller groups of 
Fulani co-exist with Hausa throughout the area. 

Map 98 focuses on the north-east Nigerian area, 
where the West and Central (Biu-Mandara) branches 
of Chadic come into contact. The Adamawan branch 
of Niger-Congo is most prominent and highly 
diversified here, and interpenetrates with the Jarawan 
and Dakoid groups of Bantoid. Leko seems to refer to 
both Adamawan and Dakoid languages spoken in 
the same territory, as a result of political and ethnic 
relationships between speakers of each language. The 
Dakoid language Dong was formerly classified as an 
Adamawan language, presumably owing to Adama- 
wan influence. Hausa is the general lingua franca of 
the area. The southern limit of the major Saharan 
language Kanuri is seen in the north, to the south of 
the Nafada River near Nafada town. 

Map 99 focuses on the eastern part of the 
Kanuri—Chadic contact around Lake Chad, between 
Chad, north-east Nigeria and northern Cameroon. 
Around the eastern border of the Cameroon and the 
Lake it can be seen that intervening Central Chadic 
languages prevent direct contact between the Central 
Sudanic and Saharan branches of Nilo-Saharan. Fulani 
communities are as numerous in the Kanuri area as in 
the Hausa area further west (cf. Map 100). Thus, it is 
likely that there is contact among the three major 
African phyla along the border between Kanuri and 
Chadic similar to that seen in Map 94. However, such 
a degree of contact is not prominent enough to regis- 
ter precisely on this map. Teda and Daza are both 
related to Kanuri and relatively distinct from each 
other, but dialect diversity within Teda and Daza does 
not seem to be great. Most Teda and Daza have 
migratory lifestyles in the Sahara desert, and extend 
northward up to Libya along with other groups 
of similar lifestyle, such as the Shuwa among the 
Saharan Arabic-speaking groups. 


Map 100 focuses on the westernmost extension 
of Kanuri from Lake Chad (see Map 99), and its con- 
tact with Chadic, particularly the extensive Hausa 
(West Chadic) area of northern Nigeria. The (East) 
Benue-Congo continuum extending into southern- 
most Africa as the South Bantu languages (Map 74) 
has its other end in north-west Nigeria, around the 
Kainji reservoir, as the West Kainji group. There is 
contact in this area between Benue-Congo and the 
easternmost languages of a major Niger-Congo 
family of northern West Africa, Mande. Mande is rep- 
resented by Busa and Shanga in the vicinity of Kainji. 
Mande languages will figure on all the more western 
maps up to the Senegalese coast (Map 108). 

Map 101 focuses on the fragmentation of Benue- 
Congo in south-western Nigeria. Hausa is the lingua 
franca in the northern area. In the southern area, 
Yoruba is widely spoken in the west and Igbo in the 
east. As first languages, these and the Edoid languages 
exhibit relatively great internal diversity. Ijoid is a 
Niger-Congo isolate with little similarity to other 
Niger-Congo languages either with respect to vocabu- 
lary or grammar. It is principally spoken in the delta 
area, and is in contact with Benue-Congo languages 
spoken in the delta, particularly the Delta Cross and 
Edoid languages. Despite its distinctive unity, it has a 
great amount of internal diversity. Although they are 
very diverse, most of the Benue-Congo languages of 
the area are more similar to each other than they are 
to other Niger-Congo groups. Except for Cross, the 
Benue-Congo groups at or near the coast are quite 
similar to the Kwa group further west, with which 
they used to be classified. Cross in the east and Kainji 
in the north are more similar to Bantoid. Idomoid is 
somewhat transitional between these two types, as 
befits its location. With West Kainji, the westward 
extension of the Benue-Congo continuum ends in the 
north. The easternmost representatives of the Gur 
and Mande branches of Niger-Congo appear in 
the north-west of the area. Nigerian Pidgin English, 
akin to Weskos in Cameroon, is also widely spoken as 
a lingua franca in the coastal urban areas and 
northward. 

Map 102 focuses on Benin, Togo and adjacent 
Niger. Yoruba represents the westernmost extension 
of Benue-Congo in the south, where it is in contact 
with the easternmost Kwa languages, Fon and Ewe, 
closely related members of the Gbe group. Other 
eastern Kwa languages are spoken by relatively small 
populations in the southern area. The occurrence 
in the Gur area of Chakosi, a Kwa language of the 
northern Bia group, is noteworthy. The other Bia 
languages are spoken much further south and west 
(Maps 108 and 110). 

In the north, Gur languages dominate. Bariba 
(Bargu) is sufficiently distinct from the other Gur 
languages to constitute a separate East Gur group. Of 
the Central Gur languages, Gurma is most widely 
spoken in the area. It is in contact with Hausa in 
Niger. The map shows the western limit of Hausa. 
In Niger it is in contact with the Songhay branch of 
Nilo-Saharan. While Hausa is spoken over such a 
wide area that it is in contact with both Songhay and 
Kanuri, there is no contact between the latter two 
western Nilo-Saharan branches. Their similarities and 
relationship are remote. Gur separates the East 
Mande languages of north-west Nigeria from other 
Mande languages, all spoken west of the area of the 
map. Various distinct Fulani-speaking groups form 
sizeable minorities in the Gur-speaking area. Dialect 
differences are relatively slight between adjacent 
Fulani-speaking groups, though across the entire 
Fulani continuum the differences are sufficient to 
impede initial communication. Dendi marks the 
southern limit of the Songhay group, crossing into 
extreme north-east Benin and north-west Nigeria. 

Map 103 focuses on Ghana. Central Gur lan- 
guages dominate the north, and various groups of 
Kwa languages the south. The most widely spoken 
group of Kwa languages in the area is Akan. The Akan 
lingua franca is often called Twi, but the Akan lan- 
guages are so similar that there is not much linguistic 
difference between the terms Akan or Twi for the 
general lingua franca of southern Ghana and the 
individual Akan languages. The Ga-Adangme group, 
spoken in and around Accra, and Ewe are markedly 
different. Also different are the Na-Togo and Ka-Togo 
groups. Although they are both highly fragmented 
along the south Ghana—Togo border area, and are in 
some degree of contact with each other, they are now 
considered to be more distant groups, as different from 
each other as each is from the Tano and Gbe branches 
of Kwa. Formerly, Na- and Ka-Togo were classed 
together as a ‘Central Togo’ group. That grouping was 
the residue of an earlier label, ‘Togo remnant’ (translating 
the German Togo Restsprache), by which these 
groups were noted for their striking resemblance to 
Bantu, as if they were remnants of Bantu in Togo. 

That was before it was clearly understood that 
the similarities between Central Togo and Bantu, par- 
ticularly the system of noun-class prefixes, are the 
result of retention of an earlier southern Niger-Congo 
feature, lost to a large extent in the intervening coastal 
area of Kwa and Benue-Congo. Many Guang languages 
also share this feature, as do some of the southern 
Central Gur languages. However, in most Central Gur 
languages noun classes are marked by suffixes, charac- 
teristic of more northern Niger-Congo groups. A few 
eastern Mande languages are spoken in northern parts 
of the area, but the main Mande area is further west. 

Map 104 focuses on Burkina Faso and adjacent 
areas. This area is dominated by Gur languages. 
Moore is the most widely spoken of these languages, 
both as a first language and as a lingua franca. The 
western part of this area is the main area of Gur— 
Mande contact, though further east Bisa is isolated by 
Gur from other Mande languages. The West Gur lan- 
guages and Dogon, formerly classified as part of Gur, 
share certain grammatical features with Mande, par- 
ticularly OV (Object—Verb) word order, manifestly the 
result of contact among these Niger-Congo branches. 

Remarkable is the presence in south-west 
Burkina Faso of the (probable) Kru outlier Seme, 
amidst western Gur languages. A current notion is 
that Seme is a remnant of the former wider presence 
of Kru, or its ancestor, in this northern area, as part of 
the West/North Volta-Congo continuum, including 
the Gur-Adamawa languages. Subsequently, Seme was 
isolated from the main body of Kru along the coast by 
the intervention of Gur and Mande languages, as 
seen more clearly in Map 105. The Songhay branch 
of Nilo-Saharan is spoken continuously along the 
Niger River from western Mali to Zarma and Dendi 
(Map 102) in the east. Among the key dialects of the 
Songhay continuum, Kaado and Humburi Senni are 
most notable as the only OV varieties of Songhay, 
mostly east of the extensive Niger-Congo OV men- 
tioned above. However, Humburi Senni has an ‘island’ 
close to Toro Tegu, at the eastern end of the OV 
Dogon continuum. 

Map 105 focuses on the Ivory Coast. The Kru 
branch of Niger-Congo appears along the coast as the 
hinge between the East/South Volta-Congo con- 
tinuum, where Kru and Kwa meet, and the West/ 
North Volta-Congo continuum between Kru and 
Gur-Adamawa. The map shows clearly how Seme is 
isolated from the main body of Kru by West Gur and 
Mande languages (cf. Map 104). Gur-Senufo in Ivory 
Coast and adjacent southern Mali marks the western 
limit of the North Volta-Congo continuum. The 
eastern limit is Ubangi Zande in DRC Orientale and 
across the border in south-west Sudan (Map 80). 
West of North Volta-Congo is the West branch of 
Mande. West and East Mande come into contact 
between Gur-Senufo and Kru. 

Maps 106 and 107 focus on Liberia, Sierra 
Leone, Guinea and Guinea Bissau. The western extent 
of Kru is delimited along the coast on Map 106. 
Further north the coast is dominated by the southern 
groups of the Atlantic branch of Niger-Congo. 
Speakers of Fulani of the northern branch of Atlantic 
are also numerous in part of this area, especially 
Guinea. Most of the interior is dominated by Mande 
languages, especially Manding. Some Southern 
Mande languages reach the coast in this area, such as 
Mende, Susu and Vai. The coastal layer of Atlantic 
languages is diverse and thin. As suggested by the 
extension of the Mande languages to the coast in 
various areas, there is a trend in the area towards 
bilingualism, and language shift by speakers of the 
smaller Atlantic languages to Mande languages and/ 
or the larger Atlantic languages, especially to Temne 
in its area. Kisi, Gola and Limba are relatively resistant
to language shift, and maintain large numbers 
of speakers. Varieties of English, including pidgins 
and L2 varieties of creoles, are widely spoken as 
lingua francas in Liberia and Sierra Leone. Liberian 
Settler English is mainly confined to communities of 
American descent, for whom it is a first language. A 
more general Liberian Pidgin English is more widely 
spoken as a lingua franca. The Sierra Leone English 
creole, Krio, is spoken as L1 in Freetown by descend- 
ants of slaves resettled there after being freed in 
former British North America or from intercepted 
slave-ships. Lexical influences on Krio show that 
many of the vessels carried speakers of Yoruba among 
other coastal Nigerian and Benin languages. Krio is 
also commonly spoken throughout Sierra Leone and 
beyond as a lingua franca. 

Map 108 focuses on the Senegambian area. It 
delimits the north-western extent of Niger-Congo. 
For the most part, the more coastal area is dominated 
by the northern group of Atlantic languages. The rest 
of the area is dominated by Western Mande, espe- 
cially the extensive Manding group. There is con- 
siderable diversity in North Atlantic, especially in the 
Diola and East Seneguinea groups. The issue of ‘dia- 
lect’ or ‘language’ persists among varieties of Diola in 
particular. The original homeland of the Fulani is in 
the interior of northern Senegal. The languages most 
closely related to Fulani are spoken in this area. The 
lingua franca of Senegal and Gambia is Wolof, rela- 
tively closely related to Fulani. Bijago is the most 
divergent of the Atlantic languages, so that it is classi- 
fied as a distinct branch. It is somewhat protected 
from the language shift affecting other small Atlantic 
languages by its offshore location. However, it is also 
subject to dialectal fragmentation. Further north, 
Atlantic becomes more resistant to language shift and 
maintains large numbers of speakers, starting with 
Balanta in the south. 

Nevertheless, Manding reaches the coast further 
north in Gambia, continuing the trend towards 
Mande expansion towards the coast seen further 
south. Elsewhere, it is very widely spoken both as 
a first language and lingua franca in the north- 
west interior of sub-Saharan Africa, between Wolof 
(Northern Atlantic) and Moore (Central Gur). 
Pockets of Tanda/Tenda (of the East Seneguinea 
group of Atlantic) are so small and mobile in the 
Fulani area just east of Nalu and Balanta that they are 
not represented on the map. Crioulo, a Portuguese-based
creole, originating in Portuguese-African contact
in the late fifteenth century, is widely spoken as a 
lingua franca in Guinea Bissau. 


Pie-charts 


The pie-charts given here, based on the latest 
available census figures (1996), illustrate the linguistic 
composition of each province of South Africa. The 
figures relate to the country’s 11 officially recognized 
languages. 
information provided by the Musée includes the 
Narrow Bantu areas of Angola, Botswana, Burundi, 
Cameroon, Central African Republic, Congo, Equa- 
torial Guinea, Gabon, Namibia, Rwanda, Tanzania 
(south-western area), Zaire (detailed sectional maps) 
and Zambia. 

Tucker Childs of Temple University prepared a 
map and ample information about the Atlantic lan- 
guages. The area covered by his map includes the 
Atlantic areas of Gambia, Guinea, Guinea Bissau, 
Liberia, Senegal and Sierra Leone. 

John Kelly of the University of York prepared 
a map and provided further information about the 
language groups of southern Nigeria. 

I would also like to express gratitude for the 
generous advice I got from Phil Jagger of the University
of London about how to handle the classification
of the Chadic languages, and to Russell Schuh of 
the University of California at Los Angeles for comments
on my introductory discussion of Chadic. 
Similarly, I would like to thank Arthur Irvine of the 
University of London for his courtesy and effort in providing
me with information to help me determine the 
northern extent of the sub-Saharan region. I am also 
indebted to Tom Hinnebusch at UCLA for his generos- 
ity in offering me sources of information to facilitate 
checking maps and population statistics. Finally, I must 
thank the atlas editor Christopher Moseley and the 
publication staff, particularly Jonathan Price, for 
attentiveness, help and even hospitality during the 
period within which this atlas was produced. 

As regional editor, I was responsible for prepar- 
ing final versions of the sub-Saharan maps for pub- 
lication, and for the text. Included in the editing 
process was reconciliation of some differences in 
areas of overlap between different contributors, and 
sometimes the addition of further information on 
particular maps, where I thought it important in view 
of the literature on African languages. Thus, I must 
assume responsibility for any errors I may have intro- 
duced into the published versions of the maps. Nat- 
urally this does not include inevitable errors that 
unsuspecting generations of cartographers may have 
dragged through time from earlier source maps, in the 
same way that medieval scribes faithfully copied errors 
from their source texts. It is of course to be hoped that 
both my own and traditional errors are minimal, but it 
is realistic to assume that there are at least a few errors 
on these maps that can be corrected in the future, as 
on most other complex maps of the kind. 


Appendix to Section 10.1.2 


A troubling but instructive reversal of the phenom- 
enon of pejorative labels for languages and their 
speakers unwittingly accepted in African linguistic 
scholarship is the offensiveness of the term ‘Bantu’ to 
most South Africans. In this case, a term which ori- 
ginated in the linguistic literature became pejorative 
when it was expropriated for political purposes by the 
openly racist Apartheid regime in the context of the 
total socio-political disenfranchisement and eco- 
nomic exploitation of the African-origin peoples to 
whom this label was applied. The manner in which 
the term ‘Bantu’ became tainted with racism in South 
Africa is clearly exemplified by comparing the South 
African colonialist Native Lands Act of 1913* with the 
Bantu Homelands Act of 1951,† both extremely repressive
legislative acts targeting the African-origin segments
of South African society. The term ‘Native’ for 
people subjected to European-origin domination 
throughout the world has long been recognised as 
pejorative, according to the attitudes reflected in its 
colonial and imperial uses. The Oxford English Dictionary
(OED) recognises that the term ‘Native’ has 
offensive senses in such contexts. The Apartheid 
regime replaced ‘Native’ with the term ‘Bantu’, in 
part to take advantage of the respectability originating
in its use in linguistics and the social sciences. 
(The OED has not yet recognised offensive senses of 
the term ‘Bantu’ in the South African context, 
although among many other words it attends to 
senses peculiar to South Africa.) Predictably, the taint 
that had adhered to the term ‘Native’ under racism 
and economic exploitation spread to the term ‘Bantu’ 
in South Africa under the same and worsening conditions,
and has lingered as a reminder of those former 
conditions since the dismantling of Apartheid. Outside
of South Africa the term ‘Bantu’ has been continuously
used since 1862 to refer to the scientifically 
demonstrable family of languages so designated, but 
within South Africa the damage done to that term by 
Apartheid policy remains troubling. 


* The 1913 act reserved the least productive 7.3 per cent of the 
land of South Africa as the only land that could be owned by the 
members of the African-origin population, 80 per cent of the total 
South African population. 

† Typical of Apartheid practice, the 1951 act stripped citizenship 
from the so-called ‘Bantu’, the Bantu-speaking African-origin 
majority population of South Africa, and relegated them to 
‘homelands’, based on the only lands that members of this population
were allowed to own according to the 1913 act. 

Language index 


After any other characters, alphabetical order is word-by-word. In the sequence, accents are ignored; other marks come before any letter or numeral. 
Names of language groups — phyla, families and so on — are in italics. Where the name of a language and the family it belongs to would be adjacent in the index, the 
language is shown as a subhead under the family or other group. 
Names that begin with North, South, East or West (and the like), Central, Upper or Lower, are with a few exceptions indexed under the substantive name. So Lower 
Chehalis and Upper Chehalis are found at C, not at L or U. 
The index tries to accommodate conflicting ways of naming and classifying languages. Alternative names are given in parentheses (curves). Where it may be helpful, 
names are followed by the larger grouping to which they belong [in brackets] or occasionally a place name or other clarification (including Guthrie numbers — e.g., Gwa 
[Bantu: C90] — for African languages). 
Page numbers in italics show the name is mentioned only in a figure or diagram, not in the text; maps are indexed by their map number. Page ranges (such as 34-7) 
typically indicate a series of scattered references; there may also be more than one reference on a single page. 
Abbreviations: N. North, S. South, E. East, W. West. 


ǂhoa see Huang, Eastern 

!khong (!Xóõ) 313, Map 74 

!khung (!'o-!xung) 312-13, Map 74 

!'o-!khung (!'o-!xung) 313, Maps 74, 84 

!'o-!xung see !khung; !'o-!khung 

!ora (Korana) Map 74 

!wi-Taa (South Khoisan) 307-8, Map 74 

!xung see !khung 

‘Ababda Map 71 

‘Ajman Map 70 

‘Amarat 279, Map 70 

‘Anazi [Arabic dialects] 279, Map 70 

‘Aniza see ‘Anazi 

‘Awazim 279, Map 70 

‘Umir Map 70 

‘Utayba Map 70 

‘Are'are 109, Map 29 

‘Baka 330 

‘Dongo 313, Map 79 

‘Dugwor 313, Map 94 

‘Kirdasca’ 233n. 

‘Kirmanca’ 233n. 

Ani 313, Map 74 

‘Auni Map 74 

Gana (Gllana) Map 74 

Gui (G/ui) 313, Map 74 

Kham Map 74 

Kh'au-ll'e (IIX'au-ll'e) 313, Maps 74, 77 

Khegwi Map 74 

EKhomani Map 74 

X'au-ll'e see IIKh'au-ll'e 

Sassaorta [Saho dialect] 286 


A-Puckiwar [Andamanese] 217 
A-uo (Red Gelao) [Gelao variety] 182, 184 
A40 [subgroup] Map 93 
Aari (Ari, Are) 288, 289, Map 72 
‘Ababda Map 71 
Abaga [Shilingol subdialect] 191 
Abaga nar 191 
Abaga [Trans-New Guinea: Finisterre] 
110, 112, Maps 32-33 
Abai Sungai 121, Map 41 
Abakan Tatar see Khakas 
Abaknon 119, Map 40 
Abasakur 112 
Abau 113, 115, Maps 33-34 
Abayongo 313, Map 101 
Abaza 233-4, 238, Map 59 
Abdal Uighur see Aynu 
Abdalla 313, Map 85 
Abdwak 313, Map 85 
Abelam (Abulas, Ambelas) 113, Map 33 
Abenaki 7, 17, Maps 1, 8-9 
Eastern 14, Map 1 
Western 25, Maps 1, 8-9 
Abenlen Ayta 119, Map 40 
Abey 313, Map 105 
Abia 110, Map 31 
Abiem [Jieng dialect] 284 
Abiji-Ajukru-Abey Map 105 
Abiji 313, Map 105 
Abini 313, Map 101 
Abinomn 99 
Abinsi 313, Maps 95-96 
Abipón 72, Map 22 
Abitana-Kumana 65, Maps 16, 20 
Abkhaz-Adyghe 238, Map 59 
Abkhaz 234, 238, Map 59 
see also Caucasian, North, West 
Abor see Adi 
Aboriginal Malay see Malay 
Aborlan-Tagbanwa 119, Map 40 
Abrap see Taikat 
Abron (Abrong, Brong) 303, 311, 313, 
Maps 103, 105 
Abruzzese 261, Map 67 
Absa 313, Maps 96, 101 
Absakur Map 33 
Abu-Chesu (Arum-Tesu) 313, Map 96 
Abu-Odual [Bantoid-Cross] Map 101 
Abua 313, Map 101 
Abua [Central Sudanic] 313, Map 79 
Abui 119, Map 38 
Abulas see Abelam 
Abuldugu [Burun dialect] 284 
Abulu (Nabulu) 323, Map 79 
Abure 313, Map 105 
Acadian French (Cajun) 26 
Acateco see Akateko 
Acaxee Map 3 
Acehnese (Aceh) 109, 121, 176, 182, 
Map 43 
Achagua see Achawa 
Achang (Ngochang, Maingtha) 167, 171, 
175, Maps 46, 49 
see also Chintao 
Achawa (Achagua) 65, Maps 12-13 
Aché see Guayaki 
Achik see Lashi (Achik Garo) 
Achipa 313, Maps 100-101 
Acholi 313, Maps 72, 79, 86 
Achuar-Shiwiar see Hivaro-Kawapánan 
(Jivaro-Cahuapana), Hivaro 
Achumawi 8, 10, Maps 4, 8-9 
Acoma-Laguna 9-10, Maps 3, 5, 8-9 
Adai 7, 9-10, Map 2 
Adal see Afar 
Adamawa-Ubangi 305 
Adamawan 306, 308, 330-1 
see also Gur-Adamawa 
Adangme (Dangme) 313, 332, 
Maps 102-103 
Adasen 119, Map 40 
Adelbert 
Madang (Madang-Adelbert Range) 99, 
Map 33 
Southern Adelbert Range-Korak-Waskia 
99, Map 33 
Adele (Sedere) 313, Maps 102-103 
Aden [Yemeni dialect] 280, Map 70 
Adhola 313, Map 86 
Adi-Mising-Nishi 168-9, 175, Map 51 
Adi (Abor) 169, 213, Maps 45, 51 
Adie 313, Map 92 
Adim 313, Map 101 
Adja (Aja) 311, 313, Map 102 
Adjora (Adjoria) 113, Map 33 
Admiralty group 109, Map 30 
Eastern 109, Map 30 
South-East 109, Map 30 
Western 109, Map 30 
Adnyamathanha 101, 117, Map 35 
Adur [Nosu dialect] 172 
Adyghe (West Circassian) 233-4, 238, 
Map 59 
Adzera 111, 113, Maps 32-33 
Aeka 110-11, Maps 31-32 
af-Boon (Boni) 285, 287, Map 72 
af-Ciise [Somali dialect] 285, 287 
af-Degodiya (Degodia) 287, 315, 
Map 85 
af-Dolbohaante 287 
af-Geedabuursi 287 
af-Isaaq 287 
af-Jiddu 287 
af-Majerteen 287 
af-Ogaadeen (Ogaden) 287, 325, 
Map 85 
af-Warsangeli 287 


Afa'da (Afa'de) 313, Maps 90, 99 
Afar-Saho 286, Map 72 

Afar (Qafar) 285-6, Map 72 
African American Vernacular (Ebonics) 


ay 
Afrikaans 302, 307, 309, 311, 312-13, 329, 
332-3, Map 74 


Afro-Asiatic see Berber, Chadic, Cushitic, 
Iranian; Omotic, Semitic 
Afro-Seminole Creole Maps 8-9 
Aga Buryat 190, 239 
Agala 110, 112, Maps 32-33 
Agara'iwa 313, Maps 100-101 
Agaw see Cushitic, Central 
Agawr see Kunfal 
Agbo 313, Map 101 
Aghem 308, 313, Map 95 
Aghu [Awyu-Dumut] 114, Map 34 
Aghu-Tharnggal 101, 117, Map 35 
Agi [Moru dialect] 284 
Agi [Torricelli: Palei] 113, Map 33 
Agni see Anyi 
Agob (Dabu) 111, Map 32 
Agoi-Doko-Iyoniyong Maps 93, 95, 101 
Agoi 313, Maps 93, 95 
Agta see Cagayan Agta 
Aguacateco see Awaketeko 
Aguaruna see Awaruna 
Agul 234, 239, Map 59 
Agusan Manobo 119, Map 40 
Agutaynon 119, Map 40 
Agwagune [Upper Bantoid-Cross] 
Maps 93, 101 
Agwagune 313, Maps 93, 101 
Agwamin 101, 117, Map 35 
Agwok [Jieng dialect] 284 
Ahanta 313, Map 103 
Aheu (Phonsung) 177-8, Map 47 
Ahlo-Bowiri Maps 102-103 
Ahlong (Ahlo) 313, Maps 102-103 
Ahom 180, 184 
Ahtna 8, 10, Maps 7-9 
Aian 113, Map 33 
Aiang 166 
Aibondeni 115, Map 34 
Aicham see Jin 
Aigang see Keiga 
Aikana 74, Maps 16-17, 20-21 
Aiki-Kibet see Runga-Kibet 
Aiku 113, Map 33 
Aikwakai 115, Map 34 
Aimaq 211, 216 
Aimaq-Firozkohi Map 53 
Aimaq-Jamshidi Map 53 
Aimaq-Taimani Map 53 
Aimaq-Taimuri Map 53 
Aimara (Aymara) 70, Maps 20-23 
Aimol 168 
Aimoré 72 
Ainu 185-6, Map 50 
Aiome 113, Map 33 
Aion 113, Map 33 
Airo-Sumaghaghe 114, Map 34 
Airoran 115, Map 34 
Aisor see Aramaic 
Aiton 180, 184, 217, Map 51 
Aiwo 109, Map 29 
Aizi Map 105 
Aizi 313, Map 105 
Aja [Kresh-Aja] 283-4, Map 72 
Aja (Gbaya) 285, 313, Maps 72, 79, 
89 


Aja [Kwa: Gbe] see Adja 
Ajachemem (Juaneño) see Luiseño 
Ajié 108, Map 27 

see also Wailu 


‘Ajman Map 70 
Ajukru 313, Map 105 
Ajuran (Ajuraan) 313, Maps 72, 85 
Ak [Papuan: Sepik] 113, Map 33 
Aka [Bantu: Bobangi-Ntomba] 313, 330, 
Maps 82, 87 
Aka [Bantu: Mbosi] 313, Map 87 
Aka [East Sudanic] 284, Map 72 
Aka [Tibeto-Burman] see Hrusso 
Aka-Ngombe [Bantu: Mbosi] 313, 
Map 89 
Akan 302, 303, 310-11, 331-2, 
Maps 102-103, 105 
Akateko (Acateco) 45, 52 
Akawaio 76 
Akawe see Awe 
Ake 313, Map 96 
Akei 108, Map 27 
Akha 173, 175, 181, Maps 46-49 
Akheu 173-5, Map 46 
Akhvakh 233-4, 239, Map 59 
Akiapmin see Tuwari 
Akimel O'odham (Pima) see Piman, 
Upper 
Aklanon 119, Map 40 
Akoid 173, 175, Map 46 
Akokisa Map 2 
Akokoid Map 101 
Akoko 313, Map 101 
Akoose 313, Maps 92-93 
Akpa 313, Map 101 
Akpafu-Lolobi (Siwu) 313, 
Maps 102-103 
Akpes 306, Map 101 
Akpes 313, Map 101 
Akpet Map 93 
Akposo 313, Maps 102-103 
Akroa (Koroa) 73, Map 18 
Akrukay 113, Map 33 
Aksana 71, Map 22 
Akuawa (Parakana-Asurini) 74, 
Maps 18-19 
Akuriyo (Tiryometesem) 75, Map 12 
Akutu 312-13, Maps 79-81 
Akwa [Bantu] 313, Maps 81, 83, 87 
Akwapem 303, 313, Maps 102-103 
Akwe 73 
al Murra Map 70 
al-Bir‘at Map 71 
al-Hasa Map 70 
Ala [Ngwi] see Aluo 
Alaba Map 72 
Alabama 9-10, 17, Maps 1-2, 8-9 
Alabo 287 
Alabugat Tatar see Tatar 
Aladian Map 105 
Aladian 313, Map 105 
Alago 313, Maps 95-96 
Alagwa 287, 313, Maps 76, 86 
Alak 177-8, Maps 47-48 
Alam see Zilmamu-Olam 
Alamblak 113 
Alamblak 113, Map 33 
Alangan 119, Map 40 
Alas-Kluet 121, Map 43 
Alaskan Inupiaq, North 7, 15, Maps 7-8 
Alaskan Yup'ik, Central 7, 12, 27, 
Maps 7-8 
Alawa 116, Map 35 
Albanian 259 
Albanian (Shqip) 233-4, 237, 259-60, 
Maps 61, 66 
Arvanatika 260 
Gheg 259-60, Map 66 
Tosk 259-61, Map 66 
Albay Bikol 119, Map 40 


Alege 313, Map 95 
Alemannic 261-2, Map 65 
Aleut 7, 10, Maps 7-8 
Aleut 234, 239, Map 55 
see also Mednyj Aleut 
Alfendio (Arafundi) Map 33 
Algeria [Arabic subgroup] Map 71 
Algic 7, Maps 1-2, 4-9 
Algiers [Arabic dialect] Map 73 
Algonkin-Wakashan 9 
Algonquian 7, 10, 46, Maps 1-2, 5-7 
Algonquin 7 
Northern (Algonquin du Nord) 7, 
20, Maps 1, 7-9 
Old 7, 20, Maps 1, 7 
Southern (Nipissing) 7, 23, 
Maps 8-9 
Virginia Algonquian 7, 25, Map 1 
Algonquin [Ojibwe dialect] 14 
Ali [Austronesian] 113, Map 33 
Ali [Ubangi] 313, Maps 87, 89 
Allentiac see Alyentiyak 
Alngith 101, 117, Map 35 
Alor 119, Map 38 
Alqosh see Fillihi 
Alruguya [Evenki dialect] 192 
Alsean 7-8, 10, Map 4 
Alta 
Northern Alta 119, Map 40 
Southern Alta 119, Map 40 
Altaic 159, 231, 237, Maps 45, 53, 55-60, 
66, 70 
Altai (Oirot, Altai-Kizhi) 190, 233-4, 
238, Maps 55-56 
see also Manchu-Tungus, Mongolic, 
Turkic 
Alu 113, Map 33 
Alune 118, Map 37 
Aluo 172, 175, Map 46 
Alur (Jonam) 313, Maps 79, 86 
Alutiiq (Pacific Yupik) 7, 20, Maps 7-8 
Alutor 234, 239 
Alyawerr 101, 117, Maps 35-36 
Alyentiyak (Allentiac) 71, Map 22 
Alyerrereng 101 
Ama (Waniabu) 112, Map 33 
Amadi 313, Map 79 
Amahai 118, Map 37 
Amaimon 112, Map 33 
Amal 113, Map 33 
Amanab 112, Map 33 
Amanayé 62, 74, Maps 18-19 
Amandara see Mandara 
Amapa Creole 27 
Amapari Wayampi 74 
Amaranna see Amharic 
Amaracaire see Amarakaéri 
Amarakaéri (Amaracaire) 65, 
Maps 16-17, 20-21 
‘Amarat 279, Map 70 
Amarizana 65, Maps 12-13 
Amarro see Koyra 
Amawaka 71 
Amawaka (Amahuaca) 71, 
Maps 14-16, 20-21 
Amazon, Upper 65-6, Maps 12-13, 16-17 
Amba 313, Map 79 
Ambae, North-East 108, Map 27 
Ambai 115, Map 34 
Ambala Ayta 119, Map 40 
Ambam 313, Maps 88, 92 
Ambasi (Dawari) 110-11, Maps 31-32 
Ambelas see Abelam 
Ambelau 118, Map 37 
Ambele 313, Maps 93, 95 
Amberbaken 115, Map 34 
Amblong 108, Map 27 
Ambo 313, Maps 75, 77 
Ambonese see Malay, Malayu Ambon 
Ambrym 
North 108, Map 27 
South-East 108, Map 27 
Amchoke 166 
Amdo [N.E. Tibetan] 164 
Amele 105, 112, Map 33 
American Arctic-Paleosiberian 9 
Amerind 9 
Amgbe 313, Map 101 
Amharic 277-8, 285, 287-8, 310, Map 72 
Amis 182, 184, Map 50 
Amniapé Maps 16-17, 20-21 
Amo 313, Map 97 
Amo 313, Map 97 
Amoesha (Yanesha) 66, Maps 14-15, 
20-21 
Amonap (Matipu, Kuikuro, Kalapalo) 76, 
Maps 18-19 
Ampale see Safeyoka 
Ampanang 120, Map 41 
Ampari 313, Map 104 
Amri 168 
Amto-Musian 113, Map 33 
Amto (Ki) 113, Map 33 
Amu [Bantu: Kongo] 313, Map 85 
Amur [Daur dialect] 192 
Amur Tungusic 238 
Amurag 116, Map 35 
Amuru 289 
Amusgo-Mistekan 53 
Amusgo (Amuzgo) 52-3, Maps 10-11 
Amuzgo see Amusgo 
Ana 313, Map 102 
Anakalanga 118, Map 38 
Anal 167-8, 175, 213, Map 51 
Anambé 74 
Anang (Anaang) 313, Map 101 
Anatolian 259 
Anauya 65 
‘Anazi [Arabic dialects] 279, Map 70 
Ancash Kechua (Kichua, Quechua) 70, 
Maps 15, 21 
Andaki (Andaqui) 63, Map 12 
Andalusian (Spanish) Map 68 
Andamanese 217, Map 52 
Andarum 113, Map 33 
Andi 233-4, 239, Map 59 
Andio [Sulawesi] 120, Map 39 
Andoke (Andoque) 69, Maps 13, 15 
Andokero (Andoquero) 69, Map 12 
Andoque see Andoke 
Andoquero see Andokero 
Andra-Hus 109, Map 30 
Andri [Moru dialect] 284 
Andro (Undro, Phayeng) 167 
Aneityum 108, Map 27 
Aném 99, 109, Map 30 
Anfillo (Southern Mao) 288, 289, Map 72 
Angal (Mendi)-Kewa 110, 112, 
Maps 32-33 
Angal (Mendi) 110, 112, Maps 32-33 
Angama-Pochuri (Naga C1) 167-8 
Angami 167-8, 213 
West Map 51 
Angan 99, 99, 111, Map 32 
Angaataha 99, 111, Map 32 
Nuclear Angan 99 
Angas Maps 96-98 
Angas 313, Maps 96-98 
Angaua 111, Map 33 
Angba 313, Map 79 
Anggor (Senagi, Watapor) 112, Map 33 
Angika 214, Map 51 
Angka [Tibeto-Burman] see Hrusso 
Angkamuthi [Uradhi dialect] 101, 117, 
Map 35 
Angkola-Mandailing 121, Map 43 
Angkuic (Bulang) 175-6, 178, Map 46 
Angoram (Olem, Pondo) 113, Map 33 
Angwangku see Konyak 
Anhui (Gan) see Chinese (Sinitic), Central 
Animere 313, Map 103 
Anindilyakwa 100, Map 36 
Anishanabe, Anishinaabemowin see 
Ojibwe, Central Southern 
‘Aniza see ‘Anazi 
Ankave 111, Map 32 
Anmatyerr 101, 117, Map 35 
Anor 113, Map 33 
Anson Bay 100, 116, Map 35 
Ansus 115, Map 34 
Antakarrinya 101, 117, Map 35 
Antekerrepenh 101, 117, Map 35 
Antoniano [Salinan] 22 
Anuak see Anywa 
Anuki 110, Map 31 
Anung (Nu) 169, 175, Map 46 
Anuta 107, Map 26 
Anwak see Anywa 
Anyama 313, Map 101 
Anyi (Agni) 301, 311, 313, Maps 103, 105 
Anyo see Meluri 
Anyt see Parauhano 
Anywa (Anuak, Anwak) 284-5, 289, 
Map 72 
Ao (Naga B) 167, 175 
Ao 167, 169, 213, Map 51 
Aohan [Ju Ud subdialect] 191 
Aoheng 120, Map 41 
Aomie see Omie 
Aore 108, Map 27 
Aoshedd see Khyamnyungan 
Aowin 313, Maps 103, 105 
Apa Tani (Apa Tanang) 168-9, Map 51 


Apache 8 
Kiowa (Plains) 8, 16, Maps 2, 5, 8-9 
Western 8, 25, Maps 3, 5, 8-9 
Apakibeti 313, Maps 79, 82 
Apalachee 9-10, Map 2 
Apalai (Aparai) 76, Maps 12-13, 18 
Aparai see Apalai 
Apay see Chorti 
Apiaka (Apiaca) 74, 76, Maps 18-19 
Apiaká-Apingi 76 
Apinajé see Apinayé 
Apinayé (Apinajé) 72, Maps 18-19 
Apma 108, Map 27 
Apo Duat 120, Map 41 
Apowasi see Bitara 
Applegate see Galice-Applegate 
Apuk [Jieng dialect] 284 
Apurina (Kangite) 67, Maps 16-17, 20-21 
Aqaw (Aqao) [Gelao variety] 182, 184 
Aquacateco II 49 
Ar Horgin [Ju Ud subdialect] 191 
Arabana 101, 117, Map 35 
Arabela-Andoa 69, Maps 14-15 
Arabian 
Eastern Map 70 
Epigraphic South Arabian 287 
Modern South Arabian (MSA) 277, 
279, 281-2, Map 70 
Old South Arabian 280 
Arabic 1, 277-81, Maps 53, 59-60, 66, 90, 
99-100 
beyond the Middle East 
in Africa 302, 311 
Algerian 262 
Chadic colloquial 282 
in Ethiopia 285 
Moroccan 262 
Sudanese colloquial 282 
Tunisian 262 
in America 26-7 
in Australia Map 36 
Lebanese Map 36 
Cypriot 260, 280 
in Eurasia 233-4, 239, 262-3 
in Southern Asia 181, 211, 213, 216 
Classical (CA) 277-8, 281-2 
Eastern 278-81, Maps 70, 99 
bedouin and settled dialects 279-81 
Modern Standard Arabic 278, 282 
Northern Map 67 
pidgins 281-2 
Western 281, Maps 71-73, 99-100 
settled dialects 281, Map 73 
Arafundi 113, Map 33 
Arafundi see Alfendio 
Aragonese (Spanish) 262, Map 68 
Arai (Leftmay) 111-13, Map 33 
Arakaji 76, Maps 12, 18 
Arakanese (Rakhine, Mogh, Magh) 166, 
171, 175, 213, 215, Maps 49, 51 
Arakanese That see Sak 
Araki 108, Map 27 
Aralo-Caspian see Turkic, Central 
Aramaic 234-5, 239, 277, 279, Maps 53, 
59-60, 70 
Assyrian (Aisor, Neo-Aramaic) 239, 
278 
Aramaue see Aramo 
Aramo (Aramaue) 113, Map 33 
Aranama 7, 9-10, Maps 2-3 
Arandai [S. Bird’s Head] 114, Map 34 
Arandic 101, 117, Map 35 
Aranda 101, 117, Map 35 
Araona 71, Maps 16-17, 20-21 
Arapahoan 7, Maps 2, 5-9 
Arapaho 7, 10, Maps 5, 8-9 
Southern see Nawathinehena 
see also Gros Ventre 
Arapesh 113, Map 33 
Bumbita Map 33 
Mountain (Bukiyip, Kavu) Map 33 
Southern (Muhiang, Mufwian) 
Map 33 
Arara Uruku (Arara do Jiparaná) 75-6, 
Maps 16-19, 21 
Arara Urukti [Carib] 75 
Arara-Pariri (Arara do Para) 76, Map 18 
Arawakan 61, 67, Maps 12-13, 16-17 
Arawa [Arawan] 67, Maps 15-16 
Arawak (Aruak, Lokono) [Wahiro] 66, 
Maps 12-13 
see also Maipurean 
Arawe 109, Map 30 
Araweté 74, Map 19 
Arawum 112, Map 33 
Arbore (Ibore) 287, 313, Maps 72, 86 
Archi 234, 239, Map 59 
Are 110, Map 31 
Are (Mukawa) 110, Map 31 
Are [Omotic] see Aari 
‘Are'are 109, Map 29 
Areba 101, 117, Map 35 
Arem 177-8, Map 47 
Argobba 288, Map 72 
Argu-Tharnggal see Aghu-Tharnggal 
Arguni 115, Map 34 
Arha 108, Map 27 
Arhö 108, Map 27 
Ari see Aari 
Ari-Waruna 111, Map 32 
Ariawia see Mikarew 
Arifama (Arifama-Miniafla) 110, Map 31 
Arigibi 111, Map 32 
Arikapú 73, Maps 16-17, 20-21 
Arikara 9-10, Maps 5, 8-9 
Arikem 75, Maps 20-21 
see also Karitiana; Tukanarikem 


Ariseachi 54 
Aritinngithigh 101, 117, Map 35 
Arizona Tewa (Hano, Tano) 9, Maps 3, 5, 9 
Arkhangain-Odld Map 45 
Arleng (Karbi, Mikir) 167-9, 213, 
Map 51 
Armenian 231-5, 237, 259-60, Maps 53, 
59 
beyond Armenia 27, 211, 236 
Armenian [Aramaic subgroup] Map 70 
Aroid 288, 289 
Aromanian see Macedo-Rumanian 
Arosi 109, Map 29 
Arrente 103 
Arrernte Map 36 
Eastern 101, 117, Map 35 
Lower 101, 117, Map 35 
Western 101, 117, Map 35 
Arsi Map 72 
Arso see Taikat 
Arta 119, Map 40 
Arteiga [Beja dialect] 286 
Arthare 166 
Aru [Papuan, Palei] 113, Map 33 
Aru-Kei-Tanimbar [Austronesian] 115, 
119, Maps 34, 38 
Aru [subgroup] 115, Map 34 
Arua 75, Maps 17-19, 21 
Aruak see Arawakan, Arawak 
Aruán (Aruãn) 62, 66, Maps 12, 18 
Aruek (Djang) 113, Map 33 
Arum-Tesu see Abu-Chesu 
Aruma see Wapishana 
Arumanian (Macedo-Rumanian) 233, 
237, Map 66 
Arunachal, Western Map 51 
Aruop 113, Map 33 
Arusha 313, Map 86 
Arussi dialects 286 
Arutani see Awaké-Kaliana, Awaké 
Arvanatika Albanian see Albanian 
Arwako see Ika 
Arzeu, Old Map 73 
Asa' see Kadu 
Asa [S. Cushitic] 287 
Asala 313, Maps 90, 99 
Asante (Ashanti) 303, 313, 
Maps 102-103, 105 
Asas (Kow) 112, Map 33 
Asat 112, Map 33 
Ashaninca (Ashaninga) 67, Map 21 
Ashanti see Asante 
Ashéninga 67, Maps 14-15, 20-21 
Ashing 169 
Ashkun 211, 216, Map 53 
Ashraaf [dialect group] 287, Map 72 
Ashtiani 216, Map 53 
Asienara 114, Map 34 
Asilulu 118, Map 37 
‘Asir Map 70 
Aslian 121, 175, 178, Maps 44, 48 
Central 121, 178, Map 44 
Northern 121, 178, Maps 44, 48 
Southern 121, 178, Map 44 
Asmat-Kamoro 114, Map 34 
Asmat 99, 99 
Casuarina Coast 114 
Central 114, Map 34 
Citak 114 
North 114, Map 34 
Asom 313, Maps 87, 91 
Assamese 166, 212-14, 217, Map 51 
Assiniboine (Nakon) 9, 14, Maps 5, 7-9 
Assiringia see Phom 
Assyrian see Aramaic, Assyrian 
Astrakhan Nogai see Nogai 
Asturian (Spanish) 262, Map 68 
Asu see Pare 
Asua 313, Map 79 
Asumboa 109, Map 29 
Asuri 217, Map 51 
Asurini [do Tocantins] 74 
Asurini [do Xingu] 74, Map 19 
Asyat 281, Map 71 
Ata [Papuan: Yele] 99, 109, Map 30 
Ata Manobo [Austronesian: S. Philippine] 
119, Map 40 
Ataitan 113, Map 33 
Atakapan 7, 9-11, Map 2 
Eastern Atakapan 11, Map 2 
Western Atakapan 11, Map 2 
Atakat 313, Maps 96-97 
Atamanu 118, Map 37 
Atampaya [Uradhi dialect] 101, 117, 
Map 35 
Atanke (Atanque, Kampanake, Busintana, 
Buntiwa, Kankuama) 64, Map 12 
Atanque see Atanke 
Atayalic 182, 184, Map 50 
Atayal 182, 184, Map 50 
Atemple 111, Map 33 
Aten 313, Maps 96-97 
Athabaskan 8, Maps 2-9 
and Dene 14 
Southern (Apachean) 8, Map 2 
Athpare-Yakkha 166 
Athpare 165-6, Map 51 
Atie Map 105 
Atie 313, Map 105 
Atinggola 120, Map 39 
Atlantic 304, 306, 308, 332, Maps 72-73, 
89-91, 94-95, 97, 99 
North 304, Maps 73, 79, 89-91, 94-95, 
97, 99-102, 104, 106-108 
South 306, Maps 106-108 
West Map 72 
Atong [Bodo-Garo] 166, Map 51 


Atong [Grassfields] 313, Maps 93, 95 
Atruahi see Yawaperi 
Atsa' see Sak 
Atsam see Chawai 
Atsawaka-Yamiaka (Atsahuaca-Yamiaca) 
71, Maps 14, 16, 20 
Atsi [Bantu: Fang] 313, Maps 83, 88 
Atsi (Tsaiwa, Zi, Zaiwa) [Burmish] 167, 
169, 171, 175, Maps 46, 49 
see also Maru 
Atsina see Gros Ventre 
Atsugewi 8, 11, Map 4 
Atta 
Faire 119, Map 40 
Pamplona 119, Map 40 
Pudtol 119, Map 40 
Attikamek (Tête de Boule) 7, 11, Maps 1, 
7-9 
see also Cree-Montagnais 
Aturu 111, Map 32 
Au 113, Map 33 
Auaqué see Awaké 
Aukaan (Ndjuka) [creole English] 27, 
Map 13 
Aulihan 313, Map 85 
Aulua 108, Map 27 
Auna 313, Map 101 
l'Auni Map 74 
Aushi 313, Map 77 
Austral 107, Map 26 
Australian 100-2, 111, 115, 309, Map 24 
Australian Creole 107, Map 25 
Australian Pidgin English 107, Map 25 
Austro-Thai 178-9, 217 
see also Austronesian; Miao-Yao; 
Tai-Kadai 
Austronesian 46, 97-8, 97-8, 108-11, 
118-19, 121, 178, Maps 24-25, 27, 
29-30, 44, 48, 50, 52, 75 
Austro-Asiatic 97, 121, 175, 213, 217, 
Maps 24, 44, 52, 54 
see also Mon-Khmer; Munda 
Malayo-Polynesian 100, Map 44 
Oceanic 98, 107-9, 111, 114 
Autu 113, Map 33 
Auvergnat (French) Map 69 
Auyana 110, Map 32 
Ava (Canoeiro) 74, Maps 18-19 
Avadhi 214 
Avañe'e see Paraguayan Guarani 
Avar-Andi(c) 239, Map 59 
Avar 234-5, 239, Map 59 
Avatime-Yangbo 305, 313, Maps 102-103 
Avatime (Siya) 313, Maps 102-103 
Avestan 216 
Aveteian 108, Map 27 
Avikam Map 105 
Avikam 313, Map 105 
Avishiri see Tekiraka-Kenichana, Tekiraka 
Avok 108, Map 27 
Avoyel Map 2 
Avukaya 313, Map 79 
Awa 110, Map 32 
Awa Wa [Mon-Khmer] 176, 178, Map 46 
Awa-Coaiquer see Koaiker 
Awabakal 102, 118, Map 35 
Awadhi 214, Maps 51, 54 
Awahun see Awaruna 
Awak (Yebu) 313, Map 98 
Awakateko (Awaketeko, Aguacateco) 52, 
Maps 10-11 
Awaké-Kaliana 67-8, Maps 12-13 
Awaké (Arutani, Uruak) 68, 
Maps 12-13 
Awan [Jieng dialect] 284 
Awar 113, Map 33 
Awaruna (Aguaruna, Awahun) 69, 
Maps 14-15 
Awaswas see Costanoan 
‘Awazim 279, Map 70 
Awe (Akawe) [Achik subdialect] 166 
Awe (Indi) [Dravidian] 217 
Aweikoma see Shokleng 
Aweti 62, 74, Maps 18-19 
Awi (Awiya) see Awngi 
Awin-Pa 99, 99, 110, 112, Maps 32-33 
Awin 110, 112, Maps 32-33 
Awing 313, Maps 93, 95 
Awjila Map 71 
Awngi (Awiya, Awi) 286, Map 72 
see also Kunfal 
Awngthim 101 
Awras (Awrés) 282, Maps 71, 73 
Awun 113, Map 33 
Awutu 313, Maps 102-103 
Awyi 115, Map 34 
Awyu-Dumut-Asmat-Kamoro 114 
Awyu-Dumut 99, 99, 110, 114, 
Maps 32, 34 
Awyu 114 
Axamb 108, Map 27 
Axi 172, 175, Map 46 
Ayacucho Kechua (Kichua, Quechua) 70, 
Map 21 
Ayaman see Ayoman 
Ayapaneko Map 11 
Ayapathu 101, 117, Map 35 
Ayautla Soyaltepec Masateko 53 
Ayerrereng 117, Map 35 
Ayizi 172, 175, Map 46 
Aymiilal see Soddo 
Aymara see Aimara 
Aymasa [Kunama dialect] 290 
Aynu (Eynu, Aynu) [mixed language] 
186, 193, Map 45 
Ayo see Mulao (Ayo) 
Ayoman (Ayaman) 63, Map 12 


Ayoreo 72, Maps 18-21, 23 
Ayta [N. Philippine subgroup] 119, 
Map 40 
Ayu 313, Maps 96-97 
Aza [Gurage] see Ezha 
Aza [Teda-Daza] 313, Map 99 
Azerbaijani (Azeri) 216, 234-5, 237-8, 
Maps 53, 59 
beyond Azerbaijan 211, 236 
Azerbaijanian, Soviet [Aramaic subgroup] 
Map 70 
Azha 172, 175, Map 46 
Azhe 172, 175, Map 46 
Aziana see Kenati 
Azoyú Tlapaneko 53 
Aztec-Tanoan 9 


B'atz'i k'op see Tzotzil 
B'atz'il k'op see Tzeltal 
Ba see Kwa 
Baarundji 102, 118, Map 35 
Baatonum see Bariba 
Baba Indonesian see Chinese Indonesian 
Baba Malay (Straits Malay, Chinese 
Malay) 103-4, 162, Map 25 
Babale-Kunda 313, Maps 82, 87 
Babale-Lenge 313, Maps 82, 87 
Babanki 313, Maps 93, 95 
Babatana 102, 106 
Babende 313, Map 87 
Babenzele 313, Map 87 
Babine 8, 11, Maps 6-9 
Babole 313, Map 87 
Baburiwa 115, Map 34 
Babuyan 119, Map 40 
Babuza 182, 184 
Bacairi see Bakairi 
Bachama 313, Maps 91, 98 
Bactrian [Iranian] 216 
Bada [Bantoid] 313, Maps 96-98 
Bada [Central Sulawesi] 120, Map 39 
Badaga 217, Map 52 
Badara 313, Maps 96-97 
North-East 324, Map 96 
South-West 327, Map 96 
Bade-Warji (Bade-Warje) Maps 97-98, 
100 
Bade [subgroup] Map 100 
Bade 313, Map 100 
Badjala (Fraser Island) 102 
Badjiri 117, Map 35 
Badyara see Pajade 
Baendn (Baena) 73 
Baetora 108, Map 27 
Bafek 313, Map 93 
Bafia 313, Maps 92-93 
Bafut 313, Maps 93, 95 
Baga Map 107 
Baga Binari 313, Map 107 
Baga Koba 313, Map 107 
Baga Ma(n)duri 313, Map 107 
Baga Sitemu Map 107 
Baga Sobane 313, Map 107 
Baga Mboteni Map 107 
Bagasin see Girawa 
Bageto 313, Map 87 
Baggara 281, Maps 71-73, 99 
Baghati Map 54 
Baghdad (Qaltu) Map 70 
Bagheli Maps 51-52, 54 
Bagirmi 284, Map 90 
see also Sara-Bagirmi 
Bago 313, Map 102 
Bagobo 120, Map 40 
Bagri Map 54 
Bagupi 112, Map 33 
Bagval(al) 233-4, 239, Map 59 
Bagyeli 313, Maps 88, 92 
Baham 114, Map 34 
Bahamian Creole (Bahamas English) 26, 
27, Map 9 
Bahariyya 281, Map 71 
Bahasa Malayu 119, Map 38 
Bahasu Melayu Loloan (S.W. Bali Malay) 
121, Map 42 
Bahau 120, Map 41 
Bahau River Kenyah 120, Map 41 
Baheng (Pa Then, Pathen) 183-4, 
Maps 45, 47 
Bahinemo 113, Map 33 
Bahinemo (Wogu) 113, Map 33 
Bahing (Rumdali) 165, 214, Map 51 
Bahnaric 175, 177-8, Maps 47-48 
Bahnar 177-8, Map 48 
Bahrathar see Sunwar 
Bahrayn 279, Map 70 
hBahthang 164 
Bai [Papuan] see Dumun 
Bai [Tibeto-Burman] 159, 163, 169-70,
175, Map 46 
Baibai 112 
Baibai 112, Map 33 
Baihong 173, 175, Map 46 
Baiku Yao see Baonao 
Bailundu 313, Map 84 
Baima 169-70, 175, Map 45 
Baimak 112, Map 33 
Bainapi 110, Map 32 
Baining 99 
Baining-Taulil 109, Map 30 
Baining 109, Map 30 
Bairin [Inner Mongolian] 191, Map 45 
Ju Ud subdialect 191 
Bajan Creole (Bajan English) 26, 27 
Bajau; Bajo see Sama, Sama Bajau; Sama 
Bajo 
Bajjika [Indo-Aryan] 214
Bajuni 330
Bajun (Tikuu) 313, Map 85 
Bajwe 313, Map 87 
Bajwo 313, Map 95 
Bak [Atlantic subgroup] Maps 107-108 
Baka [Central Sudanic] 284, 313, Map 79 
‘Baka [Ubangi] 330 
Bakairi (Bacairi) 62, 76, Maps 18-19 
Bakhtiari 211 
Baki [Bantu] 313, Maps 87, 91 
Baki [Central Vanuatu] 108, Map 27 
Bako see Aari 
Bakoko 313, Map 92 
Bakole 313, Maps 92-93 
Bakoni 313, Map 93 
Bakosongo 313, Map 83 
Bakpinka 313, Map 93 
Bakumpai 120, Map 41 
Bakundu 313, Maps 92-93 
Bakung Kenyah 120, Map 41 
Bakwe-Wane Map 105 
Bakwe 313, Map 105 
Bala [Bantu] 313, Map 84 
Balaesan 120, Map 39 
Balangaw 119, Map 40 
Balanta 311, 332, Map 108 
Balanta 313, Map 108 
Balantak 120, Map 39 
Baldamu 313, Map 94 
Balé [Maipurean] see Baré
Bale [Omotic] see Zilmamu 
Baledha 283, 284 
Bale (Baledha) [Nilo-Saharan] 284, 
313, Map 79 
Bali [Arabic] Map 70 
Bali [Bantu: Ngombe] 313, Maps 79, 83 
Bali [Leko-Nimbari] 313, Map 98 
Bali Malay, South-Western Map 41 
Bali Nyonga (Balinyonga) [Grassfields] 
313, Maps 93, 95 
Bali-Sasak (Balisask) 118, 121, Maps 38, 
42 
Balinese 118, 121, Maps 38, 42 
Bali-Vitu 109, Map 30 
B'alib'a [Moru dialect] 284 
Balinyonga see Bali Nyonga 
Balkan Romance 259 
Balochi see Baluchi 
Balong 313, Maps 92-93 
Balti [Tibetan] 163-4, 211, 213, Map 54
Balto-Finnic (Baltic Finnic) 238, 259, 
Maps 58, 62 
Balto-Slav(on)ic 231, 236, 259, Maps 45, 
50, 55-62, 64-66 
Baluan-Lou-Pam 109, Map 30 
Baluchi (Balochi) 211-12, 216, 234, 
236-7, Maps 53-54, 57 
Southern 211 
Western 211 
Balygu 116, Map 35 
Bam [Finisterre] 110, Maps 32-33 
Bam [Schouten] 113 
Bamankan see Bambara 
Bambama 313, Maps 83, 88 
Bambara (Bamankan) 302, 312-13, 
Maps 104, 107-108 
Bambassi (Mao kole) 289 
Bambili (Bamili) 313, Maps 93, 95 
Bambomba 313, Map 87 
Bambuka 313, Map 98 
Bamenyam see Mamenyam 
Bamileke 311, Map 93 
Bamili see Bambili 
Bamnyo 314, Maps 91, 95 
Bamu Kiwai see Kiwai 
Bamun (Shipamon) 314, Maps 93, 95 
Bamunka 314, Maps 93, 95 
Bana (Pana) 174, 314, Map 94 
Banam Bay 108, Map 27 
Banaro 113, Map 33 
Banawa 67 
Banda [Austronesian subgroup] 115, 118, 
Maps 34, 37 
Banda 115, Map 34 
Banda [Bantu subgroup] Maps 79, 82, 87, 
89-90 
Banda 311, 314, Maps 79, 82, 87, 90 
Mid-Southern 314 
South-Central 314 
Banda-Bambari Maps 82, 89 
Banda-Banda 314-15, Maps 82, 89 
Banda-Mbres 314, Maps 82, 89-90 
Banda-Ndele 314, Maps 89-90 
Bandawa-Minda 314, Map 98 
Bandi [Mande] 314, Map 106 
Bandjalang 102, 118, Map 35
Bandjigali 102, 118, Map 35 
Bandyi 101 
Banga [Benue-Congo] 314, Maps 100, 102 
Banga [Gur-Adamawa] 314, Map 98 
Bangandu 314, Map 87 
Bangari Me 314, Map 104 
Bangba-Mayogo 314, Map 79 
Bangem 314, Map 93 
Banggai [Sulawesi] 120, Map 39 
Banggi [Philippine] Map 41 
Bangkeng 314, Map 93 
Bangla see Bengali 
Bangni (Bengni) 169, 175 
Bango 314, Maps 79, 82 
Bangru 169, Map 51 
Bangu-Bangu 314, Maps 78, 80 
Bangwinji 314, Map 98 
Baniata 99, 109, Map 29 
Baniva 66, Maps 12-13 
Baniwa 66 
Banjogi see Bawm
Bankal 314, Maps 96-98 
Banma Tegu 314, Map 104 
Banna 288, 289, Map 72 
Bannock see Northern Paiute 
Banoni 108, Map 29 
Banoo 314, Map 92 
Banpara see Wancho 
Bantawa 165-6, 214, Map 51 
Bantayanon (Bantoanon) 119, Map 40 
Bantik 120, Map 39 
Bantoanon see Bantayanon 
Bantoid-Cross 305-6, Map 101 
Bantoid 305-6, 308-9, 331, Maps 91, 
95-96 
see also Bantu; Cross 
Banton 119, Map 40 
Bantu 46, 290, 305-6, 309, 329-32, 
Maps 52, 72, 74-83, 85-86, 89 
Central 329, Map 75 
Central-East 329, Maps 74, 78 
Coast Maps 75-76 
Central-West Maps 78, 80-81 
North Maps 79-81 
as lingua franca 303, 330 
North-East 304, 309, 330, Maps 52, 75, 
78-81, 89 
Coast Maps 72, 76, 85-86 
Congo Maps 79-81, 89 
Interior Maps 72, 76, 85-86 
Lake Maps 78-80, 86 
Transitional Maps 85-86 
North-West 306, 330, Maps 79-83, 89, 
92 
Coast Map 92 
South 308 
Central Maps 74, 81 
South-East 329, Maps 74-75 
South-West Map 74 
Bantu Pidgin see Fanakalo 
Bant ‘Attiyya Map 70 
Bant Hajir Map 70 
Bana Hilal 281 
Banu Iznasen Map 73 
Bani Khalid Map 70 
Bani Khartis 280, Map 70 
Bant Sakhr Map 70 
Bani Salah Map 73 
Bani Sniis Map 73 
Bani Sulaym 281 
Bani Swayf (Banti Swéf) 281, Map 71 
Bant Wahb Map 70 
Bani Yam Map 70 
Bant Saamir [Beja dialect] 286 
Banyun 314, Map 108 
Bao'an 191, Map 45 
Baonao (Nao Klao, Nao Gelao, Baiku 
Yao) 183-4 
Bapakum 314, Map 93 
Bapu 115, Map 34 
Bapuko (Bapuku) 314, Maps 88, 92 
Bara see Boro 
Bará-Tuyuka 68, Maps 12-13, 16
Baraba Tatar see Tatar 
Baradha 101, 117, Map 35 
Barag see Bargu-Buriat 
Baraic 110-11, Map 32 
Barai 110-11, Maps 31-32 
Barakai 115, Map 34 
Baram [Himalayan] see Bhramu 
Baram Kayan 120, Map 41 
Baram Kenyah, Upper 120, Map 41 
Barambo 314, Map 79 
Barani Ful 314, Map 104 
Barasano see Buhagana 
Barau 114, Map 34 
Barawa see Mwini 
Barbakóan 63-4, Maps 12-13
Barbareño [Chumashan] 8, 11, Maps 3-4
Bardi 116, Map 35 
Baré (Balé) 66, Maps 12-13, 16 
Barea see Nara 
Barein 314, Map 89 
Barga [Altaic] see Bargu-Buriat, Bargu 
Bargu [Gur] see Bariba 
Bargu-Buriat 188, 191, Map 45 
New Bargu (Barga) 191, Map 45 
Old Bargu (Barga) 191, Map 45 
Bari (Motilón, Dobokubi) [Chibchan]
64, Maps 12-13 
Bari [Nilotic subgroup] 307, Maps 79, 86 
Bari 284, 314, Maps 72, 79, 86 
Bariai 109, Map 30 
Bariba (Bargu) 305, 331, Maps 101-102, 
104 
Bariba (Baatonum) 311, 314, 
Maps 101-102, 104 
Baric see Sal 
Bariji 110-11, Maps 31-32 
Barim 113, Map 33 
Barito 120, Map 41 
Barito-Mahakam 120, Map 41 
East 120, Map 41 
North-East Barito 120, Map 41 
South-East Barito (Ma'anyan) 120, 
Map 41 
West 120, Map 41 
Barka [Kunama dialect] 290 
Barma 284, 314, Maps 90, 99 
Barngarla 101, 117, Map 35
Barnya 101, 117, Map 35 
Barok 109, Map 30 
Baropasi 115, Map 34 
Barrow Point 101, Map 35 
Bartangi 236-7 
Baruga 110, Map 31 
Barundji 102, 118, Map 35 
Barunggam 102, 118, Map 35
Baruya 111, Map 32 
Barwe (Wesa) 314, Maps 74-75 
Barya see Kunama; Nara 
Basa Kupang 119, Map 38 
Basa-Kamuku [subgroups 1-3] 
Maps 100-101 
Basa Kaduna 314, Map 101 
Basaa Maps 92-93 
Basaa 314, Maps 92-93 
Basaa ba Duala 314, Map 92 
Basap 120, Map 41 
Basari-Konyagi Maps 107-108 
Basari 314, Maps 107-108 
Basari-Tobote 314, Maps 102-104 
Basay 182, 184 
Bashar 314, Map 98 
Bashgali 211, 216, Map 53 
Bashkardi 211, 216, Map 53
Bashkarik Map 53 
Bashkir 233-6, 238, Maps 55, 58 
Basho 314, Map 95 
Basila-Adele Maps 102-104 
Basila 314, Map 104 
Basila-Adele 314, Map 102 
Basketo 288, 289, Map 72 
Basque (Euskara) 262, Maps 68-69 
Basra (Qoltu) Map 70 
Bassa 311, Map 106 
Bassa 314, Map 106 
Basum 314, Map 91 
Bata see Gbwata 
Bataan Ayta 119, Map 40 
Batak 121, Maps 40, 43 
Batak [Palawan] 119 
Batanga [Bantu] 314, Map 92 
Batangan 119 
Batangan [Meso-Philippine] Map 40 
Batanic [Austronesian subgroup] 119, 
Map 40 
Batek 121, 178, Map 44 
De' 178 
Iga 178 
Nong 178 
Te’ 178 
Bathari 281 
Bati [Mbam: A40] 312, 314, Maps 92-93
Bati [Ngombe: C50] 312, 314, Maps 79, 82
Batomo 314, Map 95 
Bats, Batsbi see Tsova Tush 
Batu 314, Map 95 
Batumerah 118, Map 37 
Batyamal 116, Map 35 
Bau [Madang: Mabuso] 112, Map 33 
Bau [Tabi dialect] 284 
Baukan 121, Map 41 
Baule [Kwa] 311 
Baule-Ando 314, Map 105 
Baure 67, Maps 16-17, 20-21 
Bauro 109, Map 29 
Baushi 314, Map 101 
Bauwaki 110, Map 31 
Bauzi 115, Map 34 
Bavaro-Austrian 261, Map 65 
Bawm (Banjogi, Bum) 168, 215 
Baxoje see Chiwere 
Bay [Bantu] 314, Map 81 
Bay Islands English 26, 27, Map 11
Bay Miwok see Miwok, Eastern 
Baya 314, 331, Map 95 
Bayad 188, Map 45 
Bayali 102, 117, Map 35 
Bayogoula Map 2 
Bayot-Essin 314, Map 108 
Bayso 287, Map 72 
Baza (Bazen) see Kunama 
Bazaar (Pasar, Bazar) Malay 103, Map 25 
Bazanche 314, Maps 102, 104 
Be (Ong-be, Lingao) 179, 184, Map 45 
Beami (Bedamini) 110, 112, Maps 32-33 
Bearlake (North Slavey) 8, 11, Maps 7-8 
Beaufort Murut 121, Map 41 
Beaver 8, 11, Maps 6-9 
Beba’ 314, Map 95 
Beboid 306, Map 95 
East Map 95 
Bebe 314, Map 95 
West Map 95 
Bedamini see Beami 
Bedik 314, Maps 107-108 
Bedoanas 115, Map 34 
Bedouin Maps 71-73, 99-100 
Jordanian Map 70 
Negev Map 70 
Saharan Maps 99-100 
Bedouin dialects Map 73 
Constantine Map 71 
Hilalali Map 73
Hilali Map 71 
Libyan Map 71 
Ma'qili Map 73 
Saharan Maps 71, 73 
Sahel Map 73 
Sulaymi Maps 71, 73 
Western Map 71 
Beeke 314, Map 79 
Beekol 314, Map 87 
Beeten (Beten) 314, Maps 87, 91 
Befang 314, Map 95 
Beik (Meguese) 171, 175, Map 48 
Beira (Portuguese) Map 68 
Beja 282, 286-7, Map 72 
Bekati 120, Map 41 
Bekware (Bekwarra) 314, Map 95 
Bekwil 314, Map 87 
Bekwil Peda 314, Map 87 
Bel 113, Map 33 
Bel (Gedaged) 105, 113, Map 33
Belarusian (Belorussian) 231-2, 234-6,
Maps 58, 62 
Bele 314, Map 91 
Bolen see Bilin 
Belepa see Keuru 
Belhare 165-6, Map 51 
Beli 284, Map 33 
Belingo 314, Map 89 
Belize Creole English 27 
Bella Bella see Heiltsuk-Oowekyala 
Bella Coola (Nuxalk) 8, 11, Maps 6-9 
Belorussian see Belarusian 
Bemal (Kein) 112, Map 33 
Bemba Maps 76-78 
Bemba 302, 312, 314, Maps 76-78 
as lingua franca 329 
Bembe [Kongo: H10] 314, Maps 83, 88 
Bembe [Shi-Havu: J50] 314, Map 80 
Bembi see Ifas 
Ben'gan see Pakanic, Pakan 
Ben (Gan) [Mwa-Gan] 314, Map 105 
Bena-Kinga Maps 76-77 
Bena 312, 314, Map 76 
Bena-Hehe 312 
Benaadir [dialect group] 287, Map 72 
Benabena 110, 112, Maps 32-33 
Bench (Gimira) 288-9, Map 72 
Bende 314, Maps 76, 78 
Bendeghe 314, Maps 93, 95 
Bendi [Cross] 305 
Bendi [Li subvariety] 179 
Benga 314, Map 88 
Bengali-Assamese 217, Map 51 
Bengali 212-14, 214, 215, 234, 259, 
Maps 51-52 
dialects 166 
overseas 181, 263 
Benge 314, Maps 79, 82 
Benghazi [Arabic dialect] Map 71 
Benglong see De'ang 
Bengni see Bangni 
Beni Shangul see Berta; Gumuz 
Benue-Congo 304-6, 308-9, 316, 318, 
322, 331-2, Maps 72, 75-89, 91-93, 
95-98, 100-102 
East Maps 96-97 
Benue-Kwa 305-6 
Benyadu' 120, Map 41 
Beothuk 7, 9, 11, Maps 1, 7
Bepour 112, Map 33 
Beraku see Bubalia 
Berawan 120, Map 41 
Berber 277, 281-2, Maps 71, 73, 100 
Berbice Creole Dutch Map 13 
Bereby Kru 314, Map 105 
Berik 115, Map 34 
Beromic 305, 306, Maps 96-97 
Berta 283, 285, 289, Map 72 
Berta 285, 290, Map 72 
Berti-Zaghawa 283 
Berti 283, Map 71 
Besap see Besep 
Besawunena 10, Map 5 
Besep (Besap) 314, Map 87 
Besisi (Mah Meri) 121, 178, Map 44 
Besleri 314, Map 94 
Besum 314, Map 94 
Bete 311, 314, Map 105
Bete-Bendi 314, Map 95 
Beten see Beeten 
Beti see Mbeti 
Betise’ [Besisi dialect] 178 
Betoi (Hirara, Jirara) 63, Map 12 
Bewani 112, 114, Maps 33-34 
Beygo [Daju dialect] 284 
Bezhta (Kapuchi) 234, 239, Map 59 
Bhadrawahi Map 54 
Bhajjika Map 51 
Bharmauri (Gaddi, Gadi) Map 54 
Bhatri Map 52-1 
Bhili (Bhilodi) 213, Map 54-2 
Bhoi 168 
Bhojpuri 213-14, 216, Maps 51-52 
see also Bajjika 
Bhotia Tibetan 164, 213 
Bhramu (Baram, Maramu) 165, 214, 
Map 51 
Bhujel (Khawas) 214 
Bhumij 213, 217, Map 51
Bi Ka 175 
see also Biyue; Kaduo 
Bia Maps 102-105 
Biafada-Pajade Maps 107-108 
Biafada (Biafade) 314, Map 108 
Biak [Austronesian lingua franca] 103-4, 
115, 118, Maps 25, 34, 37 
Biaka [Papuan] 112, Map 33 
Biami see Piame 
Bian Marind see Marind 
Biangai 111, Map 32 
Biao 179, Map 45 
Biaoman [Yao variety] 183 
Biaomin [Yao variety] 183 
Biat [Mnong dialect] 177 
Biatah 120, Map 41 
Bichelamar see Bislama 
Bidai Maps 2-3 
Bide (Kunini) Map 32 
Bideyat 283 
Bidyara 101, 117, Map 35 
Bieno 312, 314, Map 84
Bierebo (Bierbo) 108, Map 27 
Bieri [Gur] 314, Maps 102-104 
Bieria 108, Map 27 
Big Nambas 108, Map 27 
Bihari 213, Map 51 
Bihari 213
Bijago 332, Maps 107-108 
Bijago 314, Maps 107-108 
Biji (Northern Tujia) 170, Map 45 
Bijugi 314, Map 87 
Bikaru (Bugalu) 113, Map 33 
Bikol 103, 119, Map 40 
Standard Bikol 119 
Biksi 113, Maps 33-34 
Bilakura 112, Map 33 
Bilala 314, Maps 90, 99 
Bilbil 113, Map 33 
Bile 314, Map 98 
Biliau (Sengam) 113, Map 33 
Bilin (Bolen) 286-7, Map 72 
Biloxi 9, 11, Map 2 
Biltine see Mimi 
Bilua 99, 109, Map 29 
Bima [Bantu] 314, Map 93 
Bima-Sumba [Austronesian subgroup] 
118, Map 38 
Bima 118, Map 38 
Bimin 112, Map 33 
Bimoba see Moba 
Bina 110, Map 31 
Binahari 110, Map 31 
Binandere 111 
Binanderean [subgroup] 99, 110-11, 
Maps 31-32 
Binandere 99, 105, 110-11, 
Maps 31-32 
Binbinga 116, Map 35 
Bindal 101, 117, Map 35 
Bindi 314, Maps 78, 80-81 
Bine (Kunini) 111, Map 32 
Binga 284 
Bini see Edoid, Central, Edo 
Binisaya, West 119, Map 40 
Binji (Binja) 314, Map 80 
North Binja 324, Map 80 
South Binja 327, Map 80 
Bintauna 120, Map 39 
Bintukwa see Ika 
Bintulu 120, Map 41 
Binukid 119, Map 40 
Binumarien 110, Map 32 
Binza 314, Maps 79, 82 
Bira [Nyoro-Ganda: J10] 314, Map 80 
Bira-Huku Maps 78-81, 89 
Bira [D30] Map 79 
Birale 285, 290, Map 72 
Birao 109, Map 29 
Birarchen [Orogen dialect] 187 
Birawa 314, Map 97 
Bird’s Head [subgroups] 
East 99, 115, Map 34 
North and Central 115, Map 34 
South 99, 114, Map 34 
West 115, 118, Maps 34, 37 
Birgid [Nubian] 282-3 
Birgit [Chadic] 314, Map 89 
Birhor 217, Map 51 
Biri [Pama-Nyungan] 101, 117, Map 35 
Birifor (Birifur, Lobr) 314, 
Maps 103-105 
Birifur see Birifor 
Birom 314, Maps 96-97 
Birri [Nilo-Saharan] see Bvirri
Birria 101, 117, Map 35 
Bisa [Mande] 314, 332, Maps 102, 104 
Bisa-Lamba [Bantu subgroup] 
Maps 75-77 
Bisa [M50] 314, Maps 75-77 
East 316, Map 77 
North 324, Map 77 
South 327, Map 77 
Bisayan [N.E. Borneo subgroup] 121, 
Map 41 
Northern Bisaya 121, Map 41 
Southern Bisaya 121, Map 41 
Bisayan [Philippine subgroup] 
North 119, Map 40 
South 119, Map 40 
Biseni-Oko(r)dia 314, Map 101 
Bishari [Beja dialect] 286 
Bishnupriya Manipuri [mixed language] 
168 
Bishnup(u)riya 213, 215, Map 51 
Bisis 113, Map 33 
Bislama (Bichelamar) [pidgin English] 
102, 106, Map 25 
Bisoid 173, 175, Map 46 
Bisoo 314, Maps 92-93 
Bisu 174-5, 181, Maps 46-47, 49 
Bit see Pasing 
Bitara (Apowasi) 113, Map 33 
Bitare 314, Map 95 
Bitieku 314, Map 93 
Bitjara 101, 117, Map 35 
Biu-Mandara [Central Chadic] 308, 331, 
Maps 71, 89-91, 94-97, 99, 100 
Biwat (Mundokuma, Mundugumor) 113, 
Map 33 
Biyom (Sasime) 112, Map 33 
Biyue (Piyo) 173, 175, Maps 46-47 
Bizerta [Arabic dialect] Map 73 
Blaan-Tboli 119, Map 40 
Blablanga 109, Map 29 
Black Karen see Rianglang 
Black Tai see Tai-Kadai, Tai Dam 
Blackfoot 7, 11, Maps 5-9 
Blackwater see Carrier 
Blagar 119, Map 38 
Blimaw (Bwe Karen) 174-5, Map 49 
Blood see Blackfoot 
Blue/Green Mong see Hmong 
Bo [Bantu] 314, Map 80 
Bo (Po) [Leftmay] 112, Map 33
Bo-y see Zhuang, Northern (Buyi)
Boanari 76, Maps 12, 16, 18 
Boano 118, Map 37 
see also Bolano 
Boazi 111, 114, Maps 32, 34 
Boazi 111, 114, Maps 32, 34 
Bobangi-Ntomba Maps 79, 81-83, 86 
Bobangi 314, Maps 81-83, 86 
Bobo Maps 104-105 
Bobo (Sya) 314, Maps 104-105 
Bobot 118, Map 37 
Bodede 314, Map 88 
Bodega Miwok see Coast Miwok 
Bodi [Me'en dialect] 289 
see also Me'en 
Bodic [Western Tibeto-Burman] 175, 
Map 51 
Bodish [subgroup] 163, 175, Maps 51, 
54
Central see Tibeto-Burman, Tibetan 
East 163-5 
West (Gurung, Tamang) 163-4 
Bodo-Garo [subgroup] 166, 175, 
Map 51 
Bodo 166, Map 51 
Meche 214 
Bodikiri 314, Map 90 
Bodiman 314, Maps 92-93 
Bodo [Bantu] 314, Maps 79, 89 
Bodo [Baric] see Boro 
Bodo [Bodic] see Bodic, Bodo 
Bodo-Garo see Bodic, Bodo-Garo 
Bodongo 314, Maps 82, 86 
Bofi 314, Map 87 
Bogadjim see Bom 
Bogaia see Pogaya 
Bogar see Bokar 
Boghom Maps 95-98 
Boghom 314, Maps 96-98 
Bogon [Burun dialect] 284 
Bogos [Tigre dialect] 287 
Bogota see Bokota 
Boguru 314, 330, Map 79 
Bohuai 109, Map 30 
Bohura 71, Map 16 
Bohutu 109, Map 31 
Boianaki 110, Map 31 
Boiken 113, Map 33 
Bokar (Bogar) 169, 175 
Bokare 314, Map 87 
Bokare-Mbakolo 314, Map 87 
Bokas (Bokkos) 314, Map 96 
Boki see Bokyi 
Bokkos see Bokas 
Bokmal see Norwegian 
Bokota (Buglere, Bogota) 64, Maps 12-13 
Bokote [Bantu] 314, Maps 81-82 
Bokpan 314, Map 87 
Bokyi (Boki) 314, Maps 93, 95 
Bola'ang Mongondow 120, Map 39 
Bola [Atlantic] see Makanya 
Bola [Meso-Melanesian] 109, Map 30 
Bola [Tibeto-Burman] 171, 175, Map 46 
Bolango 120, Map 39 
Bolano (Boano) 120, Map 39 
see also Boano 
Bole-Angas Maps 91, 96-98 
Bole Maps 97-98, 100 
Bole 314, Map 100 
Bolgo 314, Maps 89-90 
Bolia 314, Map 81 
Bolinao 119, Map 40 
Bolivian 71, Maps 16-17 
Bolivian Guarani 74 
Bolo 314, Map 84 
Boloki 314, Maps 82, 86 
Bolon 314, Map 104 
Boloven see Loven 
Bom (Bogadjim, Lalok) 112, Map 33 
Bom-Mani 314, Map 106 
Boma [Bantu] 314, Maps 81, 83 
Bombali 314, Map 106 
Bomberai 115, 118, Map 37 
West 99, 114, Map 34 
Bomboma 314, Maps 82, 86 
Bombwanja 314, Map 81 
Bompoka [Nicobarese dialect] 178, 217 
Bomudi 314, Map 88 
Bomvana 314, Map 74 
Bomwali 314, Map 87 
Bon Gula 314, Map 89 
Bonda see Remo 
Bondanga-Lingonda 314, Maps 82, 86 
Bonde 314, Map 82 
Bondei 314, Maps 76, 85 
Bondo 314, Map 84 
Bondum Dom 314, Map 104 
Bonerate 120, Map 39 
Bonerif 115, Map 34 
Bong'omok 314, Map 86 
Bonga see Malamalai 
Bongo [Austronesian] 115, Map 34 
Bongo [Gur-Adamawa] 314, Maps 82, 89 
Bongo-Bagirmi 283 
Bongo [Nilo-Saharan] 283, 284, 314, 
Maps 72, 79 
Bongo dialects 314, Map 72 
see also Sara-Bagirmi 
Bongu [Papuan: Madang] 112, Map 33 
Boni see Af-Boon; Kenyan Boni 
Bonjo 314, Map 87 
Bonkiman 112, Map 33 
Bontok 119, Map 40 
Booli 315, Maps 80-81 
as Mongo dialect 330 
Boon see af-Boon 
Bor-Muthun see Wancho
Bora-Witotoan 69, Maps 12-17 
Boran 69 
Bora-Miranya 69, Maps 12-17 
Bora-Muinane 69, Maps 12-15 
Boraan 286-7 
Borai-Hattam 115 
Borai 115, Map 34 
Borana 315, Maps 72, 85-86 
Border 99, 112, 114, Maps 33-34 
Northern (Border-Tor-Lake Plain) 112, 
114, Maps 33-34 
Borduria see Nocte 
Bori 169 
Borjeño Maps 3-4
Borneo Coastal Malay Map 41 
Borneo, Northeast Map 41 
Bornholmsk (Danish) Map 64
Boro (Bodo, Bara, Bara, Plains Kachari) 
[Tibeto-Burman] 166, 212-13
Boré [Omotic] see Bworo 
Borobo 315, Map 105 
Borondo see Londo 
Bororoan 72, Maps 18-20 
Bororo 72, Maps 18-20 
Boruca see Boruka 
Boruka (Boruca, Borunka) 64, 
Maps 12-13 
Borunca, Borunka see Boruka 
Bosaka 315, Maps 80-81 
Bosavi 99, 99, 110, 112, Maps 32-33 
Bosha [Omotic] see Garo 
Bosho [Bantu: Kilimanjaro] 315, 
Map 86 
Bosilewa 110, Map 31 
Bosman (Bosngun) 113, Map 33 
Bosngun see Bosman 
Bosnian 232-4, 236, Maps 61-62, 65-67 
Bosoko 315, Map 91 
Botahari Map 70 
Bote 214 
Botlikh 233-4, 239, Map 59 
Botolan 119, Map 40 
Bougainville 
North 99, 107-9, Map 29 
South 99, 107, 109, Map 29 
South-West 108, Map 29 
see also Nehan-North Bougainville 
Bougie [Arabic dialect] Map 73 
Bourguignon (French) Map 69
Bouye 113, Map 33 
Bowiri 315, Maps 102-103 
Bowol (Bwol) 315, Map 96 
Bozo 312, 315, Maps 73, 104 
Brabants [Dutch dialect] 262 
Bragat 113, Map 33 
Braggyab 164 
Braghgo [rTahu variety] 164 
Brahui 211-12, 217, Maps 53, 54 
Brao (Love, Lave) 177-8, Map 48 
Brat 115, Map 34 
Bre (Brek, Kayaw) 174-5, Map 49 
Breri 113, Map 33 
Breton 259, 262, Maps 63, 69 
Bria 315, Map 89 
Bribri 64, Maps 12-13 
hBristod 164 
Britannic see Brythonic 
British Jamaican Creole see Jamaican 
Creole 
hBrogpa 164, 175, Map 45 
Broken see Torres Strait Pidgin English 
Brokpa [Tibetan] 215 
Brong see Abron 
Bru (Vankieu) 177-8, Map 47 
hBrugchu 164, 175, Map 45 
Brythonic 259, Maps 63, 69 
Bu-za-ngem 315, Map 95 
Bua Maps 89-90 
Bua 315, Maps 89-90 
Bualkhua 168 
Bubalia (Beraku) 315, Maps 90, 99 
Bubi-Benga Maps 88, 92 
Bubi 311, 315, Map 92 
Bubia 315, Map 92 
Buda-Mono 315, Map 87 
Budibud 110, Map 31 
Budu 315, Map 79 
Budukh 235, 239, Map 59 
Buduma 315, Maps 71, 73, 99 
Budya 315, Map 78 
Budza 315, Map 82 
Buena Vista Yokuts 8, 11, Map 4 
Buga 315, Maps 74, 77 
Bugalu see Bikaru 
Bugis 120, Maps 39, 41 
Buglere see Bokota 
Bugotu 109, Map 29 
Bugun Map 51 
Bugun (Khowa) 169 
Bugun-Khowa 168, 175 
Bugunidja 116, Map 35 
Buhagana (Barasano) 68 
Buhid 119, Map 40 
Buin 99, 109, Map 29 
Buja 315, Maps 79, 82 
Bujeba 315, Maps 88, 92 
Bujheli see Gharti 
Buji 315, Map 97 
Buka 108, Map 29 
Bukar Sadong 120, Map 41 
Bukat 120, Map 41 
Bukawa 111, Map 32 
Bukidnon Manobo, West 119, Map 40 
Bukitan 120, Map 41 
Bukiyip see Arapesh, Mountain 
Bukusu 315, Map 86 
Bulang see Angkuic
Bulang Wa [Mon-Khmer] 176, 178, 
Map 46 
Bulba (Nootre) 315, Map 102 
Bulgarian 233-6, 259-60, Maps 58, 
60-61, 66 
Bulgebi 112, Map 33 
Buli [Austronesian] 118, Map 37 
Buli [Ubangi] 315, Map 91
Buli-Konni [Gur] Maps 103-104 
Buli 315, Maps 103-104 
Bulkley Valley see Babine 
Bulla see Sheko 
Bulom-Kisi Map 106 
Bulom Map 106 
Bulu [Bantu] 315, Map 92 
Bulu [Fang-Mbeti] 303 
Bulu [Meso-Melanesian] 109, Map 30 
Bum [Chin] see Bawm 
Bum [Narrow Grass] 315, Map 95 
Buma [Austronesian] 109, Map 29 
Bumaji 315, Map 95 
Bumang see Dai (S.W. Tai), Dai Sa 
Bumbita see Arapesh, Bumbita 
Bumthang 164-5, 215, Map 51 
Bun [Papuan: Yuat] 113, Map 33 
Buna [Papuan: Torricelli] 113, Map 33 
Bunaban [Australian] 100, 116, Map 35 
Bunaba 116, Map 35 
Bunabun [Trans-New Guinea] 112, 
Map 33 
Bunak 119, Map 38 
Bunama 110, Map 31 
Bunan (Gari, Gahar, Gahri) 165, Map 54 
Bunar [Mnong dialect] 177 
Bunda 315, Map 84 
Bundeli Maps 52, 54 
Bundi see Gende 
Bungain 113, Map 33 
Bungandidjan 102, 118, Map 35 
Bungandidj 102, 118, Map 35 
Bungku-Mori (Bungu-Mori) 120, Map 39 
Bungku 120, Map 39 
Bungla 166 
Bunong [Mnong dialect] 177 
Buntiwa see Atanke 
Bunu (Dongnu) [Miao-Yao] 179, 183-4, 
Map 45 
Bunuo [dialect] 183 
Bunun [Taiwan Austronesian] 182, 184, 
Map 50 
Bunyoro see Nyoro 
Buol 120, Map 39 
Bura-Higi Maps 91, 94, 98-100 
Bura Maps 91, 94, 98-100 
Bura 315, Maps 94, 98, 100 
Burak 315, Map 98 
Buraka see Gbanzili 
Burarra 116, Map 35 
Burduna 100, 116, Map 35 
Burhinyi 315, Map 80 
Buriat (Buriad) see Buryat 
Burji (Daasé) 287, 315, Maps 72, 85-86 
Burmese-Ngwi 170-1, 175, Maps 45, 
47-49, 51 
Burmish [subgroup] 175, Maps 47-49, 
51 
Burmese (Myanmar) 171, 175, 181, 
194, Map 49 
Burmeso [Papuan] 97 
Burmish see Burmese-Ngwi 
Buru-Sula-Taliabo 118 
Buru 118, Map 37 
B'urulo [Madi dialect] 284 
Burum [Huon] 110, Map 32 
Burun 284, Map 72 
Burunge 287, 315, Maps 76, 86 
Burushaski 211, 217-18, Map 53 
Burusu 120, Map 41 
Buryat (Buriat, Buryaad) 188, 190-1, 
233-4, 238-9, Maps 45, 55 
dBus [Tibetan] 164, 175, Map 45 
Central (U, Lhasa) 164 
Busa [Niger-Congo] 315, 331, 
Maps 100-102 
Busa [Papuan] 99, 113, Map 33 
Busam [Grassfields] 315, Maps 93, 95 
Busami [Cenderawasih Bay] 115, 
Map 34 
Busang 120, Map 41 
Bushi [Malagasy dialect] 216 
Bushong (Kuba) 330, Map 81 
Bushong (Kuba) 315, 320, Map 81 
Busintana see Atanke 
Bussa (Mosiya) 286 
Butam 99, 109, Map 30 
Butha [Daur subdialect] 192 
Butmas-Tur 108, Map 27 
Buton 120, Map 39 
South 120, Map 39 
Butu-Ningi 315, Map 97 
Butuanon 119, Map 40 
Buu-1 [Bantu: Pokomo] 315, Map 85 
Buu-2 [Bantu: Pokomo] 315, Map 85 
Buu-3 [Bantu: Pokomo] 315, Map 85 
Buwal 315, Map 94 
Buxia see Sangkong 
Buyang 181-2, 184, Map 45 
Buyi see Zhuang, Northern 
Buyu 315, Maps 78, 80 
Bveri 315, Maps 91, 94 
Bviri see Sere-Ngbaka-Mba 
Bviri (Birri) [Sara-Bagirmi] 315, 
Maps 79, 89 
Bwa 315, Map 79 
Bwaidoga 110, Map 31 
Bwaidoga 110, Map 31 
Bwaka Map 82
Bwamu-Kurumfe Maps 104-105
Bwamu 315, Maps 104-105
Bwari 315, Map 80
Bwe see Kayah [Karen]
Bwe Karen see Blimaw
Bwem see Lelemi
Bwende 315, Map 83
Bweru 315, Map 80
Bwiko 315, Map 88
Bwile 315, Map 78
Bwisi [Nyoro-Ganda: J10] 315, Map 79
Bwisi [Shira-Punu: B40] 315, Maps 83, 88
Bwol see Bowol
Bworo (Boró, Shinasha) 288, 289,
Map 72
Byangs(h)i 165, 214, Maps 51, 54
Byelorussian see Belarusian
Byep 315, Map 87


Ca’ 315, Map 93 
Caac 108, Map 27 
Cabécar see Kabékar 
Cabichi see Kabishi 
Cacaopera see Kakaopera 
Cacua see Kakua 
Caddoan 7, Maps 2-3, 5, 8-9 
Caddo 9, 11, Maps 2-3, 5, 8-9 
Cadegomeño Map 3
Caduveo see Kadiwéu 
Cagayan Agta, Central 119, Map 40 
Cagayan Negrito, East Map 40 
Cahita see Kahita 
Cahita see Yaqui 
Cahto see Kato 
Cahuilla 9, 11, Maps 3, 8-9 
Caijia see Chinese (Sinitic), Caijia 
Caingangue see Kaingang 
Cairo 280-1 
Cajamarca Kechua (Kichua, Quechua) 
70, Map 15 
Cajun (Acadian French) 26 
Caka [Bantoid: Tivoid] 315, Map 95 
Caka [Bantu: Makua] 315, Map 75 
Cakchiquel see Kakchikel 
Calao see Romani, Caló
Calabrian (Italian) 261, Map 67 
Caliana see Kaliana 
California Athabaskan 8 
Callahuaya see Qalyawaya 
Caló see Romani, Caló
Calusa 7, 9, 11, Map 2
Camorta [Nicobarese dialect] 178 
Campanian (Italian) 261, Map 67 
Campeba see Omawa-Kampeva 
Camsa see Kamsa 
Canarese see Kannada 
Candoshi see Kandoshi 
Cangin Map 108 
Canichana see Kanichana 
Canoeiro see Ava 
Cantonese see Chinese (Sinitic), 
Cantonese 
Caolan 179, 184, Map 47 
Capanahua see Kapanawa 
Cape Cumberland 108, Map 27 
Cape Fear Map 2 
Cape Khoe Map 74 
Capiznon 119, Map 40 
Caqueta see Korewahe 
Car [Nicobarese dialect] 178, 217 
Cara see Chara [Central Nigerian] 
Caranqui see Kara 
Carapana see Karapana 
Care see Kare 
Carib see Kari'nya 
Cariban see Kariban 
Caribbean Creole 26, 27 
Southwestern 26, Map 13 
Carijona see Karihona 
Caripuna see Karipuna 
Cariri see Kariri 
Carolina Algonquian 7, 11, Maps 1-2 
Carrier (Dakelh) 8, 11, Maps 6-9 
Northern Carrier see Babine 
Southern Carrier Maps 8-9 
Cashibo-Cacataibo see Kashibo 
Cashinahua see Kashinawa 
Casiguran Dumagat see Dumagat 
Castilian (Spanish) 262, Map 68 
Casuarina Coast Asmat Map 34 
Catacao see Katakáoan, Katakao
Catalan 259, 261-2, Maps 67-69 
Catanduanes Bikol, Northern 119, 
Map 40 
Catawba 9-11, Maps 1-2 
Cathlamet (Kathlamet) 8, 11, Maps 4, 6 
Catuquina see Katukina 
Caucasian 231, 238 
North 159, 231, 238 
East Caucasian (Nakh- 
Daghestanian) 238-9, Map 59 
West Caucasian (Abkhaz-Adyghe) 
238-9, Map 59 
South (Kartvelian) 231, 238, 
Maps 59-60 
Caviteño [Chavacano dialect] 103
Caviyari see Kaviyari 
Cayagan Negrito see Negrito, East 
Cayagan 
Cayapa see Kayapa 
Cayman Islands Creole (English) 26, 27 
Cayubaba see Kayuvava 
Cayuga 9, 11-12, Maps 1, 8-9 
Cayuse 7, 9, 12, 19, Maps 4, 6
Cayuvava see Kayuvava 
Cazcan see Kaskan 
C'ara see Chara [Omotic]
Caxa see Chaha 
Cralga [Kemant variety] 286 
Cebuano 103, 119, Map 40 
Celtic 259, Maps 63, 69 
Cemuhi 108, Map 27 
Cenderawasih Bay 108, 114-15, Map 34 
East 99, 115, Map 34 
Central Sierra Miwok see Miwok, Eastern 
Cenufara see Sinúfana
Cerquin see Serkin 
Ch'ol see Chol 
Ch'orti' see Chorti 
Cha Cui see Ruc 
Chabao (N.E. Jiarong) 170 
Chabmdo 164 
Chachapoyas Kechua (Kichua, Quechua) 
70, Map 15 
Chachi see Kayapa 
Chacobo see Chakobo
Chadian [Arabic subgroup] 281, Maps 71, 
73 
Chadic colloquial Arabic 282 
Chadic 304, 307-9, 330, Maps 71, 73, 
89-91, 94-98, 100-101 
Central (Biu-Mandara) 308, Maps 71, 
89-91, 94-97, 99, 100
East 308, Maps 89-90, 94 
West 308, Maps 71, 73, 91, 95-98, 
100-102 
Chagga (Chaga) 312, 315, 330,
Maps 85-86
Central 310
Chagos French Creole (Ilois) 216
Chaha (Caxa, Chiha) 287-8, Map 72
Chahar see Tsakhar
Chai [Omotic] 289
Chairel (Chakpa) 167
Chak [unclassified] 215
Chakali 315, Maps 103-104
Chakchiuma Map 2
Chakfam-Moshere 315, Map 96
Chakhar see Tsakhar
Chakhesang 168, 213, Map 51
Chakma 166, 215
Chakobo (Chacobo) 71, Maps 16-17,
20-21
Chakosi 315, 331, Maps 102-104
Chakpa see Chairel
Chakru see Chokri
Chala 315, Maps 102-103
Chalikha 165, Map 51
Chalon see Costanoan 
Cham see Chamic 
Cham-Mona Map 98 
Cham-Mwana 315, Map 98 
Chama 71 
Chamakoko (Chamacoco) 72, 
Maps 18-21, 23 
Chamalal 233-4, 239, Map 59
Chamba Daka 315, Maps 91, 95, 98
Chamba Lahuli [Pattani variety] 165
Chambri (Tchambuli) 113, Map 33
Chameali Map 54
Chamic 176-7, 182, 184, Maps 45, 48
Cham 181-2, 184, Map 48
Chamikuro (Chamicuro) 66, Map 15
Chamir [Khamtanga dialect] 286
Chamling (Rodong) 165-6, 214, Map 51
Chamorro-Palauan 108, Map 28
Chamorro 107-8
Champenois (French) Map 69
Chamus 315, Maps 72, 86
Chan see Laz
Chana 72, Map 22
Chang (Majung, Mojung, Manjung) 167,
213, Map 51
Changjan see Wancho
Changriwa 113, Map 33
Changsha (New Xiang) see Chinese
(Sinitic), Central
Channel Islands French see French
Chantel (Chantyal) 164, Map 51
Chao Thalee 182-3
Chaozhou (Teochiu) [Mandarin Chinese
dialect] 161-2, 181
Chapakuran 65, Maps 16-17, 20-21 
Chapakura (Wachi, Guachi, Huachi) 
65, 72, Maps 18, 20 
Chaplinsky Yupik see Siberian Yupik,
Central
Chara (Cara) [Central Nigerian] 315,
Map 97
Chara [N. Omotic] 289, Map 72
Chare-Kigwe Maps 96-97
Chari Maps 71, 73
Charrúa 72, Map 22
Chasta Costa see Tututni
Chathare Limbu 166
Chatino 53, Maps 10-11
Chatong 177-8, Map 47
Chatthare Limbu Map 51
Chattisgarhi Maps 51-52, 54
Chaudangsi 165, Maps 51, 54
Chaura see Chowra
Chaurasia (Umbale, Ombale, Wambule)
165, 214, Map 51
Chavacano [Creole Spanish] 103, 120,
Map 40
Chavikwa 315, Map 84
Chawai (Atsam) 315, Maps 96-97
Chawasha Map 2
Chawte (Chote, Chaw, Kyaw) 168, 175,
Map 51
Chayahuita see Chayawita
Chayawita (Chayahuita) 69, Maps 14-15
Chayma see Kumana 
Che see Rukuba
Che’ Wong see Che'wong 
Che'wong (Che' Wong) 121, 178, Map 44 
Chechen 233-6, 239, Map 59 
Chedepo 315, Map 106 
Chehalis 8 
Lower 17, Map 6 
Upper 24, Maps 4, 6 
Chehe see Tak Bay 
Chelkan 234, 238 
Chemakum 8, 12, Map 6 
Chembara 315, Maps 104-105, 107 
Chemehuevi see Ute-Chemehuevi 
Chenapian (Tsenap, Zenap) 113, Map 33 
Cheng see Oi 
Chepang (Cho'bang) 163, 165, 214, 
Map 51 
Chepya 173-5, Map 47 
Cherchel [Arabic dialect] Map 73 
Cheremis see Mari (Cheremis) 
Cherepong 315, Maps 102-103 
Cherkessian see Kabardian 
Cherokee 9, 12, Maps 1-2, 8-9 
Chesu 172, 175, Map 46 
Chewa-Nyanja 312, 329, Maps 75-76 
Chewa 315, 329, Maps 75-77 
Chichewa 302, 310 
Malawian see Manganja 
see also Nyanja 
Chewele 315, Map 85 
Cheyenne 7, 12, Maps 5, 8-9 
Chhantyal 214 
Chhiling 214 
Chhingtang 214 
Chi Kunda see Chikunda 
Chiapaneko (Chiapaneco) 45, 50, 53, 
Map 10 
Chiapas 50 
Chiapas Soke 51 
Chibcha-Misumalpan 64-5, Maps 12-13 
Chibchan 61-2, 64, Maps 12-13 
Chibcha (Muiska) 64, Map 12 
Chicahuaxtla Trike 53 
Chichewa see Chewa 
Chichimeko (Chichimeca) 53, Map 11 
Chickasaw 9, 12, Maps 2, 8-9 
Chicomucelteco see Kabil 
Chiguimuliya 54 
Chikaon (Txikao) 76, Map 19 
Chikitano-Bororoan 72, Maps 16, 18-21 
Chikitano (Chiquit(an)o) 72, Maps 16, 
19-21 
Chikunda (Chi Kunda, Kunda) 301, 315, 
329, Maps 74-75, 77 
Chilanga see Poté 
Chilcotin (Tsilhgot'in) 8, 12, Maps 6-9 
Chilean Kechua (Kichua, Quechua) 70 
Chilis Map 53 
Chiltepec Chinanteko 53 
Chimakuan 7-8, Map 6 
Chimane (Tsimané) 71, Map 20 
Chimariko 7-8, 12, 51, Map 4
Chimba (Zemba) [Hoanib: R30] 315
Chimba-Herero 315, Maps 74, 84
Chimbu-Wahgi 99, 99, 110, 112,
Maps 32-33 
Chimbu (Kuman) 102, 110, 112, 
Maps 32-33 
Chimila 64, Maps 12-13 
Chimúan 69, Maps 14-15, 20
Chin 168, 175, Map 51 
Central 175, Map 49 
North 168, 175, Maps 49, 51 
South 168, 175, Map 49 
Chinanteko 53 
Chinanteko (Chinanteco) 49-50, 53, 
Maps 10-11 
Chinbok see Kcho 
Chinbon see Uptu 
Chinese Indonesian (Peranakan, Baba 
Indonesian) 104, Map 25 
Chinese Malay see Baba Malay 
Chinese (Sinitic) 1, 159, 262 
beyond China 261 
in America 26-7, 161 
in Asia 103, 104, 181, 194-5, 214, 
216 
in Australia Map 36 
in Eurasia 234, 263 
influence 184, 186, 189, 191 
Cantonese see Southern, Yue (below) 
Central 161 
Gan (Anhui) 159-61, Map 45 
Gan 160-1 
Hui 160-1 
Wu 160-1, Map 45 
Chuqu 160-1, Map 45 
Oujiang 160-1, Map 45 
Taihu 160-1, Map 45 
Taizhou 160-1, Map 45 
Wuzhou 160-1, Map 45 
Xuanzhou 160-1 
Xiang (Hunan) 160-1, Map 45 
New Xiang (Changsha) 160-1 
Old Xiang 160-1 
Northern 161 
Eastern (Xiangjiang, Jianghuai) 160, 
Map 45 
Mandarin (Northern) 103, 104, 161, 
Maps 45, 50 
Taiwan (Guoyu) 159, Map 50 
North-western (Jin) 160 
South-western (Han) Map 46 
Old South-western 160-1, 183, Map 45 
Caijia 159, 161, Map 45 
Pinghua 159, 161 
Waxiang (Xianghua) 159, 161, 
Map 45 


Southern 161 
Hakka (Kejia) 103, 104, 161-3, 183, 
Maps 45, 50 
Hokkien (Min) 103, 104, 160-3, 
Map 50 
Central (Minzhong) 161 
Eastern (Mindong, Fuzhou, 
Puxian) 161 
Hainanese (Qiongwen) 103, 161, 
163 
Northern (Minbei) 161 
Southern (Minnan) 161, Map 50 
Teochiu (Teochew) 103, 104, 181 
Yue (Cantonese) 27, 103, 104, 
160-1, 183, 262, Map 45 
as lingua franca 161-2 
see also Dunganese 
Chingmengnu see Phom
Chinook Jargon 26
Chinookan 7-8, 10, Maps 6, 8-9
Chinook see Kiksht
Lower Chinook 17, Maps 4, 6
Chintang 165-6, Map 51
Chintao (Chintau, Xiandao) 171, 175,
Map 46
Chip 315, Map 96
Chipaya 70, Maps 20-21
Chipewyan (Dene Soun'line) 8, 12,
Maps 7-9
Chippewa see Ojibwe, Central Southern;
see also Ojibwe, Eastern
Chiquihuitlan Masateko 53
Chiquito, Chiquitano see Chikitano
Chiricahua 8, Maps 3, 9
Chiriguano 62, 74, Maps 19, 21, 23
Chiripa-Nyandeva 74, Maps 19, 23
Chiru 168
Chisak [Achik subdialect] 166
Chitimacha 7, 9, 12, Map 2
Chitkhuli Kanauri 165
Chitrali 211
Chittagonian 215, Map 51
Chitwania Tharu Map 51
Chiwere 9, 12, Maps 2, 5, 8-9
Cho'bang see Chepang
Chocangacakha 215
Chochenyo see Costanoan
Chochoan 53
Chocho 50, 53, Maps 10-11
Choctaw 9, 12, 27, Maps 2, 8-9
Choinumne (Choynimni) [Yokuts
dialect] 25
Choiseul 107-9, Map 29 
Central-East 109, Map 29 
East 106 
North-West 108, Map 29 
Chokó 63, Maps 12-13
Chokobo 315, Map 97 
Chokri (Chakru) 167-8, 213 
Chokwe-Luchazi Maps 74, 77, 81, 84 
Chokwe 315, Maps 77, 81, 84 
Cholan 50 
Chol 52, Maps 10-11 
Cholti 50, 52, Map 10
Cholón 69, Map 14
Chomo (Chomo-Karim) 315, 
Maps 95-96, 98 
Chon 71 
Chong 177-8, Map 48
Chontal (Tequistlateco) [Hokan] 51-2,
Maps 10-11
Lowland (Huamelulteco) 52
Chontal (Yokot'an, Yokotan) [Mayan] 52,
Maps 10-11
Chonyi 315, Map 85
Chopi [Bantu subgroup] Map 75
Chopi 315, Map 75
Chopi [Nilotic] see Dhopaluo
Chorote 72, Maps 20-23
Chorotega 50, Map 10
Chorotega-Mange 61
Chorti (Apay) 52, Maps 10-11
Choskule Rai 166
Chote see Chawte
Chowra (Chaura) [Nicobarese dialect]
178, 217
Choynimni see Choinumne
Choyo (Queyu) 170, 175, Map 45
Chrau 177-8, 184, Map 48
Chru 182, Map 48
Chu-ung 178, Map 48
Chuane Map 33
Chuave 110, 112, Map 32
Chujean 52, Maps 10-11
Chuj 52, Maps 10-11
Chuka 315, Maps 72, 85-86
Chukchansi [Yokuts dialect] 25
Chukotko-Kamchatkan (Chukchi-
Kamchatkan) 9, 231, 239, Map 55
Chuckchi-Koryak Map 55
Chuckchi (Chukchee) 234, 239,
Map 55
Chukudum 284
Chukwa 165-6, Map 51
Chulikata see Idu
Chulung 165-6, Map 51
Chulupi (Churupi, Nivaclé) 72, Maps 19,
21
Chulym Turkic see Tatar, Chulym
Chumashan 9, 12-13, Maps 3-4
Northern Chumash 9
Southern Chumash 9
Chumbia Map 10
Chumbuli 315, Map 102
Chumburung (Yeji) 315, Maps 102-103
Chungli [Ao dialect] 167
Chuqu see Chinese (Sinitic), Central, Wu
Churahi Map 54
Churauti 214 
Churupi see Chulupi 
Churuya 65, Maps 12-13 
Chushur 164 
Chutiya see Deori 
Chuvash 234-7, Map 58 
Chwabo 312, 315, 329, Map 75
Chwaka 315, Map 85 
Ci-Banda 315, Map 83 
Ci-Mongo 315, Map 83 
Cibak see Kyibaku 
Ciga see Kiga 
Ciinaluul 315, Map 78 
Ciise see af-Ciise 
Ciley 315, Map 78 
Ciluba see Luba, Ciluba 
Cinda [Basa-Kamuku subgroup 3] 
Maps 100-101 
Cinda 315, Maps 100-101 
Circassian 233, 238, Map 59 
East see Kabardian 
West see Adyghe 
Citak Asmat Map 34 
Clallam (Klallam) 8, 13, Maps 6, 8-9 
Clatskanie see Kwalhioqua-Clatskanie 
Coahuilteco (Pajalate) 7, 9, 13, 51,
Maps 2-3 
Coaiquer see Koaiker 
Coast Miwok see Miwok, Western 
Coast Tsimshian see Tsimshianic, 
Maritime Tsimshian 
Coast Yuki 25 
Coastal Malay Map 41 
Coca see Koka 
Cocama-Cocamilla see Kokama- 
Kokamilya 
Coche see Kamsa 
Cochimi-Yuman 7-8, Maps 3-5, 8-9 
Cochimi 8, 10, 13, Maps 3-4 
Cochiti see Rio Grande Keresan 
Coconuco see Kokonuko 
Cocopa 8, 13, Maps 3-4, 8-9 
Cocos Malay 216, Map 52 
Coeruna see Koeruna 
Coeur d’Alene 8, 13, Maps 5-9 
Cofan see Kofan 
Colan see Kolan 
Colapissa 19, Map 2 
Colo [Lenkan] see Kolo 
Colo (Shilluk) [Nilotic] 284, 289, Map 72
Colorado see Tsafiki 
Columbian 8, 13, Maps 6-9 
Colville see Okanagan 
Comanche 8, 13, Maps 2-3, 5, 8-9 
Comecrudan 7, 13, Map 3 
Comecrudo (Mamulique) 9, 13, 
Maps 2-3 
see also Yeméan 
Comorian 216, 311, 315, 329-30, Map 52 
Comox 8, 13, Maps 6, 8-9 
Cona Menba see Monpa, Western 
Concho Map 3 
Condul [Nicobarese dialect] 178 
Cone [Bodish] see Monpa, Western 
Cong see Phunoi 
Congaree Map 2 
Congo Map 78 
Conoy (Piscataway) 19 
Conquaco 49 
Constantine [Arabic] Map 73 
dialects Map 73 
Cook Islands see Maori 
Coorg(i) see Kodagu 
Coosan 7-8, Map 4 
Copala Trike 53 
Copper Island Aleut 7, 13 
Coptic 277, 308 
Coqui see Kogi 
Coquille see Tututni 
Cora Maps 8-9 
Corachol Maps 8-9 
Cordillera [N. America] 
Central 8 
Southeastern 8, Maps 6-7 
Cordilleran [Philippine subgroup] 119, 
Map 40 
Central 119, Map 40 
Northern 119, Map 40 
Southern 119, Map 40 
Core-Satellite [Nilo-Saharan] 306-7
Coree Map 2 
Coreguaje see Korewahe 
Cornish 263 
Corsican 261-2, Map 67 
Costanoan 8, 10, 13, Map 4 
Cosung see Lahlu 
Cotabato Manobo 119, Map 40 
Coto see Koto 
Cotoname 7, 9, 13, Maps 2-3 
Cotoque see Kotoke 
Coushatta see Koasati 
Cowlitz 8, 13, Maps 4, 6 
Coyaima see Koyaima 
Coyón see Gayon
Cree-Montagnais 7, 13, 25, Maps 7-9 
East Cree 7, 14, Maps 1, 7-9 
see also Montagnais; Plains Cree; 
Swampy Cree; Woods Cree 
Creek (Muskogee) 9, 13, 27, Maps 2, 8-9
creoles 26-7, 103-4, 216, 263, 302, 309 
Crimean Tatar 234, 236, 238, 260, Map 61 
Crioulo [Portuguese creole] 302, 309, 
311, 315, 332, Map 108-68 
see also Kabuverdianu 
Croatian 232-4, 236, 259, Maps 61-62, 
65-67 


beyond Croatia 232, 260-1, 263 
in Australia Map 36 
Croisilles 99, 112, Map 33
Cross 305-6, 316, 322, 331, Maps 92-93, 
95 
Cross-Bendi Maps 93, 95 
Lower Maps 92-93 
Upper Maps 93, 95 
Crow 9, 13, Maps 5, 8-9 
Cruzeño see Island Chumash
Cua 177-8, Map 48 
Cubeo see Kubewa 
Cueretú see Kueretú
Cuiba see Kuiva 
Cuicateco see Kwikateko 
Culina see Kulina 
Cuna see Kuna 
Cundimamarca 64 
Cunhua Li 179 
Cunza see Kunsa-Kapishana, Kunsa 
Cuoi see Ruc 
Cupan 9, Maps 3-4, 8-9 
Cupeño 9, 13-14, Maps 3-4
Curuaia see Kuruaya 
Cusabo Map 2 
Cushitic 285-6, 304, 307, 309, 330 
Central (Agaw) 285-6, Map 72 
East 285-6, Maps 72, 85-86 
South 285, 287, Maps 72, 76, 85-86 
Cuvak-Mafa (D) Map 94 
Cuvak (Cuvok) 315, Map 94 
Cuveo see Kubewa 
Cuyama Map 3 
Cuzco Quechua see Quechua 
Cwa 315, Map 80 
Cypriot Arabic (Kormakiti) Map 70 
Czech 232-4, 236, 260, Maps 62, 65 


Daarood [dialect group] 287 
Daasanač see Dasenech
Daasé see Burji 
Daba Map 94 
North Map 94 
South Map 94 
Daba 315, Map 94 
Dabida 315, Map 85 
Dabra (Taworta?) 115, Map 34 
Dabu see Agob 
Daco-Romance Map 61 
Daco-Rumanian see Romanian 
Dadibi (Daribi, Mikaru) 111, Map 32 
Dadiya Map 98 
Dadiya see Looliya 
Dafla 169, 213 
Dafo-Batura 315, Map 96 
Daga (Dimuga) Map 31 
Daga-Nyonga (Samba Leeko) [Leko] 315, 
Maps 93, 95 
see also Leko 
Dagaari (Dagaare) 311, 315, 
Maps 103-105 
Dagalag 101, 117, Map 35 
Dagan 99, 110, Map 31
Daga (Dimuga) 99, 110, Map 31 
Dagana 315, Map 99 
Dagaura see Rana Tharu 
Dagbani 315, Maps 102-104 
Daghestanian 239 
Dagoman 116, Map 35 
Dagur see Daur [Mongol] 
Dahalo 287, 315, Maps 72, 85 
Dahating 112, Map 33 
Dahejla (Darhejia) [Mongolian dialect] 
191, Map 45 
Dahomey Ful 315, Map 102 
Dahula 166 
Dahur see Daur [Mongol] 
Dai [Gur-Adamawa] 315, Maps 89-90 
Dai (South-Western Thai, Shan) 179-80, 
Map 46 
Dai Lue (Ly, Xishuangbanna Dai) 179, 
184, Map 46 
Dai Mao (Dehong Dai, Shan Tayok) 
179-80, 184, Maps 46, 49 
Dai Neua 179-80, 184, Maps 46-47 
Dai Sa (Dai Duan, Dai Luomi, 
Bumang) 179-80, 184, Map 46 
Dai [Timor] 119, Map 38 
Daingna, Daingnet see Tongcengnya 
Dair see Dauri 
Daju 283, 307, Maps 71-72, 79, 89-90
Daju 283-5, 315, Maps 89-90 
East Daju 284, 307, 316, Map 79 
West Daju 284 
Dakaka 108, Map 27 
Dakelh see Carrier 
Dakhla (Dakhla) 281, Map 71 
Dakkang 177-8, Map 47 
Dakoid 305, 306, 331, Maps 91, 95, 98 
Dakotan 9, 14, Maps 1, 5-9 
Dakota see Sioux 
Dakpa 315, Map 89 
Dakpakha 165, 215, Map 51 
Dakunza [Gumuz dialect] 285 
Dalabon 116, Map 35 
Dalmatian 260 
Daloa 315, Map 105 
Daly [subgroups] 116, Map 35 
Eastern 100, 116, Map 35 
Northern 100, 116, Map 35 
Southern 100, 116, Map 35 
Western 100, 116, Map 35 
Dama 315, Map 91 
Damar 119, Map 38 
Dambiya [Kemant variety] 286 
Dameli 211, Map 53 
Damen 308 


Dampelas(a) 120, Map 39 
Dan (Gio) [Mande] 311, 315, 
Maps 105-106 
Dan Muure 315, Maps 91, 95 
Danakil see Afar 
Danao 119, Map 40 
Danaru 112, Map 33 
Danau [nr Inle Lake, Burma] 176 
Danaw 178, Map 49 
Dangal 111, Map 32 
Dangaura Tharu Maps 51, 54 
Dangla-Migama Maps 89-90 
Dangla 315, Map 90 
Dangme see Adangme 
Dani Pidgin, Dani Police Talk 105, 
Map 25 
Dani-Kwerba 99 
Dani 99, 114, Map 34 
Grand Valley Dani 114, Map 34 
Western Dani 114, Map 34 
Danish 233, 259, 261, 263-4, Maps 62, 
64-65 
East see Scanian 
Insular Map 64 
in North America Maps 8-9 
Danjong 163-4, Map 51 
Danu 171, 175, Map 49 
Danuwar (Dhanwar) 214, Map 51 
Daofu see Ergong 
Daonda 112, Map 33 
Darai 214, Map 51 
Darang Deng see Taraon ‘Mishmi’ 
Darasa see Gedeo 
Dardic 216, Maps 53-54 
Dargi Map 59 
Dargi (Dargwa) 234-5, 239, Map 59 
Darhan [Ulanqab subdialect] 191 
Darhejia see Dahejla 
Dari [S.W. Iranian] 192, 211, 216, Map 53
Dari [Persian variety] 237 
Daribalang 102, 117, Map 35 
see also Goeng-Goeng 
Daribi see Dadibi 
Dariganga 188 
Darkhad 188, Map 45 
Darkinyung 102, 118, Map 35 
Darlong 166 
Darmiya 165, Maps 51, 54 
Darood-Digil Map 72 
Daru [Tibeto-Burman] see Dvru 
Daru Kiwai [S. Kiwai dialect] 105 
Das Map 98 
Das (Dass) 315, Maps 96-98 
Dasenech (Daasanač, Galab) 287, 315,
Maps 72, 86 
Dathina 280, Map 70 
Datoga 307, 315, Map 86 
Daur (Dahur, Dagur) [Mongol] 187, 
191-2, Map 45 
Dauri (Dair) 284, Map 71 
Davaweño [Chavacano dialect] 104
Davaweño (Davaoeño) [mixed language]
104, 119, Maps 25, 40 
Daw see Kaman 
Dawari see Ambasi 
Dawasir Map 70 
Dawawa 110, Map 31 
Dawera-Daweloor 119, Map 38 
Dawro 289 
Day Maps 89-90 
Dai Maps 89-90 
Dayak see Land Dayak 
Dayr al-Ziar (Qoltu) Map 70 
Daza [Saharan] 301, 331 
Dciriku see Gciriku
De'ang see Palaung-Wa, Palaungic 
De (Dewoin) 315, Map 106 
Debaba 315, Map 90 
Debbarma 166 
Debri see Kadaru-Debri 
bDechen 164 
Dedua 110, Map 32 
Defaka-Nkoro Map 101 
Defaka-Nkoro 315, Map 101 
Deg (Degha, Mo) 315, Maps 103, 105 
Deg Hit'an see Ingalik 
Deg Xinag see Ingalik 
sDege 164 
Degema 315, Map 101 
Degenan 112, Map 33 
Degha see Deg 
Degodia, Degodiya see af-Degodiya 
Dehong Dai see Dai, Dai Mao 
Dekoka [Gumuz dialect] 285 
Delaware (Lenape) 7, 17-19, 23-6, 
Maps 8-9 
Canadian Delaware see Munsee 
see also Unami 
Delaware Pidgin (or Jargon) 26 
Delo (Ntrubo) 315, Maps 102-103 
Delta Arabic 
Central 280, Map 71 
Eastern 280-1, Map 71 
Western 281, Map 71 
Delta Cross 331 
Delta Igboid Map 101 
Delta Igbo 315, Map 101 
Dem 99, 99, 114, Map 34
Demta 115, Map 34 
Dena'ina see Tanaina 
Dendi [Gur-Adamawa: Ngbandi] 315, 
Maps 82, 89 
Dendi [Nilo-Saharan: Songhay] 315, 331, 
Map 102 
Dene 8, 14, Maps 7-9 
Dene Soun'line see Chipewyan 
Dengese see Ndengese
Dengfap 315, Map 93
Dengnak see Tongcengnya 
Deni 67, Map 17 
Denje (Na) 316, Maps 89-90 
Denkwop 316, Map 93 
Denya 316, Maps 93, 95 
Deori (Chutiya, Deuri) 166, 213, Map 51 
Dera (?) [Chadic: Bole] 316, Map 98 
Dera (Kanakuru) [Chadic: Bole] 316, 
Maps 91, 98 
Dera (Dra, Kamberatoro) [Papuan] 112, 
115, Maps 33-34 
Derasa see Gedeo 
sDerong 164 
Desana-Siriana 68, Maps 12-13, 16 
Dese 284, Maps 71-72 
Desert, Western [Australia] 100, 103 
Deuri see Deori 
Dewiya [Gumuz dialect] 285 
Dghwe'de 316, Map 94 
Dhaangu 100, 116, Map 35
Dhafir 279, Map 70
Dhaiso 330
Dhammai (Miji) 169, Map 51
Dhangar Kurux (Jhangar) 214, Map 51
Dhanggagarli 102, 118, Map 35
Dhanwar see Danuwar
Dharambal (Dharumbal) 102, 117,
Map 35
Dharawal 102, 118, Map 35
Dharuk 102, 118, Map 35
Dharumbal see Dharambal
Dhay'yi 100, 116, Map 35
Dhe Lwo see Luo
Dhegiha 9, Maps 2-3, 5, 8-9
Dhimalish 168-9, Map 51
Dhimal 165, 169, 214, Map 51
Dhirrari 101, 117, Map 35
Dhofar 287
Dhopaluo (Chopi) 316, Maps 72, 79, 86
Dhudhuroa 102, 118, Map 35
Dhundari Map 54
Dhurga 102, 118, Map 35
Dhuwal 100, 116, Map 35
see also Gubabuyngu
Dhuwal-Dhuwala 103, Map 36
Dhuwala 100, 116, Map 35
Dhuwaya 100, 116, Map 35
Dia 316, Map 81 
Dibabawon 119, Map 40 
Dibo 316, Map 101 
Dibumum 316, Map 93 
Dida Map 105 
Dida 316, Map 105 
Didessa 289 
Didinga 284, 307, 316, Map 86 
Dido see Tsez 
Didra [Todrah dialect] 177 
Diegueño-Cocopa 8, 14, Maps 3-4, 8-9
Diegueño (Kumeyaay) 8, 14,
Maps 3-4, 8-9 
Digarish Mishmi 168-9, 175, Maps 45, 51 
Digil [dialect group] 287 
Digo 316, 330, Maps 76, 85 
Digul, Upper 114, Map 34 
Dihbri [Mnong dialect] 177 
Dii 316, Maps 90-91 
Dilling (Warki-m-be) 284, Map 71 
Dilpali 166 
Dimali 166 
Dimasa (Hills Kachari) 166, 213, Map 51 
Dimba 316, Map 84 
Dimbambang 316, Maps 92-93 
Dimbong 316, Map 93 
Dime [Cushitic] 288, 289, Map 72 
Dime [Trans-New Guinea] see Ginuman
Dimir-Malas 112, Map 33 
Dimir 112, Map 33 
Dimuga see Daga 
Ding 316, Map 81 
Dingri 164 
Dinik Maps 71-72 
Dinka see Jieng
Diodio 110, Map 31 
Diola-Papel Map 108 
Diola 306, 308, 311, 332, Map 108 
Diola-Fogny 312, 316, Map 108 
Dirak 108, Map 27 
Dirasta see Gidole 
Dirayta see Gidole 
Diriku Map 84 
Dirim 316, Maps 91, 95 
Dirya 316, Maps 97-98 
Disa 316, Map 89 
Disoha [Gumuz dialect] 285 
Ditidaht see Nitinaht 
Divehi see Maldivian 
Divenie 316, Map 83 
Diyari 101, 117, Map 35
Dizoid 288, 289, Map 72
Dizi (Dizi nuu, Maji) 288, 289, Map 72 
Djamindjungan 116 
Djamindjung 116, Map 35 
Djang [Kombio] see Aruek 
Djapa 68 
Djawi 116, Map 35 
Djeebbana 116, Map 35 
Djendewal 102, 118, Map 35 
Djerba see Jarba 
Djinang 100, 116, Map 35 
Djinba 100, 116, Map 35 
Djingili-Wambayan 108, 115-16, Map 35 
Djingili 116, Map 35 
Djongkang 120, Map 41 
Djula [Mande] 311 
Dobase 287 
Dobel 115, Map 34
Dobokubi see Bari
Dobu-Duau 110, Map 31 
Dobu 105, 110, Map 31 
Doda Siraji Map 54 
Dodos see Dodoth 
Dodoth (Dodos) 316, Maps 72, 86 
Doe 316, Maps 76, 85 
Doemak 316, Map 96 
Doga 110, Map 31 
Dogo Dum 316, Map 104 
Dogon 305-6, 332, Maps 73, 104 
Dogon 312-14, 316, Maps 73, 104 
Dogon-Congo 304 
Dogoro 110, Map 31 
Dogose (Dogosye) 316, Maps 104-105 
Dogri 212-13, Map 54 
Dogrib 8, 14, Maps 7-9 
Dohoi 120, Map 41 
Doingnak see Tongcengnya 
Doka 316, Maps 96-97 
Doko [Cross] 316, Map 93 
Doko [Ometo] 289, Map 72 
Doko-Yumba [Bantu] 316, Map 82 
Dolakha Newari 165 
Dolbohaante see af-Dolbohaante 
Dolgan 234, 238 
Dolo [Basketo dialect] 289 
Dolpo 164, Map 51 
Domaki (Dumaki) Map 53 
Domari 211 
Dombondola 316, Map 84 
Dominica Creole [French] 27 
Dompago see Likpa 
Domu 110, Map 31 
Domung 112, Map 33 
Dondo [Bantu] 316, Map 83 
Dondo [Sulawesi] 120, Map 39 
Dong, Dong see Kam-Sui (Dong), Kam 
(Dong) 
Dong [Bantoid] 316, 331, Map 98 
Dongmeng 183-4 
Dongnu see Bunu 
Dongo [Central Sudanic: Kresh dialect] 
284 
‘Dongo [Ubangi] 313, Map 79 
Dongolawi see Kenuz-Dongola 
Dongotono [Lotuko dialect] 284 
Dongxiang (Sharaigol Mongol, Santa) 
191, Map 45 
Doni (Jirru) 284, Maps 71-72 
Donno So 316, Map 104 
Dooyaayo (Dooyaangyo) 316, Maps 91, 
98 
Doraske Changena 64, Map 12 
Dori'o 109, Map 29 
Doriri 110-11, Maps 31-32 
Doromu 110, Maps 31-32 
Dorro 111, Map 32 
Dors(h)a see Sheko 
Dorungkecha Rai 166 
Dörvöd 188
Dörvön Hüühed (Siziwang) [Ulanqab
subdialect] 191
Dorze 288, Map 72 
Dotanga 316, Map 93 
Douirat Map 73 
Doura 111, Map 32 
Downriver [Nez Perce dialect] 19 
Dra see Dera 
Dravidian 163, 213, 217, Maps 51-54
Central 217 
North 217 
Southern 217 
Drehu 108, Map 27 
Driafleisuma see Mehek 
Dru, Druna see Ngiti 
Drukpa see Dzongkha 
Dschang see Yemba 
Dual [Achik subdialect] 166 
Duala Maps 92-93 
Duala 311, 316, Map 92 
as lingua franca 331 
Duan (Halang Doan) 177-8, Map 48 
Duano' 121, 183, Map 44 
Duau 110, Map 31 
Dubea 108, Map 27 
Dubu 114, Map 34 
Dubulu 316, Map 83 
Duduela 112, Map 33 
Dugbo 316, Map 105 
Dugukan see Wor 
Duguri 316, Maps 96, 98 
North-East 324, Map 97 
South-West 327, Map 97 
Duguza 316, Maps 96-97 
‘Dugwor 313, Map 94
Duit 64 
Duka 306, Maps 100, 102 
Duka 316, Maps 100, 102 
Duke of York 109, Map 30 
Dukuna [Gumuz dialect] 285 
Dulbu 310, 316, Maps 97-98 
Duleri Dom 316, Map 104 
Duli Map 91 
Duli 316, Map 91 
Dullay (Werize) 286-7, Map 72 
Dullay Map 72 
Dulong (Dulung, Trung) 169, 175, 
Maps 45-46 
Duma 316, Maps 83, 88 
Dumagat 119, Map 40 
Casiguran Dumagat 119, Map 40 
Umiray Dumaget 119, Map 40 
Dumaget see Dumagat 
Dumaki see Domaki 
Dumbule 316, Maps 92-93 
Dumi 165-6, 214, Map 51 


Dumpas 121, Map 41 
Dumpu (Watiwa) 112, Map 33 
Dumun (Bai) 112, Map 33 
Dumut 114 
Duna-Pogaia 99, 112, Map 33 
Duna 99, 112, Map 33 
Dunganese (Hwei, Hui) 234-6, 239, 
Map 57 
Dungi 316, Map 97 
Dungmali 165-6, 214, Map 51 
Dupaninan Agta see Negrito, East
Cagayan
Dura 164, 214, Map 51
Durango Nahuatl see Nahuatl 
Duranmin 113, Map 33 
Durbeye 316, Maps 91, 94 
Duriankere 114, Map 34 
Duro see Turo 
Duru Maps 90-91, 95, 98 
Duru-Dii Map 91 
Durubulic 102, 118, Map 35 
Duruma 316, Map 85 
Dusun Deyah [Barito] 120, Map 41 
Dusunic [N.E. Borneo] 121, Map 41 
Dusun Map 41 
Dutch see Netherlandic 
Duungidjawu [Wakka-Wakka dialect] 
102 
Duupa 316, Map 91 
Duwai 316, Map 100 
Duwet 111, Map 32 
Duwiret Map 71 
Dvngsar (Tangsarr) 169, 175, Map 49 
Dvru (Daru) 169, 175, Map 49 
Dwentza Ful 316, Map 104 
Dyaberdyaber 116, Map 35 
Dyabugay 101, 117 
Dyak see Dayak; Land Dayak 
Dyakanka 316, Map 106 
Dyan-Lobi Maps 103-105 
Dyan 316, Maps 103-105 
Dyangadi 102, 118, Map 35 
Dyapa (Southern Katukina) 68, 
Maps 14-17 
Dye 316, Map 102 
Dyimini 316, Map 105 
Dyirbal 101, 117, Map 35 
Dyirringany 102, 118, Map 35
Dyuezun 116, Map 35 
Dyula [Manding] 302, 302, 316, 
Maps 104-105 
Dzalakha (Cona Menba) 165, 215, 
Map 51 
Dzhek see Kryz 
Dzihana see Jibana 
Dzodinka 316, Map 95 
Dzongkha (Dzonkha, Drukpa) 163-4, 
214-15, Map 51 
Dzunza 316, Map 85 


E 109, Map 30 
East Cape Yupik (Naukan Yupik) 7, 14, 
Map 7 
East Cree see Cree-Montagnais 
Easter Island 107, Map 26 
Eastern Canadian Inuktitut see Inuktitut 
Eastern Eskimo see Inuit 
Eastern Trans-Fly see Trans-Fly 
Ebira 316, Map 101 
Ebkuo see Kuo [Grassfields] 
Eboi 316, Map 83 
Ebonics (African American Vernacular) 
27 
Ebrie 316, Map 105 
Echie 316, Map 101 
Ede see Rhade 
Ederah 114, Map 34 
Ediamat 316, Map 108 
Edoid 306, 331, Map 101 
Central Map 101 
Edo (Bini) 312, 316, Map 101 
Delta Map 101 
Northwest Map 101 
South Map 101 
Eel River Athabaskan 8, 14, Map 4 
Efate, South 108, Map 27 
Efe 316, Map 79 
Efik 316, 331, Maps 92-93, 101 
Efik-Ibibio 312 
Efium 316, Maps 95, 101 
Efutop 316, Maps 93, 95 
Efutu 316, Map 103 
Ega Map 105 
Ega 316, Map 105 
Egan 316, Map 101 
Egbema 316, Map 101 
Egene see Engeni 
Eggon 316, Maps 95-96 
Egyptian 277, 308 
Egyptian Arabic 279-80 
Ehabo 316, Map 93 
Ehanda 316, Map 84 
Ehow 316, Map 93 
Eipo-Mek 114, Map 34 
Eitiep 113, Map 33 
Eivo 109, Map 29 
Ejagham 316, Maps 93, 95 
East 316, Map 93 
Ejine (Ejin) 191, Map 45 
Ekagi-Wodani-Moni 114 
Ekagi 114, Map 34 
Ekajuk 316, Map 95 
Eki 316, Maps 87, 91-93 
Ekit 316, Maps 93, 101 
Eko [Yoruboid] 316, Maps 101-102 
Ekoid-Nyang 306, Maps 93, 95 
Ekoid Maps 93, 95 


Ekoid-Mbe Map 95 
Ekombe 316, Maps 92-93 
Ekota 316, Maps 79-81 
Ekpanu (Logba) 316, Maps 102-103 
Ekpeye 316, Map 101 
El Nayar Cora see Cora 
Elem [Ogoni Cross] 316 
Elem-Ogoi (Eleme-Ogoi) 316, 325, 
Map 101 
Eleman 99, 99, 111, Map 32
Eastern 111, Map 32 
Western 111 
Eleme-Ogoi see Elem-Ogoi
Elepi (Sampi) 113, Map 33 
Elkei 113, Map 33 
Elmolo 287, 316, Maps 72, 86 
Eloyi 316, Maps 95-96 
Elung 316, Map 93 
Emae 107-8, Maps 26-27 
Eman 316, Map 95 
Embera 63, Maps 12-13 
Embu 316, Maps 72, 85-86 
Emerillon 74 
Emerum 111, Map 33 
Emilian (Italian) 261, Map 67 
Emira-Mussau 109, Map 30 
Emmi 116, Map 35 
Emok see Maskoi 
Empeo see Zeme 
Emumu 112, 114, Maps 33-34 
En Wa 176 
Enar 288 
Ende-Lio 119, Map 38 
Enekan see Woj 
Enenga 316, Map 88 
Enets (Yenisey Samoyed) 233-4, 238, 
Maps 55-56 
Enga [N.W. Bantu] 316, Maps 82, 87 
Engan [Trans-New Guinea] 99, 99, 110, 
112, Maps 32-33 
Enga 99, 99, 110, 112, Maps 32-33
Enga 102, 110, 112, Maps 32-33 
Engeni (Egene) 316, Map 101 
Enger (Engle) 192 
Enggano 109, 121, Map 43 
Engle see Enger 
English 259, 263, 308, Maps 63, 67-69 
abroad 
in Africa 261-2, 282, 302, 303, 
310-11, Map 74 
South Africa 309, 312, 316, 329, 
332-3 
in the Americas 26-7, Maps 8-9 
Northern Central-American 26, 
Map 11 
South and Meso-America 
Maps 11, 13, 23 
in Asia 162, 188, 193-5, 213-16, 
Map 52 
in Australia/Oceania 102, Map 36 
creoles 26, 27, Maps 9, 11, 13, 25,
106 
in E. Europe 232-4 
pidgins 302, 309, 311, 325 
see also Australian Pidgin; 
Bislama; Dani Pidgin; 
Hawaiian English; Liberian 
Pidgin; Nigerian Pidgin; 
Pacific Pidgin; Pidgin Dobu; 
Police Motu; Solomon Pijin; 
Tok Pisin; Torres Strait 
Pidgin; Weskos 
British Black 263 
Channel Islands Map 63 
Hiberno-English 
Northern Map 63 
Southern Map 63 
Manx Map 63 
Midlands and East Anglian Map 63 
Northern Map 63 
Scots 
Highlands and Islands 263, Map 63 
Lowlands 263, Map 63 
Ulster 263, Map 63 
South-Eastern Map 63 
Southern and South-Western Map 63 
Welsh Map 63 
Enindhilyagwa 116, Map 35 
Enlhet see Sanapana-Enenlhit 
Enow 316, Map 93 
Enxet see Lengua 
Enya Maps 79-81 
Enya 316, Map 79 
Enyele 316, Map 86 
Enyom Map 101 
Enyom Igbo 316, Map 101 
Eotile 316, Map 105 
Epie-Atisa 316, Map 101 
Erap 110, Map 32 
Erei 316, Map 101 
Erem 113, Map 33 
Erenga 283-4 
Erengo Maps 71-72 
Ergong (sTau, Daofu) 170, 175, Map 45 
Erie Map 1 
Erima-Duduela 112 
Erima 112, Map 33 
Erokwanus 115, Map 34 
Erre 116, Map 35 
Erromanga 108
Ersu (Tosu, Lüsu, Lüzi) 170, 175, Map 45
Eruwa 316, Map 101 
Erzya Mordvin see Mordvin 
Esan 316, Map 101 
Esaw 11 
Ese'eha (Ese Ejja, Ese Exa) 71, 
Maps 14-15, 17, 20-21 


Esel 316, Map 87 
Eselen see Esselen 
Esimbi 316, Map 95 
Esinga 316, Map 84 
Eskimo 234 
Eskimo-Aleut 7, 9, 239, Maps 7-8, 55
Eskimoan 7, 234, Maps 7-8, 55
Eastern Eskimo (Inuit) 7, 15,
Maps 7-8
Western Eskimo (Yupik) 7, 239,
Maps 7-8
Esselen (Eselen) 7-8, 14, 51, Map 4
Estonian 232, 234, 238, Maps 58, 62, 64 
South 232, 238 
Estremenho (Portuguese) Map 68 
Etchemin 7, 14, Map 1 
Ethio-Semitic (Ethiopian Semitic) 277, 
287 
Northern 287, Map 72 
Southern 287-8 
Outer 288, Map 72 
Transversal 287-8, Map 72 
Ethiopic see Ge‘ez 
Eton 316, Map 92 
Etono 316, Map 101 
Etsako (Yekhee) 316, Map 101 
Etulo 316, 331, Map 95 
Etung 316, Maps 93, 95 
North 324, Map 93 
South 327, Map 93 
Euchee see Yuchi 
Eudeve 54, Map 3 
Eunda-Kolonkodhi 316, Map 84 
Euskara see Basque 
Evale 316, Map 84 
Evand 316, Map 95 
Even (Lamut) 234, 238, Map 55 
Evenki (Ewenke, Solon, Tungus) 187, 
192, 234, 238, Maps 45, 55-56 
Khamnigan Evenki 189 
Ewage see Notu 
Ewaku 316, Map 82 
Ewe 302, 311-12, 316, 331-2, 
Maps 102-103 
Ewenke see Evenki 
Ewondo (Yaunde) [Fang-Mbeti] 302, 303, 
316, Map 92 
Eyak-Athabaskan 8, Maps 2-9 
Eyak 8, 10, 14, Maps 7-9 
Eyeish Map 2 
Eza 316, Map 101 
Ezha (Aza) 288, Map 72 


Fa’ 316, Map 93 
Fadashi [Berta dialect] 285, 290 
Fadl 279, Map 70 
Faeroese Map 64 
Faga-Uvea 108, Map 27 
Fagani 109, Map 29 
Fagululu 110, Map 31 
Faire Atta see Atta 
Faita 111, Map 33 
Faiwol 112, Map 33 
Falam 168 
Falasha (Kayla) [Kemant variety] 286 
Fali 305, Maps 91, 94 
Fali Proper Map 94 
Fali 316, Map 94 
North Maps 91, 94 
South Map 91 
Fam Map 95 
Fam 316, Map 95 
Fanakalo [Bantu pidgin] 316 
Fang 331, Maps 83, 87-88, 91-93 
Fang-Mbeti (Fang-Beti) 303, 311, 316, 
330, Maps 87, 92 
Fang 302, 311, 330, Maps 88, 92 
Fanian 316, Maps 89-90 
Fante 303, 310, 316, Maps 102-103 
Farafira 281, Map 71 
Faroese 259, 263-4 
Fars Map 70 
Farsi see Persian 
Fasu 111, Map 32 
Fataluku 119, Map 38 
Fe'fe' 316, Map 93 
Fer 316, Map 89 
Fez [Arabic dialect] Map 73 
Fid‘an Map 70 
Fijian 107, Map 26 
Pidgin Map 25 
Filipino (Pilipino) 102-3 
Fillihi (Alqosh) 278, Map 70 
Finisterre-Huon 99, 99, 110, Maps 30, 
32-33 
Finisterre 99, 110, Maps 30, 32-33 
Finno-Ugric 259, Maps 55-56, 58, 61-62, 
65 
Finnic 259 
Finnish (Suomi) 231, 234, 238, 259, 
263, Map 64 
Finungura 110, Map 32 
Fipa 316, Maps 76-78 
Fiqiq Map 73 
Fiwaga 110, Map 32 
Fiyadikka 283-4 
Flathead see Spokane-Kalispel-Flathead 
Flemish see Netherlandic/Dutch 
Flinders Island 101, 117, Map 35 
Flores-Lembata 119, Map 38 
Foau 115, Map 34 
Foe 110, Map 32 
Fon 311, 317, 331, Map 102 
Fongoro see Gulu 
Foochow [Hokkien dialect] 163 
For see Fur 
Forak 112, Map 33
Fore 110, Map 32
Forest Uriankhai see Tsaatan 
Formosan 98 
Fox 22 
see also Sauk-Fox-Kickapoo, Sauk-Fox 
Frafra-Nankani (Gurenne) 317, Map 103 
Francien (French) Map 69 
Franco-Provençal 262, Map 69
Franconian 
Low Map 65 
Upper Map 65 
Fraser Island see Badjala 
Fraser/Nechako see Carrier 
French 259, 261-2, Maps 63, 65, 67-69 
Channel Islands 259, 263, Map 63 
creoles 26, 27, 262, Maps 9, 13 
Haitian 26, Map 9 
Langue d’Oc 262, Map 69 
Provençal 262, Maps 65, 67, 69
Langue d’Oil 262, Map 69 
in Africa 281, 302, 303, 310, 311, 330 
in Asia 194, 216, Map 52
in Australia Map 36 
in E. Europe 233-4 
in Meso- and South America 
Map 13 
in North America 26-7, Maps 8-9 
Acadian (Cajun) 27, Maps 8-9 
Louisiana Creole (Négue) 27 
pidgins see New Caledonia Pidgin 
French 
see also Auvergnat; Bourgignon; 
Champenois; Francien; Franco-
Provençal; Gascon; Limousin;
Lorrain; Normano-Picard; 
Poitevin; Savoyard; Walloon 
Frisian 259 
East 259, 261, Maps 64-65, 69 
North 259, 261, Maps 64-65 
West (Fries) 259, 262, Maps 65, 69 
Friulian 259, 261, Maps 65, 67 
Fu'da 317, Map 93 
Fu'sap 317, Map 93 
Fuai 108, Map 27 
FuDa see Fu'da 
Fula-Serer Map 108 
Fula-Wolof Maps 73, 107-108 
Fulani (Ful) 282, 301, 306, 308-9, 
311-12, 330-2, Maps 73, 79, 
89-91, 94-95, 97, 99-102, 104, 
106-108 
Fulbe 282, Maps 71-72 
Fuliro 317, Map 80 
Fulnio 73, Maps 18-19 
Fumu 317, Maps 81, 83 
Fumuwumbu 317, Map 81 
Fundi (Shirazi) 317, Map 85 
Fungor Map 72 
Fur 283, 307, Map 89 
Fur (For, Furakan) 283, 317, Maps 71, 
89 
Furi 317, Maps 103-104 
Furu 317, 330, Maps 82, 89 
Fusap see Fu'sap 
Futa-Jalon Ful 311, 317, Map 107 
Futuna, East 107, Map 26 
Futuna-Aniwa 107-8, Maps 26-27 
Fuyuge 111, Map 32 
Fuzhou see Chinese (Sinitic), Southern, 
Hokkien (Min) 
Fwe 317, Maps 74, 77 
Fyam (Fyem) 317, Maps 96-97 
Fyer Maps 96-97 
Fyer 317, Maps 96-97 


Ga see Ga-Dangme, Ga 
Ga‘aliyyin 281, Maps 71-72
Ga‘aliyyin Maps 71-72 
Ga'ana (Ga'anda) 308, 317, Maps 91, 98 
Ga-Dang [N. Philippine] 119, Map 40 
Ga-Dangme [Kwa] 305, 311, 332, 
Maps 102-103 
Ga 317, Maps 102-103 
Gaagudju see Gagudju 
Gaam see Tabi 
Gabadi 111, Map 32 
Gabalbara 101, 117, Map 35 
Gabiano 113, Map 33 
Gabra 286, 317, Maps 72, 85-86 
Gabri 317, Map 90 
Gabrielino (Tongva) 9, 14, Maps 3-4 
Gabutamon 112, Map 33 
Gadaba (Kondekor Gadaba) 213, 217, 
Maps 51-52 
see also Gutob 
Gadang [Chadic] 317, Map 90 
Gaddang [N. Philippine] 119, Map 40 
Gaddi, Gadi see Bharmauri 
Gade 317, Map 101 
Gadjerawang 116, Map 35 
Gadsup-Auyana-Awa 110, Map 32 
Gadsup 110, 112, Map 32 
Gaelic (Scots Gaelic) 259, 263, Map 63 
see also Irish 
Gafat 288, Map 72 
Gagauz 234-5, 238, Maps 58, 61
Gagu see Gban 
Gagudju (Gaagudju) 100, 116, Map 35 
Gahar, Gahri see Bunan 
Gahuku 110, 112, Maps 32-33 
Gahuku-Asaro 110, 112, Maps 32-33 
Gaikunti see Sawos 
Gaina 110, Map 31 
Gal 112, Map 33 
Galab see Dasenech 
Galambi (Galambu) 317, Maps 97-98 
Galela 118, Map 37 


Galeshi 216, Map 53 
Galeya 110, Map 31 
Galibi see Kari'nya 
Galice-Applegate 8, 14, Map 4 
Galician [Ibero-Romance] 259, 262, 
Map 68 
Galila see Aari 
Galimbamu 101, 117, Map 35 
Galla see Oromo 
Galler [Bru dialect] 177 
Gallo-Romance 259, Maps 63, 65, 67-69 
in Asia Map 52 
in North America Maps 8-9 
Galo (Gallong) [Tibeto-Burman] 169 
Galoli 119, Map 38 
Galu 113, Map 33 
Galwa 317, Map 88 
Gamargu 317, Maps 94, 99 
Gamba 317, Map 90 
Gambera 116, Map 35 
Gambiri see Tregami 
Gamei 113, Map 33 
Gamela 73, Map 18 
Gamila see Berta 
Gamilaraay-Yuwaalaraay-Yuwaalayaay 
102, Map 35 
Gamilaraay 117 
Gamo (Gemo dona) 288, 289, Map 72 
Gan [Anhui] see Chinese (Sinitic), Central, 
Chinese (Sinitic), Central, Gan 
Gan [Gur] see Gan-Dogosye, Gan 
Gan [Mwa-Gan] see Ben 
Gan Yi see Aluo 
Gan-Dogosye [Gur] Maps 104-105 
Gan 317, Maps 104-105 
|Gana (G||ana) [Khoisan] Map 74
Gana [N.E. Borneo] 121, Map 41 
Ganan [Sak] 167, 175, Map 49 
Ganati see Kenati 
Ganay 102, 118, Map 35 
Ganching [Achik subdialect] 166 
Ganda [Bantu] Map 86 
Luganda (Ganda) Maps 79-80, 86 
Ganda [Umbundu] 317, Map 84 
Gangam 317, Map 102 
Gangela 317, Map 77 
Ganglau 112, Map 33 
Gangte 168, 213 
Gangulu 101, 117, Map 35 
Ganja (Ja) [Atlantic] 317, Map 108 
Ganja [Trans-New Guinea] 112, Map 33
Gants 111, Map 33 
Ganyoa 317, Map 105 
Ganza 285, 288, 289, Map 72 
see also Gumuz 
Gao 109, Map 29 
Gaoyang [Yue dialect] 160 
Gapun (Taiap) 99, 113, Map 33 
sGar (Gartok) 164, Map 45 
Gar [Mnong dialect] 177 
Gara [Achik subdialect] 166 
Garawan 100, 108, 115-16, Map 35 
Garawa 116, Map 35 
Garhwal Bhotia see Jad 
Garhwali 213, Map 54 
Gari [Austronesian] 102 
Gari [Himalayan] see Bunan 
Garia see Sumau 
Garifuna 61, 66, Map 13 
Garig 116, Map 35 
Garingbal 101, 117, Map 35 
Gariyimar 116 
Garlali 101, 117, Map 35 
Garo (Bosha) [Omotic] 289 
Garo [N.E. Borneo] 121, Map 41 
Garo [N.E. India] 166, 213, 215, Map 51
Garre Map 72 
Garreh 317, Map 85 
Garrwan see Garawan 
Garú 66, Map 13
Garuh 112, Map 33 
Garus 112, Map 33 
Garuwahi 110, Map 31 
Garza (Comecrudan) 9, 13, Map 2 
Gascon (French) Map 69 
Gashowu 8, Map 4 
Gasmata 109, Map 30 
Gato 286 
Gauri (Kauri, Hkauri) 167 
Gauro Map 53 
Gavar 317, Map 94 
Gaviao do Jiparara 75 
Gawanga see Kwanga 
Gawar-Bati 211, Map 53 
Gawwada 287 
Gaya 317, Map 86 
Gayabara 102, 118, Map 35 
Gayardilt 116, Map 35 
Gayiri 101, 117, Map 35 
Gayo 121, Map 43 
Gayon (Coyón) 63, Map 12
Gbagyi (Gwari-Matai) 317, Maps 96,
101
Gbaka-Manza 317, Map 89
Gban (Gagu) 317, Map 105
Gbanu 317, Maps 87, 89-90
Gbanzili Map 82
Gbanzili (Buraka) 317, Maps 82, 87,
89
Gbaya see Aja; see also Gbaya-Manza-
Ngbaka, Southwest, Gbaya
Gbaya-Dara [Kresh dialect] 284 
Gbaya-Gboko [Kresh dialect] 284 
Gbaya-Manza-Ngbaka 311, 317, Maps 82, 
87, 89-91, 95 
Central Maps 87, 89 
Central Gbaya 315, Map 87 
Core Map 87
Gbaya-Bokoto 317, Map 90 
East Map 87 
North Maps 90-91 
Gbaya Kara 317, Maps 90-91 
North-West Gbaya 324, Map 91 
Southwest Maps 87, 90-91 
Gbaya Map 87 
Gbaya Toongo 317, Maps 90-91 
Gbaya-Biyanda 317, Maps 87, 91 
Gbaya-Yangere 317, Map 87 
West Maps 87, 90
Gbaya-Ngombe 317, Map 87 
West Gbaya 328, Map 90 
Gbaya-Ndogo [Kresh dialect] 284 
Gbaya-Ngbongbo [Kresh dialect] 284 
Gbaya-Ngombe see Gbaya-Manza-
Ngbaka, West
Gbe 305, 331-2, Maps 102-103 
Gbede 317, Map 101 
Gbee (Gbii) 317, Map 106 
Gbende 317, Maps 82, 89 
Gbete 317, Maps 87, 91 
Gbeya 317, Maps 89-90 
Gbii see Gbee 
Gbwata 308, Maps 91, 94, 98 
Gbwata (Bata) 301, 317, Maps 91, 94, 98
Gcaleka 317, Map 74 
Gciriku (Dciriku) 317, Map 84 
Ge (Mina) [Kwa] 317, Map 102 
Ge [Yi] see Ayizi 
Ge'ez (Ethiopic) 287-8 
Geba [Karen] 174-5, Map 49 
Gebe 118, Map 37 
Gebelawin see Berta 
Gebeto [Berta dialect] 285, 290 
Gedaged see Bel 
Gedeo (Gede?o, Derasa, Darasa) 285, 
287, Map 72 
Geechee see Gullah 
Geedabuursi see af-Geedabuursi 
Geelvink Bay 118, Map 37 
Geeri see Poka-Kori-Wipsi-Geeri 
Ge’ez (Ethiopic) 287 
Geji Map 98 
Geji 317, Maps 97-98 
Gekhu (Gheko) 174-5, Map 49 
Geko, Gekho see Gekhu 
Gela-Guadalcanal 109, Map 29 
Gela 106, 109, Map 29 
Gelao Maps 45-46 
Gelao 181-2, Maps 45, 47 
Gele see Gulu 
Gelvaxdaxa 317, Map 94 
Geman, Geman Deng see Mijuish Mishmi, 
Miju 
Geme 317, Map 89 
Gemo dona see Gamo 
Gende (Bundi) 112, Map 33 
Gengle 317, Map 98 
Georgian 233-6, 238, Map 59 
Georgian [Aramaic subgroup] Map 70 
Gepo (Képu) 172, Map 46 
Gera 317, Maps 97-98 
dGergjas 164, Map 45 
Gerka Map 98 
Gerka (Yiwom) 317, Maps 96, 98 
Germanic 46, 259, Maps 36, 52, 58, 61-62, 
64-65, 67-69 
German 260-1, Maps 58, 61-62, 
64-65, 67, 69 
abroad Maps 58, 61-62, 64, 67, 69 
in America 27, Maps 8-9 
Pennsylvania Dutch 27 
in Asia 236-7, Maps 52, 58 
in Australia Map 36 
in E. Europe 231-7 
Low 27, 261, Map 65 
Middle High 261, Map 65 
Swiss dialects see Alemannic 
Upper High 261, Map 65 
see also Alemannic; Bavaro-Austrian; 
Franconian; Saxon 
West see Afrikaans; Netherlandic 
(Dutch) 
sGerrtse [Khams Tibetan] 164 
rGerrtse [Khams Tibetan] 164 
Geruma 317, Maps 97-98 
Geser-Goram 115, 118, Maps 34, 37 
Gésinan see Harar(i) 
Geta (Gtah) 217, Maps 51-52 
Gey 317, Maps 91, 94 
Ghadamis Maps 71, 73
Ghale 164, 214, Map 51
Ghanongga 109, Map 29
Ghari 106
Gharti (Bujheli) 165, Map 51
Ghat Maps 71, 73
Gheg see Albanian
Gheko see Gekhu
Ghomala 317, Map 93
Ghomozo 172, 175, Map 46
Ghotuo 317, Map 101
Ghumara [Arabic subgroup] Map 73
Ghumara [Berber] Map 73
Giay see Yay 
Gichode 317, Maps 102-103 
Gichugu 317, Map 86 
Gidar see Ka'da 
Gidicho 288, 289, Map 72 
Gidole (Dirayta, Dirasta) 286 
Gidra (Jibu) 111, Map 32 
Giimbiyu 100, 116 
Gikuyu 311, 317, Maps 72, 85-86 
Gilaki 211, 216, Map 53 
Gilbertese 108, Map 28 
Northern 108, Map 28 
Gilyak see Nivkh

Giman 118, Map 37 

Gimbiyu Map 35 

Gimi 110, Map 32 

Gimine (Gimnime) 317, Maps 91, 98 
Gimira (Bench-She) 

Gimira (Bench) 288-9, Map 72 
Gimme 317, Maps 91, 98 
Gimnime see Gimine 
Gina Map 71 
Ginao 66, Map 12 
Gingwak 317, Maps 96-98 
Ginuman (Dime) 110, Map 31 
Gio see Dan [Mande] 

Gira 112, Map 33 

Girawa (Bagasin) 112, Map 33 

Giri (Kire) 113, Map 33 

Giriama 317, Map 85 

Gisu 317, 330, Map 86 

Gitksan 8, 19, Maps 6-8 

Gitua 111, 113, Maps 32-33 

Giya 101, 117, Map 35 

Giziga-Mofu (C) [S. Mafa-Matakam] 
Map 94 

Giziga North 317, Map 94 

Giziga South 317, Map 94 
Gizra (Toga) 111, Map 32 
Glavda 308 
Glebo 317, Map 105 
gli-Krongo see Krongo 
Glio-Ubi 317, Map 105 
Gnau 113, Map 33 
Goba (Mozambian Korekore) 317,
Map 77
Godie-Kuya Map 105 

Godie 317, Map 105 
Godoberi 233-4, 239, Map 59 
Godwari Map 54 
Goemai 317, Maps 96, 98 
Goeng-Goeng 102 

see also Daribalang 
Gofa 288, 289, Map 72 
Gogo 312, 317, Map 76 
Gogodala-Suki 99, 99, 111 

Gogodala 102, 105, 111, Map 32 
Gogot 288, Map 72 
Goidelic 259, Map 63 
Goilalan 99, 111, Map 32 
Gojjam 288 
Gojri see Gujari 
Gokana 317, Map 101 
Gola 332, Map 106 

Gola 317, Map 106 
Gold see Lamay; see also Hezhe 
Golden Palaung 176 
Goldi see Hezhe 
Goliath 114, Map 34 
Gollango 287 
Golo 317, Maps 79, 82 
Golok (mGolog) 164 
Gomara see Gumar 
Gombo [Gumuz dialect] 285 
Gombo (Négue) see Louisiana Creole 
Gondar-Bagemder 288 
Gond(h)la see Tinan 
Gondi 213, 217, Maps 51-52, 54 
Gondorobba [Arbore variety] 287 
Gong (Ugong) 171, 175, 181, Map 48 
Gonga (Kefoid) 288, 289, Map 72
Gongduk 165, 215, Map 51 
Gonge 317, Map 90 
Gonja [Kwa] 317, Maps 102-105 
Goreng 102, 117, Map 35 

Goreng-Goreng 102, 117, Map 35 
Gorkhali see Nepali 
Gorlos [Jerim subdialect] 191-2 
Gorokan 99, 110, 112, Maps 32-33 
Gorontalo 120, Map 39 

Gorontalo 120, Map 39 
Gorovu (Yerani) 113, Map 33 
Gorowa 287 
Gorum see Parengi 
Gosiagu Talk see Pidgin Dobu 
Gosiute [Shoshone] 22 
Göta Swedish Map 64
Gotlandic Swedish Map 64 
Goulou [Yue dialect] 160 
Goyi 317, Map 80 
Gozo Maltese 281, Maps 67, 73 
Grand Valley Dani Map 34 
Grass see Porapora 
Grassfields Maps 92-93, 95 

Narrow Grass 306, Maps 92-93, 95 

Wide Grass 306, 331, Map 95 
Great Dani 114 
Great Nicobar [Nicobarese dialect] 178 
Grebo 311, 315, Maps 105-106

Grebo 317, Maps 105-106 
Greek 233, 237, 259-60, Maps 60-61,
66-67

abroad 236-7, 261, 263 
in Australia Map 36 
in Russia 234-5 

Italiot 261 
Green Gelao see Hakhi 
Greenlandic (Inuit, Kalaallisut,
Kalaallit oqaasii) 7, 15, 263,
Maps 7-8

East 7, 15, 263 

North (Polar Eskimo) 15, 263 

West 7, 15, 263-4 
Grigra Map 2 
Groma (Tromowa) 164, 175, Map 45 
Gros Ventre (Atsina) 7, 15, Maps 5-9 

see also Hidatsa 
Grusi Maps 102-105 

East Maps 102-104 


North Maps 103-105 
West Maps 102-105 
Gtah see Geta 
Gat'ame see Kachama 
Gu'de 308, 317, Map 94 
Guachi (Wachi) see Chapakura 
Guachichil 53, Map 3 
Guadalcanal 109, Map 29 
West Guadalcanal 109, Map 29 
Guahibo see Wahivo 
Guaiaqui see Guayaki 
Guaicurian (Waikurian) 7-8, 15 
Guaicuri (Waykuri) 8, 15, Map 3 
Guaipuinavi see Puinave 
Guaja 74 
Guajajara see Tenetehara 
Guajibo see Wahivoan, Wahivo 
Guajiro see Wahiro 
Guale Map 2 
Guamaca see Wamaka 
Guambiano see Wambiano 
Guamo see Wamo 
Guang 307, 332, Maps 102-104 
Guang Proper Map 102 
North Maps 102-103, 105 
Mountain Map 103 
River Map 103 
South Map 102 
Coast Map 103 
Hill Map 103 
Guangfu (Guangzhou) [Yue dialect] 160 
Guarani 27, 74, Maps 18-22
Guarao see Warao 
Guarayú 74, Maps 20-21, 23
Guarequena see Warekena 
Guariba see Wariva 
Guarijio see Warijiyo 
Guasave Map 3 
Guató 73, Maps 18-19
Guatuso see Watuso-Wétar 
Guaviare see Waviare 
Guayabero see Wayavero 
Guayaki (Guaiaqui, Aché) 62, 74, Map 23 
Guayana see Wayana 
Guaymi see Mobe 
Guba 289 
Gubabuyngu (Dhuwal) 107 
Gubi 317, Maps 97-98 
Gudandji 116, Map 35 
Gudjal 101, 117, Map 35 
Gudu 317, Maps 91, 98 
Guduf 317, Map 94 
Guere Maps 105-106 
Guere (Wee) 301, 328, Maps 105-106 
Guéren 72, Map 18 
Guetar see Watuso-Wétar 
Gueve Maps 91, 94 
Gugadja 100, 116, Map 35 
Gugu Badhun 101, 117, Map 35 
Gugu Warra 101, 117, Map 35 
Gugu Yalandyi see Kuku Yalandji 
Guhayna (Juhayna) [Arabic subgroup] 
281, Maps 71-72 
Guhayna (Juhayna) Maps 70-72 
Guhu-Samane 111, Map 32 
|Gui (G|ui) Map 74
Guiarak 112, Map 33 
Guichong 170, 175, Map 45 
Gujarati 211-13, 214, 217, 259, Maps 52, 
54 
in Britain 263 
Gujari (Gujuri, Gojri) 211, Maps 53-54 
Guji Map 72 
Gujuri see Gujari 
Gula Map 89 
Gulai 317, Map 90 
Gullah (Sea Island Creole) 27, Maps 8-9 
Gulu (Gele, Fongoro) 317, Map 89 
Fongoro 285, Maps 71-72 
Guluwarin 116, Map 35 
Gumalu 112, Map 33 
Guméar (Gomara) 288, Map 72 
Gumasi see Mikarew 
Gumawana 110, Map 31 
Gumbaynggiric (Gumbaynggir-Yaygir) 
102, 118, Map 35 
Gumbaynggir 102, 118, Map 35 
Gumis 285, 290 
Gumuz [Komuz subgroup] 283, 285, 290, 
307, Map 72 
Gumuz (Ingafi baxa, Gumis, Ganza) 
285, 290, Map 72 
Gun [Gbe] 311 
Gunardba 116, Map 35 
Gundungura 102, 118, Map 35 
Gungabula 101, 117, Map 35 
Gungaragany 116, Map 35 
Gunggari 101, 117, Map 35 
Gungu 317, Map 79 
Guniyan 116, Map 35 
Gunnan-Gurage 288 
Gunwinyguan 100, 116, Map 35 
Gunwinygu 107, 116, Map 35 
Gunya 101, 117, Map 35
Gunymal see Koinjmal 
Guoyo see Taiwan Chinese 
Gur-Adamawa 305, Maps 79, 82, 87, 
89-91, 93-95, 98, 100-101, 107 
Gur 306, 332, Maps 100-104 
Central-North 305, Map 104 
Central-South Map 104 
West 332, Map 104 
Gur-Senufo 305, 332, Maps 104-105, 
107 
Gura [Gurage] 288 
Gurage 288, Map 72 
Eastern 288 




Northern 288 
Western 288 
Guragone 116, Map 35 
Gurani (Gurani, Hawrami) 211, 216, 277, 
Map 53 
Gurara Map 73 
Gure-Kahugu 317, Map 97 
Gurenne see Frafra-Nankani 
Gurindji 100, 116, Map 35 
Gurma 331, Maps 102-104 
Gurma (Nchema) 311, 317, 
Maps 102-104 
Gurmana 317, Map 101 
Gurnu 102, 118, Map 35 
Guro-Tura (Tura-Guro) Maps 105-106 
Guro 317, Map 105 
Gurung (Tamu) 164-5, 214, Map 51 
Guruntum 308, Maps 97-98 
Guruntum (Guruntum-Mbaaru) 317, 
Maps 97-98 
Gusan 110, Map 32 
Gusap see Wasembo 
Gusap-Mot (Gusapmot) 110, 112, Map 32 
Gusii 311, 317, Map 86 
Gusilay 317, Map 108 
Gusu 317, Map 97 
Gutob-Remo-Geta Maps 51-52 
Gutob (Gadaba) 217, Maps 51-52 
Guugu Yimidhirr 101, 117, Map 35 
Guwa 101, 117, Map 35 
Guwamu 101, 117, Map 35 
Guwar 102, 118, Map 35 
Guwidj 116, Map 35 
Guyanais [creole French] 27, Map 13 
Guyanese [creole English] 26, 27, Map 13 
Gvnong 169, 175 
Gvoko see Ngweshe 
Gwa [Bantu: C90] 317, Map 81 
Gwa [Jarawan] 317, Map 97 
Gwa [Kwa] 317, Maps 102-103 
Gwandara-Gari (Gwandara-Gwari) 317, 
Maps 95-96, 101 
Gwano 317, Map 85 
Gwari-Matai see Gbagyi 
Gwari-Yama 317, Maps 97, 101 
Gwe 317, Map 86 
Gwedena (Gwede) 110, Map 31 
Gwembe 317, Maps 74, 77 
Gweno 317, Maps 85-86 
Gwere 317, Map 86 
Gwich'in (Kutchin) 8, 15, Maps 7-9 
Tukudh (Eastern Gwich'in) 15 
Gwom (Gwomun, Gwomum) 317, 
Map 98 
Gworam 317, Map 96 
Gwurang Map 72 
rGyalrtse (Gyantse) 164 
rGyalthang 164 
Gyando 317, Maps 82, 86 
Gyantse see rGyalrtse 
rGyarung see Jiarong 
Gyasumdo 164, Map 51 
Gyem 317, Map 97 
Gyeto (Gyeta) 288 
Gyong see Kagoma 


+H0a see Huang, Eastern 

Ha [Bantu] 312, 317, Maps 78, 80, 86

Ha [Li subvariety] 179 

Ha Ha Hae see Patashó-Hanhanhain

Habaab 287 

Habbaniya Maps 71-72 

Habu 119, Map 38 

Habyot Map 70 

Hadad 317, Map 99 

Hadarib Map 72 

Hadendiwa [Beja dialect] 286 

Hadidin 279, Map 70 

Hadiyya 285, 287, Map 72 

Hadrami [Yemeni dialects] 280, Map 70 

Hadu [Saho dialect] 286 

Hadza 307-8, 317, Map 86 

Hagen 102, 110, 112, Maps 32-33 

Hahon 108, Map 29 

Hai-||'om (San) 317, Maps 74, 84

Haida 7-9, 15, Maps 6-9 

Hail Map 70 

Hailar [Daur dialect] 192 

Haimi 167, 175, Map 49 

Hainan see Chinese (Sinitic), Southern, 
Hokkien (Min) 

Haisla 8, 15, Maps 6-9 

Haitian Creole 26, 27, Map 9

Hajong 166, Map 51 

Haka (Hakha) [Kuki-Chin] 168 

Hakaru-Kauki (Jaqaru) 70, Map 20 

Hakhi (Green Gelao) [Gelao variety] 
182, 184 

Haki (Jaqui) 70 

Hakka see Chinese (Sinitic), Hakka (Kejia) 

Halabi see Halbi 

Halam 166, 213 

Halang 177-8, Map 48 

Halang Doan see Duan 

Halbi (Halabi) 213, Maps 51-52, 54 

Halh see Khalkh(a) 

Halhal [Bilin dialect] 286 

Halia 108, Map 29 

Haliti see Paresi 

Halkomelem 8, 15, Maps 6, 8-9 

Hallam see Langrong 

Halmahera 118, Map 37 

North 118 
South 98, 100, 114-15, 118, Maps 34, 37 

Halung 164, Map 51 

Ham (Hyam, Jaba) [Central Nigerian] 
317, Maps 96-97 


Ham [N. New Guinea] 113, Map 33 
Hamar Maps 71-72 
Hamba Sambala 317, Map 80 
Hamer 288, 289, 317, Maps 72, 86 
Hamid Maps 71-72 
Hamito-Semitic 277 
see also Berber, Coptic, Cushitic, 
Egyptian; Semitic 
Hamtai (Kapau) 111, Map 32 
Han (Han) [Athabaskan] 8, 15, 
Maps 7-9 
Han [Chinese] see Chinese (Sinitic), 
Northern, South-western (Han) 
Handa [Bantu: R10] 317, Map 84
Handa [Waja] 317, Map 98 
Handuri Map 54 
Hanga (Yaka) 317, Map 80 
Hanga-Kamara 317, Maps 103-104 
Hangaza 317, Map 80 
Hanhanhain see Patasho-Hanhanhain 
Hani 173, 175, Maps 46-47 
Hanis (Coos) 8, 15, Map 4 
see also Miluk 
Hano see Tano; Tewa 
Hanunoo 119, Map 40 
Hanya 317, Map 84 
Hao-Bai see Baihong; Haoni 
Haoni 173, 175, Map 46
Harakmbut see Wachipaéri 
Harar(i) 286-8, Map 72 
Harauti 213 
Harawa Map 73 
Harb Map 70 
Hare (North Slavey) 8, 15, Maps 7-9 
Haroi 182, 184, Map 48 
Harqin [Ju Ud subdialect] 191
see also Kharachin; Khorchin 
Harso 287 
Harsusi 281-2, 287, Map 70
Hartevan 278, Map 70 
Haruku 118, Map 37 
Haruro see Kachama 
Haryanvi 214, Map 54 
Harzani (Taleshi) 211 
Hassaniyya 281, Maps 71-72 
Hattam Map 34 
Hausa [Chadic subgroup] 309, 331, 
Maps 97-98, 100, 102 
Hausa 282, 301, 302, 308, 310, 312, 
Maps 71, 73, 97-98, 100-102 
as lingua franca 331 
Hausa-Gwandara [Chadic subgroup] 
Map 95 
Gwandara-Gari Map 95 
Haush 71, Map 22 
Havasupai see Yuman, Upland 
Havu 317, Map 80 
Haw Map 47 
Hawai'ian 26, 107, Map 26 
Hawaiian English (Hawai'i Creole) 26-7, 
106, Map 25 
Hawaiian Japanese Pidgin English 106, 
Map 25 
Hawaiian Pidgin 106 
Hawajap see Nocte 
Hawawir Map 72 
Hawazma Map 72 
Hawrami see Gurani 
Hawran 280, Map 70 
Haya-Jita Map 80 
Haya 312, 317, Maps 80, 86 
Hayu (Vayu) 165, 214, Map 51
Hazaragi 211, 216, Map 53 
Hebrew Map 70 
Israeli Hebrew 277-9, Map 70 
in Russia 234 
Hehe 317, Map 76 
Hehenawa see Kubewa 
Heiban 304 
Heihe [Daur subdialect] 192 
Heiltsuk-Oowekyala 8, 15, Maps 6-9 
Hejaz see Hijazi 
Hekaine 71, Map 22 
Hellenic 237, 259 
Helong 119, Map 38 
Helong 119, Map 38 
Hemba 317, Maps 77-78, 80 
North 324, Map 78 
South 327, Map 78 
Henghua [Hokkien dialect] 163 
Her 317, Map 108 
Herde see Zime 
Herero 317, Map 74 
Herero-Chimba see Chimba-Herero 
Hermit 109, Map 30 
Hevero (Jebero) 69, Maps 14-15 
Hewa (Sisimin, Umairof) 113, Map 33 
Hewadie see Katawishi 
Hexigten [Ulanqab subdialect] 191 
Heyo 113, Map 33 
Hezhe (Gold(i), Nanai, Nanay) 187, 234, 
238, Map 45-55 
Hezhou [mixed language] 160-1, 193 
Hianakoto (Hianacoto-Umaua, 
Jianacoto) 75, Map 13 
Hibito see Hivito 
Hidatsa 9, 15, Maps 5, 8-9 
Higaonon 119, Map 40 
Highland East Cushitic 285-6, 288 
Highland Kechua (Kichua, Quechua) 70, 
Map 15 
Highland Sokean Map 10 
Highland Totonako 51 
Higi Map 94 
Higi (Kamwe) 317, Map 94 
Higir [Nara dialect] 290 
Hijazi [Arabic subgroup] 279-80, Map 70 


Hijuk 317, Maps 92-93 
Hilali 281 
Hiligaynon (Ilonggo) 103, 119, Map 40 
Hill Miri 169, Map 51 
Hill Nubian Map 71 
Himalayan [Tibeto-Burman] 163, 165 
Central 163, 165-6, Map 51 
Himalayish 163, Maps 51, 54 
West 165 
see also Kiranti 
Himba [Hoanib: R30] Map 84 
Himba [Tsogo: B30] Maps 83, 88
Hindi 1, 212-14, 214, 217, 259, 
Maps 51-52, 54 
abroad 181, 216, 234, 263 
in Australia Map 36 
in S. Africa 310 
creolized 27 
dialect 215 
Hindko 211, Map 53 
Hindu (Hindustani) see Hindi 
Hinga 317, Map 84 
Hinihon 112, Map 33 
Hinuq 234, 239, Map 59 
Hiraharan 63, Map 12 
Jirajara (Hirahara) 63, Map 12 
Hirara see Betoi 
Hiri Motu see Police Motu 
Hiri trading languages 105, Map 25 
see also Koriki Hiri; Toaripi Hiri 
Hishkariana (Hyxkaryana) 76, Maps 17, 
19 
Hitchiti see Mikasuki 
Hittite 304 
Hitu 118, Map 37 
Hivaro-Kawapénan (Jivaro-Cahuapana) 
69, Maps 14-15 
Hivaro (Jivaro, Achuar-Shiwiar) 69, 
Maps 14-15 
Hivito (Jibito) 69, Map 14 
Hiw [N. Vanuatu] 108, Map 27 
Hiw [Wahivoan] see Wayavero 
Hiwi see Wahivoan, Wahivo 
Hixkaryana see Hishkariana 
Hkauri see Gauri 
Hlai see Li 
Hlengwe 317, Maps 74-75 
Hlersu (Sansu, Shansu) 172-3, 175, 
Map 46 
Hmar 168, 213, Map 51 
Hmau 183-4 
Hmo 183-4 
Hmong (Meo, Maew, Myaing) 183-4 
Hmong Daw (White Mong) 183 
Hmong Njua (Blue/Green Mong) 183 
Hmong-Mien see Miao-Yao
Hngizung 168 
Ho [Munda] 213, 217, Map 51
Ho-Chunk see Winnebago 
Hoanib Maps 74, 84 
Hoanya 182, 184 
Hoava 109, Map 29 
Hochank see Winnebago 
Hojai 166, Map 51 
Hokan 9, 46, 51-2, Maps 10-11 
Hokan-Siouan 9 
Hokchiu [Hokkien dialect] 163 
Hokka see Chinese (Sinitic), Hakka (Kejia) 
Hokkien see Chinese (Sinitic), Hokkien 
Hoklo [Chinese dialect] 162 
Holikachuk 8, 15, Maps 7-9 
Holoholo 317, 330, Maps 78, 80 
Holu 318, Map 84 
Hom 318, Map 93 
Homa 318, Maps 79, 89 
Hona see Hwana 
Hooz wandi see Hozo 
Hopi 9, 15, Maps 3, 5, 8-9 
Har see Tibeto-Burman 
Hor Mongol 192 
Horom 318, Map 96 
Horqin [Jerim subdialect] 191
Horu-Muthun see Wancho 
Hoshut see Khoshuud 
Hote 111, Map 32 
Hoti (Yuwana, Waruwaru) 77, 
Maps 12-13 
Hotogoyt see Khotgoid 
Houailou see Wailu 
Houma Map 2 
Hovongan 120, Map 41 
Hozo-Sezo 290 
Hozo (Hooz wandi) 288-9, Map 72 
see also Seze 
Hpun (Hpon, Hpén, Phun) 171, 175, 
Map 49 
Hre 177-8, Map 48 
Hruso Map 51 
Hrusso (Aka, Angka) 169 
Hsana Map 70 
Htangan 167, 175, Map 49 
Htin [Mon-Khmer] 181 
Hu see Phuman 
Huachi see Chapakura 
Huachipaeri see Wachipaéri 
Hualapai see Yuman, Upland 
Hualngo see Mizo 
Huamelulteco see Chontal 
Huanca see Wanca 
Huang, Eastern (+H6a) 316, Map 74 
Huánuco Kechua (Kichua, Quechua) 70,
Map 21 
Huasteco see Wasteko 
Huautla-Mazatlan Masateko 53 
Huave see Wave 
Huaylla see Yaqa 
Huchiti see Uchiti 


Huchnom [Yuki] 25 
Huehuetla Mecapalapa Tepewa 51 
Hui [Anhui] see Chinese (Sinitic), Central, 
Hui; see also Dunganese 
Hui-Yimin [Evenki dialect] 192 
Huichol (Wichol) 53, Maps 8-9 
Huite Map 3 
Huitoto see Murui-Witoto 
Hujariyya [Yemeni dialect] Map 70 
Hukwe (Kwengo) 329 
Huli 110, 112, Maps 32-33 
Hulun Buir [Daur subdialect] 192 
Hulung 118, Map 37 
Humbe 318, Map 84 
Humbu 318, Maps 81, 83 
Humburi Senni 318, 332, Map 104 
Humene 111, Map 32 
Humli Tamang 164, Maps 51, 54 
Humr Maps 71-72 
Hunan see Chinese (Sinitic), Central, 
Xiang 
Hunde 318, Map 80 
Hung (Khong Kheng) 177-8, Map 47 
Hungana Map 81 
Hungan(a) 318, Map 81 
Hungarian (Magyar) 231-4, 238, 259-61, 
Maps 58, 61-62, 65, 67 
dialects 260 
Hungu 318, Map 84 
Hunjara 110-11, Maps 31-32 
Hunnish, Eastern see Turkic, Northern 
Hunzib 234, 239, Map 59 
Huon 99, 110, Maps 32-33 
Eastern 110, Map 32 
Western 110, Map 32 
see also Finisterre-Huon 
Huon Gulf 111, 113, Maps 32-33 
North 111, Map 32 
South 111, Map 32 
Hupa 8, 15, Maps 4, 8-9 
Hupda 67, Maps 12-13, 16 
Huron-Wyandotte (Wendat) 9, 15, 
Maps 1, 7 
Hurutshe 318, Map 74 
Hutaym Map 70 
Huuchid [Shilingol subdialect] 191 
Huwaytat Map 70 
Huzhu [Monguor dialect] 191-2, 
Map 45 
Hwana (Hona) 318, Maps 91, 94, 98 
Hwei see Dunganese 
Hya 318, Map 94 
Hyam see Ham [Central Nigerian] 
Hyanzi 318, Map 80 
Hyxkaryana see Hishkariana 


I-Wak 119, Map 40 

Iaai 108, Map 27

Iambi 318, Map 86 

Ianomami see Yanomama 

Iatmul 113, Map 33 

Iatmul Pidgin (Iatmul Jargon) 105,
Map 25

Iaualapiti see Yawalapiti 

Iba 318, Map 87 

Ibami 318, Map 93 

Iban 103, 120, Map 41 

Ibanag 119, Map 40 

Ibani 318, Map 101 

Ibero-Romance 259, Maps 67-69 
in North America Maps 8-9 
Papiamentu [creole] 26, 27, 262,
Map 13

Ibeto 318, Maps 100-101 

Ibibio 318, Map 101 

Ibie 318, Map 101 

Iboko 318, Maps 82, 86 

Ibore see Arbore 

Ibuno 318, Maps 93, 101 

Ibuti 318, Map 79 

Ica see Ika 

Icelandic 259, 264, Map 64 

in North America Maps 8-9 

Icen 318, Map 95 

Iceve 318, Map 95 

Ida'an 121, Map 41 

Idaitsa 318, Map 102 

Idakho 318, Map 86 

Idate 119, Map 38 

Idi 111, Map 32 

Idne 109, Map 30 

Idomoid 305, 331, Maps 93, 95-96, 101 
Central Idoma 315, Map 101 
Idoma 312, 318, Maps 95, 101 
North Idoma 324, Maps 96, 101 
South Idoma 327, Map 101 
West Idoma 328, Map 101 

Idon(g) 318, Maps 96-97 

Idu (Chulikata) 169, 175, Maps 45, 51 

Iduh 176, 178, Map 47 

Iduna 110, Map 31 

Ifas (Bembi) 112, Map 33 

Ife 318, Map 101 

Ifira-Mele 108, Map 27 

Ifugao 119, Map 40 

Igala 312, 318, Maps 96, 101 

Igana 113, Map 33 

Igboid 331, Maps 95, 101 
Central Map 101 

Central Igbo 315, Map 101 

Plains Map 101 
Savannah Igbo 318, Maps 95, 101 
standard Igbo 303, 310, 312 

Igede 318, Map 95 

Igembe 318, Map 85 

Ignaciano 67, Map 21 

Ignacieño Map 3


Igoji 318, Maps 72, 85-86 
Igom 113, Map 33 
Iha 114, Map 34
Ihabe 318, Map 93
Ihatum 318, Map 95
Ije 318, Map 102
Ijesha 318, Map 101
Ijo-Congo 304
Ijoid 304-6, 331, Maps 101-102
Ijo 312, Map 101
Central Maps 101-102 
Central Ijo 315, Map 101 
East Map 101 
West Map 101 
Ijuma 318, Map 101
Ik (Teuso) 318, Map 86 
Ika (Ica, Bintukwa, Arwako) 64, 
Maps 12-13 
Ika [Igboid] 318, Map 101 
Ikale 318, Maps 101-102 
Ikito-Kawarano (Iquito-Cahuarano) 69, 
Maps 14-15 
Ikizu 318, Map 86 
Ikobi (Kasere) 111, Map 32 
Ikogo 318, Maps 80-81 
Ikom [Cross subgroup] Map 93 
Ikom 318, Map 93 
Ikom-Olulumo 318, 325, Map 95 
Ikoma [Bantu] 318, Map 86 
Ikpeshi 318, Map 101 
Iku-Gore-Ankwa 318, Maps 96-97
Ikulu 318, Maps 96-97 
Ikundun 111, Map 33 
Ikwere 318, Map 101 
Ila 318, Map 77 
Ilande 318, Map 78
Ilanum 119, Map 40 
Ilianen Manobo 119, Map 40 
Ilit [Kunama dialect] 290
Illanun Map 41 
Illinois (Miami-Illinois) 7, 18, Maps 1-2,
5 
Ilois see Chagos French Creole 
Ilokano (Ilocano) 27, 103, 119, Map 40
Ilonggo see Hiligaynon 
Ilongot 119, Map 40
Ilwana 318, Maps 72, 85 
Imbaya see Kara 
Imenti 318, Map 85 
Imoma 318, Map 81 
Imoma-Mpongo 318, Maps 80-81 
Imperial Kechua (Kichua, Quechua) 70, 
Maps 20-21, 23 
Inafosa see Tauya 
Inamari see Inapari 
Inamwanga 318, Maps 76-78 
Inanwatan 114 
Inanwatan 114, Map 34 
Inapari (Inamari) 67 
Inar Map 72 
Inari Saami see Saami 
Indagin 288, Map 72 
Indo-European 46, 231, Maps 52-53, 
60-62, 64-69 
Indo-Iranian 216, 231, 237, 
Maps 53-54, 57, 59-60 
Indo-Aryan (Indic) 213, 216-17, 
231, 237, Maps 51-54 
Indo-Hittite 231, 304 
Indo-Portuguese [creole] 214, Map 52 
Indonesian 97, 103, 114, 183, 262 
in Australia Map 36 
see also Malay 
Ineseño [Chumashan] 8, 15, Maps 3-4
Ingalik (Deg Hit'an, Deg Xinag) 8, 15-16, 
Maps 7-9 
Ingoafi baxa see Gumuz 
Ingessana see Tabi 
Ingrian 234, 238 
Ingush 234, 239, Map 59 
Inibaloi 119, Map 40 
Inland Gulf 99, 99, 111 
Innamor (Inor) 288, Map 72 
Innaqor 288 
Innu, Innu-Aimun see Montagnais; 
Naskapi, Eastern 
Inor see Innémor 
Insular Danish Map 64 
Interandine 63 
Interior Salish 8, Maps 4-9 
Intha 171, 175, Map 49 
Inuit (Eastern Eskimo) 7, 16, Maps 7-8
Inuktitut 7, 16, 26, 263, Maps 7-8 
Eastern Canadian 16 
Western Canadian 16 
see also Greenlandic 
Inupiaq [W. Eskimo] 7, 16, Maps 7-8 
Inyeri 66 
Iowa see Chiwere
Ipai (Northern Diegueño) 8, 14,
Maps 3-4, 8-9 
Ipeka-Kurripako Map 13 
Ipeka-Tapuia 66 
Ipewi (Kreen-Akarore) 72, Map 19 
Ipiko 111, Map 32 
Ipili 112, Map 33 
Ipulo 318, Map 95 
Iquito-Cahuarano see Ikito-Kawarano 
Irahutu 115, Map 34 
Iranian 211, 237, Maps 53, 57, 59-60
Iranshe (Iranxe) 74, Maps 18-19, 21 
‘Iraqi Qaltu Map 70
Iraqw (Mbulu) 287, 318, Map 86
Iraya 119, Map 40 
Iresim 115, Map 34 
Iria 114, Map 34 
Irigwe 318, Maps 96-97 
Irish (Erse, Gaeilge, Gaelic) 259, 263,
Map 63 
Iro Gula (Kulaal) 318, Map 89 
Irob [Saho dialect] 286 
Iroquoian 9, Maps 1-2, 7-9 
Lake 9 
Southern see Cherokee 
Irula 217, Map 52 
Irumu 110, Map 32
Isaaq see af-Isaaq 
Isabel 
East 107 
West 107 
Isabi (Maruhia) 112, Map 33 
Isan [Tai-Kadai] see Lao
Isan (Yupna) [Papuan] 110, 112, 
Maps 32-33 
Isanzu 318, Map 86 
Isconahua see Iskonawa 
Isebe 112, Map 33 
Ishang see Tamang 
Ishkashimi (Ishkashmi) 211, 216, 236-7, 
Maps 53, 57 
Isinai 119, Map 40 
Isirawa 99 
Iskateko (Ixcateco) 53 
Iskonawa (Isconahua) 71, Map 15 
Island Chon 71 
Island Chumash (Isleño, Cruzeño) 9, 12,
16, Maps 3-4 
Island Comox 13 
Isleta-Sandia [Southern Tiwa] 9, 22, 
Maps 5, 9 
Isnag 119, Map 40 
Isoko [Bantu] 318, Map 81 
Isoko [Edoid] 318, Map 101 
Israeli Hebrew Map 70 
Isthmus Sokean 51n, Map 10 
Istriot 260 
Istro-Rumanian 232, 237, 259-60 
Isu 318, Map 92 
Isukha (Kakamega) 318, Map 86 
Italic 237, 259 
Italo-Romance 259, Maps 61-62, 65, 
67, 69 
Italian 233, 259, 261-2, Maps 61-62, 
65, 67, 69 
in America 27 
in Australia Map 36 
in Yugoslavia and Russia 232-4, 
237 
see also Abruzzese; Calabrian; 
Campanian; Emilian; 
Latium; Ligurian; Lombard; 
Lucanian; Marches; 
Piedmontese; Pugliese; 
Sardinian; Sicilian; Tuscan; 
Umbrian; Venetian 
Italiot see Greek, Italiot 
Itawis 119, Map 40 
Itbayaten 119, Map 40 
Itelmen see Kamchadal 
Itene (More) 65, Map 21 
Iteri (Yinibu) 112, Map 33 
Itik 115, Map 34 
Itneg 119, Map 40 
Itogapuk 75, Maps 17, 21 
Itonama (Machoto) 64, Maps 17, 20-21 
Itsa 318, Map 102 
Itsekiri 318, Maps 101-102 
Itutang 113, Map 33 
Itza (Itzaj) 52, Maps 10-11 
Iu Mien [Yao variety] 183
see also Yao [Miao-Yao] 
Ivatan [Batanic] 119, 184, Map 40 
Ivatan [Philippine subgroup] Map 50 
Ivori 111, Map 32 
Ivorian 305 
Iwa 318, Maps 75-77 
Iwaidjan 100, 116, Map 35 
Iwaidja (Wargib) 116, Map 35 
Iwaidja 116, Map 35 
Iwam 113 
Iwam 113, Map 33 
Iwur 112, 114, Maps 33-34 
Ixilan 52, Maps 10-11 
Ixil 52, Maps 10-11 
Ixtenco Otomi 53, Map 10 
Iyive 318, Map 95
Iyoniyong 318, Maps 93, 101 
Iyora 102, 118, Map 35 
Izarek (Zarek) 318, Maps 96-97 
Izi 312, 318, Map 101 


Ja‘aliyyin see Ga‘aliyyin 
Jaba see Ham [Central Nigerian] 
Jabaal 284 
Jabal Nafusa Maps 71, 73
Jabutian 73, Maps 16-17, 20-21 
Jabuti (Yabuti) 73, Maps 16-17, 20-21 
Jacalteco see Popti 
Jad (Garhwal Bhotia, Jangali) 164, Map 54 
Jagoi 120, Map 41 
Jah Heét (Jah Het) 121, 178, Map 44 
Jaintia see Pnar 
Jaipuria see Nocte 
Jakalteko 45 
Jakarta Malay 121, Map 41 
Jaku 318, 331, Maps 96-98 
Jakun see Orang Asli 
Jalaid [Jerim subdialect] 191 
Jalapa Masateko 53 
Jalisco Otomi 53, Map 10 
Jalkuna 318, Maps 104-105 
Jama Mapun 119, Map 40 
Jamaican Creole 26 
British 263 


Jamamadi (Yamamadi) 67, Maps 16-17 
Jamatia 166 
Jamsay Tegu 318, Map 104 
Jan (Janjo) 318, Map 98 
Janadla Map 70 
Janet Maps 71, 73 
Jangali see Jad 
Janjero see Yemsa 
Janjo see Jan 
Japanese 179, 184-5, 194, Map 50
abroad 26-7, 234 
see also Ryukyuan 
Japanese Pidgin English 107, Map 25 
Japejap see Nocte 
Japreria 75, Maps 12-13 
Jaqaru see Hakaru-Kauki 
Jaqui see Haki 
Jara (Jera) 318, Map 98 
see also Jera 
Jarai 177, 182, 184, Map 48 
Jarawa [Andamanese] 217 
Jarawan [Bantoid subgroup] 306, 331, 
Maps 91, 96-98 
Jarawara (Jarauara) 67, Map 17 
Jarba (Djerba) 282, Maps 71, 73 
Jargon see Chinook Jargon 
Jarrakan 100, 116, Map 35 
Jaru 100, 116, Map 35 
Jarud [Jerim subdialect] 191 
Jaruro (Yaruro, Yuapin) 68, Maps 12-13 
Jasagtu [Jerim subdialect] 191 
Jatapu 212-13 
Jauja see Nyuga 
Jaúna (Yahuna) 68, Maps 12-13, 16
Jaunsari Map 54 
Java languages 121, Map 43 
Javaé 73, Maps 18-19 
Javanese 97, 103, 121, Maps 41-42 
creolized 27 
Jawe 108, Map 27 
Jawony 116, Map 35 
Je 72-3, Maps 18-19 
Jebel 283, 284, Map 72
Eastern 283 
Jebero see Hevero 
Jeddah Hijazi Map 70 
Jegu 318, Map 89 
Jeh 177-8, Maps 47-48 
Jehai (Jahai) 121, 178, Maps 44, 48 
Jeiké 73, Map 18 
Jelgolgi Ful 318, Map 104 
Jemez (Towa) 9, 16, Maps 3, 5, 8-9 
Jen Map 98 
Jera Maps 96-97 
Jera 318, Maps 96-97 
see also Jara 
Jere 318, Maps 96-97 
Jeral (Nhengatu) 74, Map 13 
Jerim [Inner Mongolian dialect] 191 
Jerung (Jeromala, Jero) 165, 214, Map 51
Jerusalem Map 70 
Jerwang see Zorwang 
Jeti see Manem 
Jetko 318, Map 99 
Jeto 318, Maps 82, 89 
Jhangar see Dhangar Kurux 
Jiamao [Li subvariety] 179 
Jiamu see Jin 
Jianacoto see Hianakoto 
Jiang-huai [Mandarin dialect] Map 45 
Jiaogongmin [Yao variety] 183 
Jiarong (rGyarung) 169-70, 175, Map 45 
Jibana (Dzihana) 318, Map 85 
Jibbali (Sheri, Shehri) 281, 287, Map 70 
Jibito see Hibito 
Jibu [E. Trans-Fly] see Gidra 
Jibu [Jukun] 318, Maps 91, 95 
Jicaque 52n 
Jicarilla 8, 16, Maps 3, 5, 8-9 
Jidda-Abu 318, Map 96 
Jiddah see Jeddah Hijazi 
Jiddu see af-Jiddu 
Jie 318, Map 86 
Jieng (Dinka) 284, 318, Maps 72, 79 
Jijeli [Arabic dialect] 281, Maps 71, 73 
Jiji 318, Maps 78, 80 
Jijili see Migili 
Jilim 112, Map 33 
Jimajima 110, Map 31 
Jimbin (Zumbun) 318, Maps 97-98 
Jimi [Chadic] 318, Maps 95, 97 
Jimi [Trans-New Guinea subgroup] 112, 
Map 33 
Jimjimen 318, Map 94 
Jin (Jiamu, Aicham) [Kam-Sui] 160, 179,
Map 45 
Jina Map 99 
Jina 318, Map 99 
Jing [Vietic] Map 45
Jinghpaw (Jingpo, Singpho, Kachin) 163, 
167, 175, Maps 46, 49, 51 
Jinghpaw (Kachin) 163, 166, 169, 180, 
Map 51 
Jingpo see Jinghpaw 
Jinja see Zinza 
Jinjero see Yemsa 
Jinmen [Yao variety] 183 
Jinuo 173, 175, Map 46
Jiongnai 183-4, Map 45 
Jipal 318, Map 96 
Jirajara see Hirahara 
Jirara see Betoi 
Jirel 164, 214, Map 51 
Jirru [E. Sudanic] see Doni 
Jiru (Kir) [Jukunoid] 318, Map 98 
Jita 318, Map 86 
Jiuzi see Nungish 


Jivaro-Cahuapana see Hivaro-Kawapanan 
Jiwarli 100, Map 35 
Jiwere see Chiwere 
Jiye 284, Map 72 
Jju see Kaje 
Jluko 318, Map 105 
Jo'é (Zo'é) 74, Map 13 
Joboka see Wancho 
Jogo-Ligbi Maps 104-105 
Jogo (Ligbi) 318, Maps 103, 105 
Jokoba see Wancho 
Jomvu 318, Map 85 
Jonam see Alur 
Jonkor 318, Map 89 
Jordanian Bedouin Map 70 
Jordanian settled dialects Map 70 
Jorto (Jortol) 318, Map 96 
Josephstaal 111, Map 33 
Josotu see Zost 
Jova Map 3 
Jru' see Loven 
Ju [Chadic] 318, Maps 97-98 
Ju (Zhu, North Khoisan) 307-8, 330, 
Maps 74, 84 
Ju Ud [Inner Mongolian dialect] 191 
Juaneño (Ajachemem) see Luiseño
Juang [Kwa] 318, Maps 102-103 
Juang [Munda] 213, 217, Map 51
Juarez Totonako 51 
Juba Arabic [pidgin] 282 
Judeo-German see Yiddish 
Judezmo see Ladino 
Juhayna see Guhayna 
Juk 177, Maps 47-48 
Jukuna (Yukuna) 66, Maps 13, 15 
Jukunoid 305, 306, 331, Maps 91, 95-96, 
98 
Jukun Maps 91, 95-96, 98 
Juma [Cariban] 76 
Juma [Tupi] 76 
Jumana 66, Maps 16-17 
Jumano see Suma-Jumano 
Jumaytepegue 54 
Jumeli Map 51 
Jumjum see Mabaan 
Junguru 318, Maps 79, 89 
Junin-Lima Kechua (Kichua, Quechua) 
70, Map 21 
Jur Lwo see Luo 
Jurchen 187, 189 
Juri-Tikuna 68, Maps 16-17 
Juri (Yuri) 68 
Jurruru 100, 116, Map 35 
Juruna (Xipaia) 75, Maps 18-19 
Jutiapa 54 
Jutlandic (Danish) Map 64 
Juwarliny 100, 116, Map 35 


K-dialects [Yemeni] Map 70 
Ka'da Maps 91, 94 
Ka'da (Gidar) 318, Maps 91, 94 
Ka-ngan see Yinbaw 
Ka-Togo 305, 332, Maps 102-103 
Kaado 318, 332, Map 104 
Kaana Masala see Masalit 
Kaang [Gur-Adamawa: Fali] 318, 
Map 91 
Kaang [Kuki-Chin] see M'kang 
Kaantju 101, 117, Map 35 
Kaba [Central Sudanic] 284, Map 90 
Kaba 318, Maps 89-90 
Kababish 281, Map 72 
Kabalai 318, Map 90 
Kabardian (Kabardo-Cherkessian, East 
Circassian) 234, 238, Map 59 
Kabare 318, Map 80 
Kabékar-Bribri 64, Maps 12-13 
Kabékar (Cabécar) 64, Maps 12-13 
Kabende 318, Map 77 
Kabeo see Pupeo 
Kabi 102, 117, Map 35
Kabi-Kabi 102, 117, Map 35 
Kabil (Chicomucelteco) 45, 50, 52, 
Map 10 
Kabishi (Kabixi, Cabichi) 65, 74 
Kabishiana 75 
Kabixi see Kabishi 
Kabiye [Gur: Kabre] 312 
Kabola 119, Map 38 
Kabras 318, Map 86 
Kabre (Kaliye) 318, Maps 102-104 
Kabui (Nruanghmei) 167-8, 213 
Kaburi 67-8, Maps 12, 16 
Kabuverdianu (Crioulo) 259, 262 
Kabye 302 
Kabylia (Kabyle) [Berber] 262, 282, 
Map 73 
Kacchi (Kachchhi) Map 54 
Kachad 164, Map 51 
Kachama (Got'sme, Haruro) 288, 289, 
Map 72 
Kachari 166 
Hills Kachari see Dimasa 
Plains Kachari see Boro 
Kachchhi see Kacchi 
Kachha Naga 167-8 
Kachin see Jinghpaw 
Kacicere see Kuchichere 
Kacipo-Balesi see Zilmamu-Olam 
Kaco' (Lamam) 177-8, Map 48 
Kadai see Tai-Kadai, Kadai 
Kadara 318, Maps 96-97, 101 
Kadaru-Debri 284, Map 71 
Kadaupuritana 66 
Kadazan, Eastern 121, Map 41 
Kadazan-Dusun 121, Map 41 
Kadiro [Moru dialect] 284 
Kadiwéu (Caduveo, Mbaya) 72,
Maps 18-20 
Kadu (Asa’, Kantu) [Sal] 167, 175, 
Map 49 
Kadu (gli-Krongo) [Nilo-Saharan] 304 
Kadugli-Krongo 283, 285, Maps 71-72 
Kadugli 285, Maps 71-72 
see also Krongo 
Kadu [Kordofanian] 307 
Kaduo see Khatu 
Kaeme, Upper 114, Map 34 
Kaeti 110, 114, Maps 32, 34 
Kafa(a) see Kefa 
Kafanchan 318, Maps 96-97 
Kafima 318, Map 84 
Kafiri see Nuristani 
Kafoa 119, Map 38 
Kag (bKag) [Tibetan] 164, Map 51 
Kagaba 64 
Kagate 164, 214 
Kagayanen 119, Map 40 
Kagbo 318, Map 105 
Kagoma (Gyong) 318, Maps 96-97 
Kagoro 318, Maps 96-97 
Kaguru 318, Map 76 
Kahayan 120, Map 41 
Kahe-Okuma 318, Map 86 
Kahita (Cahita) 54 
see also Mayo; Teweko; Yaqui 
Kahua 106, 109, Map 29 
Kaian 113, Map 33 
Kaibobo 118, Map 37 
Kaidipang 120, Map 39 
Kaiep (Samap) 113, Map 33 
Kaike (Tarali Kham) 164-5, 214, Map 51 
Kaili-Pamona 120, Map 39 
Kaili 120, Map 39 
Kainaa see Blackfoot 
Kainantu-Gorokan 99, 110, 112, 
Maps 32-33 
Kainantu 99, 110, 112, Map 33 
Kaingang (Caingangue, Kanhgag) 73, 
Maps 19, 22-23 
Kaingwa 74, Maps 19, 23 
Kainji 305-6, 331, Maps 96, 100-102 
East Maps 96, 101 
North-West Map 102 
West Maps 96, 101 
Kairi see Rumu 
Kairiru 113 
Kairiru 113, Map 33 
Kairouan [Arabic dialect] Maps 71, 73 
Kairui-Midiki 119, Map 38 
Kaiwa [Austronesian: Huon Gulf] 111, 
Map 32 
Kaiwai [Austronesian: Bomberai] 115,
Map 34 
Kaiwishana 66, Map 16 
Kajakse 282, 318, Map 89 
Kajaman-Sekapan 120, Map 41 
Kaje-Katab Maps 96-97 
Kaje (Jju) 318, Maps 96-97 
Kakaa (Kuri) 318, Maps 90, 99 
Kakabai 110, Map 31 
Kakabai 110, Map 31 
Kakamega see Isukha 
Kakanda 318, Map 101 
Kakaopera (Cacaopera) 65, Maps 10, 12 
Kakchikel see Kaqchikel 
Kakihum 318, Maps 100-101 
Kako Maps 87, 91 
Kako 318, Maps 87, 91 
Kakolo 318, Map 107 
Kakongo 318, Map 84 
Kakua (Cacua) [Puinavean] 68,
Maps 12-13, 17 
Kakwa [Nilotic] 318, Maps 79, 86 
Kala [Bantu] 318, Map 81 
Kala Lagaw Ya [Australian] 101, 108, 111, 
115, 117, Maps 32, 35 
Kalaaku 101, 117, Map 35 
Kalaallisut see Greenlandic; Inuit 
Kalaallit oqaasii see Greenlandic 
Kalabakan 121, Map 41 
Kalabari 318, Map 101 
Kalabra 115, Map 34 
Kalagan 119, Map 40 
Kalahari Maps 77, 84 
East Map 74 
West Maps 74, 84 
Kalam-Kobon 99, 112 
Kalam 112, Map 33 
Kalamianon 119, Map 40 
Kalanga [Lega-Kalanga: D20] Maps 78, 
80 
Kalanga [Shona: S10] 301, 311, Maps 74, 
77 
Kalaopera 61 
Kalaotoa 120, Map 39 
Kalapalo see Amonap 
Kalapuya(n) 7-8, 16, Maps 4, 6, 9 
Kalasha 211, Map 53 
Kalenjin [S. Nilotic] 307, 311
Kalhiphona 66, Map 12 
Kali 318, Map 91 
Kaliana (Kariana, Sapé) 68, Maps 12-13 
Kalindi 318, Map 85 
Kalinga 119, Map 40 
Kalispel see Spokane-Kalispel-Flathead 
Kaliye see Kabre 
Kalkatungic 101, 117, Map 35 
Kalkatungu 101, 117, Map 35 
Kallahan (Kayapa) 119, Map 40 
Kalmyk (Kalmuck, Oirat) 190, 234-5, 
238-40, Maps 58-59 
see also ‘Sart Kalmuck’ 
Kalokalo 110, Map 31 


Kalou 113, Map 33 
Kaluli 110, Map 32 
Kalyokengnyu see Khyamnyungan 
Kam [Gur-Adamawa] Map 98 
Kam 318, Map 98 
Kam Myang see Yuan 
Kam-Tai 179, 184, Maps 44-46, 48, 51 
Kam-Sui (Dong) 179, 184, Map 45 
Kam (Dong) 179, 184, Map 45 
Tai (Thai) see Tai-Kadai, Tai 
Kamakanan 73, Map 18 
Kamakan 73, Map 18 
Kaman see Mijuish Mishmi, Miju 
Kaman (Daw) 68, Map 13 
Kamang 119, Map 38 
Kamano 110, 112, Maps 32-33 
Kamarian 118, Map 37 
Kamasa 111, Map 32 
Kamasau (Sagi) 113, Map 33 
Kamayo 119, Map 40 
Kamayura 62, 74, Maps 18-19
Kamazung 318, Map 95 
Kamba [Bantu: Kongo H10] 319, 
Maps 72, 83 
Kamba [Bantu: Thagicu] 311, 318, 330, 
Maps 85-86 
Kamba [Trans-New Guinea] 112, Map 33 
Kambaata 285, 287, Map 72 
Kambari Maps 97, 100-102 
Kambari 319, Maps 96-97 
South Kambari 327, Maps 101-102 
West Kambari 328, Maps 100, 102 
Kambe 319, Map 85 
Kamberatoro see Dera 
Kambot (Keram) 113, Map 33 
Kamchadal (Itelmen) 233-4, 239, Map 55 
Kamdang (Tulishi) 285, Maps 71-72 
Kamdang dialect 285 
Kamhau Chin (Tiddim Chin) 167-8 
Kami 319, Map 76 
Kamindjo see Morehead-Upper Maro, 
Morehead, Upper 
Kamo 319, Map 98 
Kamoro 114, Map 34 
Kampa 67, Maps 15-16, 20-21 
Kampanake see Atanke 
Kampong Baru 114, Map 34 
Kamsa (Camsa, Sibundoy, Coche) 65, 
Maps 12-13 
Kamu 116, Map 35 
Kamuku 319, Map 100 
Kamula 99, 99, 110, Map 32 
Kamviri 211 
Kamwe see Higi 
Kana 319, Map 101 
Kanaka English see Queensland 
Canefields English 
Kanakanabu (Kanakanavu) 182, 184, 
Map 50 
Kanakuru see Dera 
Kanam 319, Maps 96-98 
Kanamanti 67 
Kanamaré 67, Maps 16, 20 
Kanarese see Kannada 
Kanashi 165, Map 54 
Kanasi see Sona 
Kanauri 165, Map 54 
Kinnauri (Kanauri, Thebor) 165, 213, 
Map 54 
Kandas 109, Map 30 
Kande 319, Maps 87-88 
Kandia Map 53 
Kandoshi (Candoshi) 67, Maps 14-15 
Kanela 73 
Kanembu 311, 319, Map 99
Kanga 285 
Kangite see Apurina 
Kangjia 161, 191, 193, Map 45 
Kanhgag see Kaingang 
Kanichana (Canichana) 68, Maps 20-21 
Kaniet 109, Map 30 
Kaningara 113, Map 33 
Kaningi 319, Map 83 
Kaniyang 101, 117, Map 35 
Kanjobal see Qanjobal 
Kankanaey 119, Map 40 
Kankuama see Atanke 
Kannada (Canarese) 212-13, 214, 217, 
Map 52 
Kannauji Maps 51, 54 
Kanoé see Kapishana 
Kanowit 120, Map 41 
Kansa 9, 16, Maps 2, 5 
Kanswei Nsei Map 95 
Kantu [Mon-Khmer] 177-8, Maps 47-48 
Kantu [Tibeto-Burman] see Kadu 
Kanufi 319, Maps 96-97 
Kanufi-Nindem Map 96 
Kanum 111, 115, Maps 32, 34 
Kanuri 283, 307, 331, Map 99 
Kanuri 312, Maps 71, 90, 94, 98-100 
Kanuri-Kanembu 312 
Kanyara-Mantharta [Australian] 100, 
116, Map 35 
Kanyara 100, 116, Map 35 
Kanyari [Andes] 69, Map 14 
Kanyok 319, Map 78 
Kaonde Map 77 
Kaonde 319, Map 77 
Kapajap see Nocte 
Kapampangan 103, 119, Map 40 
Kapanawa (Capanahua) 70, Maps 14-16, 
20 
Kapau see Hamtai 
Kapin 111, Map 32 
Kapingamarangi 107-8, Maps 26, 28 
Kapishana (Kapixana, Kanoé) 63, Map 17 
Kapixana see Kapishana
Kapong 76, Maps 12-13 
Kapore [Austronesian: Vitiaz] 109, 
Map 30 
Kapori [Papuan: Kaure] 114, Map 34 
Kapriman 113, Map 33 
Kapuas (Ngaju) 120, Map 41 
Ngaju Dayak [lingua franca] 103-4, 
Map 25 
Kapuchi see Bezhta 
Kaqchikel (Cakchiquel) 45, 50, 52, 
Map 10 
Kara [Bantu] 319, Map 86 
Kara [Central Sudanic] 284, 307 
Kara (Karani, Caranqui, Imbaya) 
[Paes-Barbakóan] 64, Map 12
Kara [Meso-Melanesian] 109, Map 30 
Kara [Papuan] see Ngala 
Karaboro 319, Maps 104-105 
Karachay-Balkar 234-5, 238, Map 59 
Karagas see Tofa 
Karaim 232, 234, 238 
Karaja 73, Map 19 
Karaja-Shambioa (Xambioa) 73, 
Maps 18-19 
Karajarri 100, 116, Map 35 
Karakalpak 211, 216, 234, 236-8, 
Maps 53, 57 
Karamojong (Karimojong) 319, Maps 72, 
86 
Karang [Mbum subgroup] Map 91 
Karanga, Central [Bantu] 315, 
Maps 74-75 
Karanga [Shona] 301 
Karankawa 7, 9, 16, Map 2
Karanya 101, 117, Map 35
Karapana (Carapana) 68, Maps 12-13, 16 
Karas 114, Map 34 
Karata 233-4, 239, Map 59 
Karatti see Konso 
Karaw [Philippine] 119, Map 40 
Karawa 113, Map 33 
Karawari (Tabriak) 113, Map 33 
Karayu Map 72 
Karbi see Arleng 
Kardu 100, 116, Map 35 
Kare (Care, Kare Lenka) [Lenkan] 54, 
Map 10 
Kare [Gur-Adamawa] 319, Maps 89-90 
East Kare 316, Map 90 
West Kare 328, Map 90 
Kare [Trans-New Guinea] 112, Map 33
Karekare 319, Maps 98, 100 
Karelian 234, 238, 259, 263, Maps 58, 64 
Karen 159, 174-5, 181, Maps 47-49 
Black Karen see Rianglang 
Red Karen see Kayah 
Karendala 101, 117, Map 35
Karenni see Kayah 
Karfa 319, Map 96 
Kari 319, Map 79 
Kari'nya (Carib, Galibi) 75, Maps 12-13 
Kariai 66, Maps 12, 16 
Kariana see Kaliana 
Kariban (Cariban) 62, 75-6, Maps 16-20 
Karihona (Carijona) 75, Maps 12-13 
Karik see Karuk 
Karimojong see Karamojong 
Karingani (Taleshi) 211 
Karipuna (Caripuna) 71, Maps 16-17, 
20-21 
Kariri (Cariri) 73, Maps 18-19 
Karitiana 75, Maps 16-17, 20 
Kariya 319, Map 97 
Kariyarra 100, 116, Map 35 
Karkar [Kemant variety] 286 
Karkar [Papuan isolate] 99 
Karkin see Costanoan 
Karko 169 
Karlamay 101, 117, Map 35 
Karluk see Turkic, Eastern 
Karmarong 164, Maps 51, 54 
dKarmdzes 164 
Karnic 101, Map 35 
Central 101, Map 35 
Eastern 101, Map 35 
North-western 101, 117, Map 35 
South-western 101, Map 35 
Karo [Omotic] 288, 289, Map 72
Karo Batak 121, Map 43 
Karok see Karuk 
Karon [Atlantic] 319, Map 108 
Karon Dori 115, Map 34 
Karon Pantai 115, Map 34 
Karpong 166 
Karruwali 101, 117, Map 35 
Kartangaruru 100, 116, Map 35 
Kartutjarra 100, 116, Map 35 
Kartvelian Maps 59-60 
Karu (Karo) 66, Maps 12-13 
Karuk (Karok, Karik) 7-8, 16, 51,
Maps 4, 9 
Karútana (Carútana) Map 13
see also Baniwa 
Kasa 319, Map 108 
Kasaba Map 52 
Kasele 319, Map 102 
Kasem 319, Maps 103-104 
Kaseng 177-8, Maps 47-48 
Kasenga 319, Maps 78, 80 
Kasere see Ikobi 
Kasharari (Kaxarari) 70, Maps 14, 16-17, 
20-21 
Kashaya (Southwestern Pomo) 8, 16, 
Maps 4, 8-9 
Kashibo (Cashibo-Cacataibo) 70, 
Maps 14-16, 20-21 


Kashinawa (Cashinahua, Kaxinaua) 71, 
Maps 14-17, 20-21 
Kashmiri 211-13, 259, 263, Map 54 
Kashtawari Map 54 
Kashubian 231-2, 236 
Kashuyana 75 
Kashuyana-Warikyana (Kaxuiana) 75, 
Maps 12-13, 17-19 
Kasiguranin 119, Map 40 
Kasimbar (Tajio) 120, Map 39 
Kaska 8, 16, Maps 7-9 
Kaskan (Cazcan) 53-4, Map 10 
Kaskiha Map 19 
Kasong see Song 
Kasua 110, Map 32 
Kasunga 319, Map 108 
Kasuweri 114, Map 34 
Katab [Kaje-Katab subgroup] Map 96 
Katab 319, Map 96 
Katakáoan 70, Map 14
Katakao (Catacao) 70, Map 14 
Katang 177-8, Maps 47-48 
Katawishi (Katawixi, Hewadie) 68, 
Map 16 
Katawixi see Katawishi 
Katcha (Katla) 285 
Katchall [Nicobarese dialect] 178 
Kate 102, 105, 110, Map 32 
Katembri 73, Map 18 
Kath Bhotia see Lhomi 
Kathang 102, 118, Map 35 
Kathlamet see Cathlamet 
Kathmandu Newari 165 
Kathoriya Tharu Maps 51, 54 
Kathu 174-5, Map 45 
Kati [Indo-Iranian] 211, 216, Map 53 
Kati [Trans-New Guinea]
Northern Kati (Niinati) 112, 114, 
Maps 33-34 
Southern Kati (Metomka) 110, 112, 
114, Maps 32-34 
Katiati 111, Map 33 
Katingan 120, Map 41 
Katinja see Katinya 
Katinya (Katinja) 110, 112, Maps 32-33 
Katla 304 
Katla see Katcha 
Kato (Cahto) 8, 11, 16, Map 4 
Katu see Katuic, Katu 
Katua [Bahnaric] 177-8, Map 48 
Katuic 175, 177-8, Maps 47-48 
Katu 177-8, Map 48 
Katukina (Catuquina) [Puinavean] 68, 
Maps 16-17, 20 
Southern (Dyapa) 68, Maps 14-17 
Katukina (Wanninawa) [Panoan] 70, 
Maps 14-17 
Katukinarti [Tupi] Map 16 
Kaugat 114, Map 34 
Kaugian see Kogi 
Kaukombar 112, Map 33 
Kauma 319, Map 85 
Kaunga 113, Map 33 
Kaura (Kauru) [Central Nigerian 
subgroup] Maps 96-97, 101 
Kaure [Trans-New Guinea subgroup] 114 
Kaure 99, 114, Map 34 
Kauri see Gauri 
Kaurna 101, 117, Map 35 
Kauru see Kaura 
Kauwol 112, 114, Maps 33-34 
Kavalan 179, 182, 184, Map 50 
Kavango 319, Map 74 
Kavinenya (Cavineña) 71, Maps 16-17,
20-21 
Kaviyari (Cabiyari) 65, Map 13 
Kavu see Arapesh, Mountain 
Kaw see Kansa 
Kawacha see Kawatsa 
Kawahib 74 
Kawahla Maps 71-72 
Kawaiisu 8, 16, Maps 3-4, 8-9 
Kawapanan 69 
Kawatsa (Kawacha) 111, Map 32 
Kawe 115, 118, Maps 34, 37 
Kawéskar (Qawasqar, Kawesqar) 71, 
Map 23 
Kaxinaua see Kashinawa 
Kaxuiana see Kashuyana-Warikyana 
Kayabi (Parua) 74, Maps 18-19, 21 
Kayagar 99, 99, 114, Map 34 
Kaygir 114 
Kayah (Bwe) [Karen] 174-5, Map 49 
Kayan [Karen] 174-5, Map 49 
Kayan-Kenyah [Malayo-Polynesian] 103, 
120, Map 41 
Kayan Map 41 
Kayan Mahakam 120, Map 41 
Kayan River Kayan 120, Map 41 
Kenyah 120, Map 41 
Kayan River Kenyah 120, Map 41 
Kayapa (Cayapa, Chachi) [Barbakóan]
64, Maps 12-13 
Kayapa [Philippine] see Kallahan; Keley-I 
Kayapo 72, Maps 18-19, 22 
Kayaw see Bre 
Kayeli 118, Map 37 
Kaygir 114, Map 34 
Kayik 113, Map 33 
Kayla see Falasha 
Kayong 177-8, Map 48 
Kaytetye 101, 117, Map 35 
Kayupulau 115, Map 34 
Kayuvava (Cayubaba) 74, Maps 16, 
20-21 
Kazakh 186-8, 211, 216, 234-8, Maps 45, 
53, 56-59 


Kazhuo 172, 175, 191-2, Map 46 
Kaziba 319, Map 80 
Kazukuru 109, Map 29 
Kcho (Chinbok, Saingbaung Chin, Miitin, 
Ng'men) 168 
Keaka 319, Map 93 
South 327, Map 93 
West 328, Map 93 
Keapara 110-11, Maps 31-32 
Kebu [Central Sudanic] see Lendu 
Kebu-Animere [Kwa: Ka-Togo] Maps 
102-103 
Kebu 319, Maps 102-103 
Kechumaran (Quechumara) 70, 
Maps 13-15, 20-21 
Kechua (Quechua) 27, 62, 70, 
Maps 20-21 
Kichua (Kechua, Quichua) 70, 
Maps 13-15, 20-21 
and Media Lengua 27 
Kedang 119, Map 38 
Kefa (Kafaa) 285, 288-9, Map 72 
Kefoid see Gonga 
Kei-Fordata 115, 119, Maps 34, 38 
Keiga (Aigang) 285 
Kein see Bemal 
Kejia see Chinese (Sinitic), Hakka (Kejia) 
Kekchi see Qeqchi 
Kel [Bantu: Bushong] 319, Map 81 
Kela [Bantu: Mongo-Nkundo] 319, 
Maps 80-81 
Kela [Huon Gulf] 111, Map 32 
Kelabit 120, Map 41 
Kele [Bantu: Poko-Kele C60] 319,
Maps 79-80 
Kele-Mbete [Bantu: B20] Maps 81, 83, 
87-88 
Kele 319, Maps 83, 88 
Keley-I (Kallahan) 119, Map 40 
Kelinyau Kenyah 120, Map 41 
Kelo 284, Map 72 
Kelon 119, Map 38 
Kelwer 319, Map 81 
Kemak 119, Map 38 
Keman see Mijuish Mishmi 
Kemant 286, Map 72 
Kembayan 120, Map 41 
Kemtuk 115, Map 34 
Kenaboi 121, Map 44 
Kenati (Ganati, Aziana) 110, Map 32 
Kendem 319, Maps 93, 95 
Kenga [Sara-Bagirmi] 319, Map 90 
Kenge [Bantu: Kongo] 319, Map 83 
Kengin Gyee 319, Map 81 
Keningau Murut 121, Map 41 
Kensiu 121, 178, Maps 44, 48 
Batu' 178 
Pemsed 178 
Siong 178 
Kenswei Nsei 319, Maps 93, 95 
Kenta' (Kintaq) 178 
Kenta’ Nakil [Kensiu dialect] 178 
Kenuz-Dongola 283-4, Map 71 
Dongolawi 283-4, Map 71 
Kenuzi (Kenzi) 284 
Kenyah 120, Map 41 
Main Kenyah 120 
Western Kenyah 120, Map 41 
Kenyan Boni 287, 314, Map 85 
Kenyang 319, Map 95 
Lower 319, Map 93 
Upper 319, Map 93 
Kenyi 319, Map 86 
Kenzi see Kenuz-Dongola, Kenuzi 
Kepkiriwat Maps 16, 20 
Kepo' 118, Map 38 
Kera 319, Maps 90, 94 
Keram see Kambot 
Keramin 102, 118, Map 35 
Kereho-Uheng 120, Map 41 
Kerek 233-4, 239 
Keresan 7, 9, Maps 3, 5, 8-9
Rio Grande Keresan 9, 21, Maps 3, 5, 
8-9 
Kerewe [Bantu] 319, Map 86 
Kerewo [Trans-New Guinea] 111,
Map 32 
Keriaka 109, Map 29 
Kerita' Bong [Kensiu dialect] 178 
Kesawai (Namuya) 112, Map 33 
Kesherd (Kesherda-Daza), East 316, 
Map 99 
Koastanipna see Soddo 
Ket (Yenisey Ostyak) 231, 234, 239, 
Maps 55-56 
Ketangalan 182 
Kete 319, Maps 78, 81 
Ketengban 114, Map 34 
Ketere 319, Map 81 
Keuru (Belepa) 111, Map 32 
Kewa 110, Map 32 
Keweevkapaya see Yuman, Upland 
Kewieng 110, 112, Maps 32-33 
Keyauwee Map 2 
Keyo 319, Maps 72, 86 
Kgalagadi Map 74 
Kgatla 319, Map 74 
Khabiir Map 70 
Khadiya [Indo-Aryan] 214 
Khakas (Abakan Tatar, Yenisey Tatar) 
234, 238, Maps 55-56 
Khakas Tuva Map 45 
Khala Wa 176 
Khalaj 211, 216, Map 53
Khaling 165-6, 214, Map 51 
Khalkh(a) [Mongol] 188, 190, Map 45 
Kham [Himalayan] 164-5, Maps 46, 51 


IKham [Khoisan] Map 74 
Khami (Khumi) 168, 175, 215, Maps 49, 
51 
Khamnigan (Ziirchid) [Mongol] 188-9, 
Map 45 
see also Evenki (Ewenke) 
Khams [E. Tibetan] Map 45 
Khamtanga (Xamtana) 286, Map 72 
Khamta [dialect] 286 
Khamti 167, 180, 184, 217, Maps 49, 51 
Khamyang 180, 184, 217, Map 51 
Khan [Tibetan] 164, Maps 51, 54 
Khandeshi 213, Maps 52, 54 
Khang 176, 178, Map 47 
Khanty (Ostyak) 231, 234, 238, 
Maps 55-56, 58 
Khao 176, 178, Map 47 
see also Pasing 
Kharachin (Hargqin, Kharchin) 189, 191, 
Map 45 
Kharga (Kharga) 281, Map 71 
Kharia 213, 217, Map 51 
Khasian 176, 178, 217, Map 51 
Khasi(a) 168, 175-6, 178, 213, 215, 
217, Map 51 
Khatu (Kaduo) 174-5, Maps 46-47 
\Kh'au-ll'e (IIX'au-ll'e) Maps 74, 77 
Khawas see Bhujel 
Khayo 319, Map 86 
\IKhegwi Map 74 
Khengkha 165, 215, Map 51 
Kherwari 217, Maps 51-52 
Khesang 165, Map 51 
Khez(h)a 167-8, 213 
Khiemnungan 213 
Khinalug(h) 239, Map 59 
Khiri 319, Map 74 
Khisa see Komono 
Khmer 175, 177-8, Map 48 
Khmer 177-8, 181, 194, Map 48 
Khmer Padong 177 
Khmuic [Mon-Khmer subgroup] 175-6, 
178, Maps 46-47 
Khmu 176, 178, 181, Maps 46-47 
Khoirao 168 
Khoisan 301, 304, 307-8, 329-30, 
Maps 74, 76-77, 84, 86 
Khoe (Central Khoisan) 307-8, 
Maps 74, 77, 84 
Khoe 319, 329-30, Map 84 
Cape Khoe Map 74 
Khoekhoe, North Maps 74, 84 
North (Ju, Zhu) 307, Maps 74, 77 
South (!wi-Taa) 307-8, Map 74 
Khökh Nuur see Kök Nur
Khoksar 164 
EKhomani Map 74 
Khond see Kui 
Khong Kheng see Hung 
Khorasani Turkish (Quchani) 211, 
Map 53 
Khorchin (Harqin) 189, 191, Maps 45, 
50 
Khoshuud (Hoshut) 189, Map 45 
Khotan [Uighur dialect] 186 
Khotgoid (Hotogoyt) 188 
Khoton 188, Map 45 
Khowa see Bugun 
Khowar 211, Map 53 
Khozam 319, Maps 90, 99 
Khrihdu 164 
Khua 177-8, Map 47 
Khualsim 168 
Khufi [Roshani dialect] 237 
Khulung-Muthun see Wancho 
Khumi see Khami 
Khunsari see Kumzari 
Khutswe 319, Map 74 
Khizistén Map 70 
Khwarazmian 216 
Khwarshi 234, 239, Map 59 
Khyamnyungan (Kalyokengnyu, Nok-aw, 
Nauk-o, Ponyo, Para, Paya, 
Tukhemmi, Aoshedd) 167, 175, 
Maps 49, 51 
Khyang 168, 215, Map 51 
Khyn 180, 184, Maps 47, 49 
Ki see Amto 
Kia [Awyu-Dumut] 114, Map 34 
Kia [Santa Ysabel] 109, Map 29 
Kiai [N. Vanuatu] 108, Map 27 
Kiambu 319, Map 86 
Kibalo 319, Map 97 
Kibangu 319, Map 83 
Kibet 283, 319, Map 89 
Kichéan 49, 52, Maps 10-11
Kiché (Quiché) 45, 49-50, 52, 
Maps 10-11 
Kichua see Kechumaran, Kechua 
Kickapoo 7, 16, 22, Maps 1, 9
Kiga (Ciga) 319, Map 80 
Kikongo see Kongo (Kikongo) 
Kiksht 8, 16, Maps 4, 6, 8-9 
Kil [E. Mnong dialect] 177 
Kil [Koho dialect] 177 
Kilba 319, Maps 91, 94, 98 
Kilema see Ubu-Mbokomu-Moshi-Kirua- 
Kilema-Mwika 
Kilen 187, Map 45 
Kilenge 109, Map 30 
Kilenge 109, Map 30 
Kilimanjaro Maps 85-86 
Kilivila-Louisiades 110, Map 31 
Kilivila 110, Map 31 
Kilivila 110, Map 31 
Kiliwa 8, 16, Maps 3-4, 8-9 
Kilmeri 112, 114, Maps 33-34 


Kim Map 90 
Kim 319, Map 90 
Kim Mun [Yao variety] 183 
Kimaghama 114, Map 34 
Kimaragang 121, Map 41 
Kimbaya (Quimbaya) 63, Map 12 
Kimberley, Northern 116, Map 35 
Kimbu 319, Map 76 
Kimbundu [H20] 310, 313, 319, 330, 
Map 84 
Kimongo 319, Maps 83, 88 
Kimushungulu see Mushungulu 
Kinabatangan, Upper 121, Map 41 
Kinakomba 319, Map 85 
Kinalakna 110, 112, Maps 32-33 
Kinamigin 119, Map 40 
Kinaray-A 119, Map 40 
Kinda 319, Maps 87, 91 
Kinga 319, Maps 76-77 
Kingi-Ntansye 319, Map 81 
Kings River 25, Maps 4, 9 
Kingwana [Swahili variety] 303, 329 
Kinnauri see Kanauri, Kinnauri 
Kinome 114, Map 34 
Kintswen 319, Map 81 
Kinuku 319, Map 97 
Kinyarwanda 310 
Kiong-Korop Maps 93, 101 
Kiong 319, Maps 93, 101 
Kiowa (Plains) Apache 8, 16, Maps 2, 5, 
8-9 
Kiowa-Tanoan 7, 9, Maps 2-3, 5, 8-9
Kiowa 9, 16, Maps 2-3, 5, 8-9 
Kipchak see Turkic, Western 
Kipchak-Bulgar see Uralian 
Kipchak-Cuman see Ponto-Caspian 
Kipchak-Noghai see Turkic, Central 
Kipsigis 319, Map 86 
Kiput 120, Map 41 
Kir see Jiru 
Kir-Balar 319, Maps 97-98 
Kiramang 119, Map 38 
Kiranti (Kirat, Rai) [Himalayan 
subgroup] 163, 165-6, 214, Map 51 
Kiranti Limbu 166 
Kirat see Kiranti 
‘Kirdasga' 233n. 
Kire see Giri 
Kirfi 319, Map 98 
Kirghiz see Kyrgyz 
Kiria 319, Map 94 
Kirma-Tyurama Maps 104-105 
Kirma 319, Maps 104-105 
"Kirmanca'’ 233n. 
Kirmanjki 259, 263 
Kirua see Ubu-Mbokomu-Moshi-Kirua- 
Kilema-Mwika 
Kirundi see Rundi 
Kis 113, Map 33 
Kisa [Bantu] 319, Map 86 
Kisan (Kurux) [Dravidian] 212-14 
Kisan (Maithili) [Indo-Aryan] 212 
Kisar 119, Map 38 
Kisi [Bantu] 319, Map 76 
Kisi(i) [Atlantic subgroup] 332, Map 106 
Kisi 319, Map 106 
Kiswahili see Swahili 
Kitanemuk 9, 16, Maps 3-4 
Kitemo-Nape 65 
Kitimi (Tumi) 319, Maps 97, 101 
Kitjic 116 
Kitja 116, Map 35 
Kitsai 9, 16, Maps 2-3 
Kituba [pidginized lingua franca] 303, 
319, 330 
Kitui 319, Map 85 
Kiunthali (Mahasui) Map 54 
Kiwai-Porome 99, 99, 111, Map 32 
Kiwai 105, 111 
Bamu Kiwai 111, Map 32 
Daru Kiwai 105 
Island Kiwai 105 
North-Eastern Kiwai 111, Map 32 
Southern Coastal Kiwai 105, 111, 
Map 32 
Turama River Kiwai (Morigi) 111, 
Map 32 
Kiwo-Naruma-Vunjo 319, Map 86 
Kiyatjarra 100, 116, Map 35 
Klallam see Clallam 
Klamath-Modoc 8, 16-17, Maps 4, 8-9 
Klao 311, Map 106 
Klao 319, Map 106 
Kled [Che'wong dialect] 178 
Klias River Kadazan 121, Map 41 
Klor (Ngeh, Kriang) 177-8, Maps 47-48 
Ko 319, Map 104 
Koaiker (Awa-Coaiquer) 63, Maps 12-13 
Koalib Maps 71-72 
Koasati (Coushatta) 9-10, 17, 26, 
Maps 1-2, 8-9 
Koaya 74 
Kobi 319, Map 80 
Kobiana-Banyun Map 108 
Kobiana 319, Map 108 
Kobo 319, Maps 91, 98 
Kobo-Dii 319 
Kobon 112, Map 33 
Koch [W. Bodo-Garo] 166, 213, 215 
Kochila Tharu Map 51 
Koda (Kora) 213, Map 51 
Kodagu (Coorg) 213, 217, Map 52 
Kodhin see Thaminyi-Kod(h)in 
Kodi [Austronesian] 118, Map 38 
Kodin see Thaminyi-Kod(h)in 
Kodu [Koho dialect] 177 
Koenoem 319, Map 96 
Koeruna (Coeruna) 69, Maps 12-15
Kofa-Mogum 319, Map 89 
Kofan (Cofán) 68, Maps 12-13
Kofyar [Angas subgroup] Map 96 
Kofyar 319, Map 96 
Kogi (Coqui, Kaugian) 64, Maps 12-13 
Koguman 112, Map 33 
Kohi (Koi) [Himalayan] 165-6, Map 51 
Kohistani 211 
Kohlreng 168 
Kohno 319, Map 106 
Koho 177-8, Map 48 
Kohumono 319, Map 101 
Koi (Koyi) [Tibeto-Burman] 214 
Koiarian 99, 110-11, Maps 31-32 
Koiaric 110-11, Map 32 
Mountain Koiari 110-11, 
Maps 31-32 
Koihoma (Orejón, Koto, Coto) 68-9,
Maps 12, 14 
Koinjmal (Gunymal) 102, 117, Map 35 
Koints see Sunwar 
Koita 111, Map 32 
Koji 319, Maps 78, 80 
Koji-Budya 319, Map 78 
Kök Nur (Kok Nur, Khokh Nuur)
[Mongolian] 191, Map 45 
Kok-Nar [Pama-Nyungan] 101, 117, 
Map 35 
Koka (Coca) 53-4, Map 10 
Kokama 74 
Kokama-Kokamilya (Cocama- 
Cocamilla) 62, 74, Map 17
Kokborok (Tripuri, Mrung) 166, 213, 
Map 51 
Koke 319, Maps 89-90 
Koko-Bera 101, 117, Map 35 
Kokonuko (Coconuco) 63, Maps 12-13 
Kokota 109, Map 29 
Kol 99, 109, Map 30 
Kola [Austronesian: Maluku] 115, Map 34 
Kola Saami (Kola Lappish) Map 58 
Kolakngat 102, 118, Map 35 
Kolami 213, 217, Maps 52, 54 
Kolan (Colan) 70, Map 14 
Kolana 119, Map 38 
Kolchan (Upper Kuskokwim) 8, 24, 
Maps 7-9 
Kolena 319, Map 91 
Koli 211 
Kolibugan Subanon Map 40 
Kolo [Bantoid-Cross] 319, Map 101 
Kolo (Colo) [Lenkan] 54, Map 10 
Kolod 121, Map 41 
Koloi 166 
Kolokuma 319, Map 101 
Kolom 112, Map 33 
Kolopom 99, 99, 114 
Kolyma Yukaghir 239 
Kom [Grassfields] 319, Map 95 
Kom [Kuki-Chin-Naga] 168, 213, Map 51 
Koma [Komuz] see Koman, Komo 
Koma [Leko-Nimbari] 319, Maps 91, 98 
Koman [Komuz subgroup] 285, 307, 
Map 72 
Komo (Koma) 285, 290, Map 72 
Komba [Western Huon] 110, 112, 
Maps 32-33 
Kombe [Bantu] 319, Maps 88, 92 
Kombio 113, Map 33 
Kombio 113, Map 33 
Kome 319, Map 86 
Komering 121, Map 43 
Komerkrudo see Comecrudan; Yeméan 
Komi (Zyryan, Komi-Zyryan) 234, 
Maps 56, 58 
Komi-Permyak 234, Maps 56, 58 
Kominimung 113, Map 33 
Komo see Koman, Komo 
Komodo 118, Map 38 
Komono (Khisa) 319, Maps 104-105 
Komutu 110, 112, Maps 32-33 
Komuz 283, 288-9, Maps 71-72 
see also Gumuz; Koman 
Kona 319, Map 98 
Konai 110, 112, Map 32 
Konda [Bantu: Mongo-Nkundo C70] 319, 
Map 81 
Konda [Bantu: Tetela C70] 319, Maps 78, 
80 
Konda (Kubi) [Dravidian] 213, 217, 
Maps 51-52 
Konda [Trans-New Guinea] 114, Map 34 
Konda 114 
Kondekor Gadaba see Gadaba 
Kondh see Kui 
Koneraw 114, Map 34 
Kongo [Bantu: Mongo-Nkundo] 319, 
Map 81 
Kongo [Bantu subgroup] 310, 330, 
Maps 81, 83-84, 88 
Kongo 330, Maps 83-84 
Kikongo [lingua franca] 303, 310, 
311 
Kongola 319, Maps 78, 80 
Kongon see Konyak 
Koningkom 319, Maps 96-97 
Konja (Kwanja) 320, Map 95 
Konjo 319, Maps 79-80 
Konkani 212-13, 214, Map 52 
Konkomba 311, 319, Maps 102-104 
Konkow 8, 17, Maps 4, 8-9 
see also Maiduan 
Konni 319, Maps 103-104 
Kono [Central Nigerian] 319, Map 97 
Kono-Vai [Mande subgroup] Map 106 
Kono 319, Map 106 


Konomala 109, Map 30 
Konomihu [Shastan] 8, 17, 51, Map 4
Konongo 319, Map 76 
Konso-Gidole (Konsoid) 286 
Konso (afa Karatti) 286, 288, Map 72 
Konta 288, 289, Map 72 
Konua 99, 109, Map 29 
Kony 319, Map 86 
Konyagi 319, Maps 107-108 
Konyak (Tableng, Mulung, Kongon, 
Angwangku) 167, 213 
see also Naga, Northern 
Konyanka 319, Map 106 
Konzo Maps 79-80 
Koore nuuna, Koorete see Koyra 
Kootenai 7 
Kutenai (Ktunaxa) 9, 16, Maps 5-9 
Koozime 319, Map 87 
Kopar Map 33 
Kora [Austronesian] see Kodi 
Kora [Munda] 217 
Korafe 110, Map 31 
Koraga 217, Map 52 
Korak-Waskia 112 
Korak 112, Map 33 
Koran 50, 53 
Kora (Cora) 53-4, Maps 10-11 
Korana see !ora 
Korandebo 319, Map 104 
Korap 113, Maps 30, 33 
Korapun 114, Map 34 
Koraput 217, Maps 51-52 
Korat 180, 184, Map 48 
Kordofanian 304 
Kore see Koyra 
Korean 184-5, 194, 237, Maps 45, 50, 55
abroad 27, 234-6 
in Australia Map 36 
Koreguaje see Korewahe 
Korekore 319, Maps 74-75 
Mozambian see Goba 
Korewahe (Koreguaje, Caquetá) 68,
Maps 12-13 
Kori see Poka-Kori-Wipsi-Geeri 
Koriki see Purari 
Koriki Hiri [trading language] Map 25 
Koring-Kulele Map 101 
Koring 319, Map 101 
Koriok see Oxoryok 
Korku 213, 217, Maps 52, 54 
Kormakiti see Cypriot Arabic 
Koro [Central Nigerian] 319, 
Maps 96-97, 101 
Koro (Okro) [E. Admiralty] 109, Map 30 
Koro [N. Vanuatu] 108, Map 27 
Koroa Map 2 
Koroa see Akroa 
Korofa Maps 95-96 
Koroko see Valman 
Koronadal Blaan 119, Map 40 
Korop 319, Map 93 
Koropó 73, Maps 18, 22
Korwa 213, 217, Map 51 
Koryak 234, 239, Map 55 
Kosa 319, Map 77 
Kosare [Kaure] 114, Map 34 
Kosarek [Mek] 114, Map 34 
Koshin (Koskin) 319, Map 95 
Koskin see Koshin 
Kosorong 110, Map 32 
Kosraean 108, Map 28 
Kota [Bantu: Kele-Mbete B20] 319,
Maps 81-83, 87-88 
Kota [Bantu: Ngando C10] 319, Map 87 
Kota [Dravidian] 212, 217, Map 52
Koti 319, Map 75 
Koto (Coto, Koihoma, Orejón) 68-9,
Maps 12, 14 
Kotogiit 114, Map 34 
Kotoke (Cotoque) 52, Maps 10-11 
Kotoko [Chadic subgroup] Map 94 
Kotoko 319, Maps 90, 94, 99 
Kotokoli (Tem) 319, Maps 102-104 
Kotoname see Yué 
Kotopo 319, Map 95 
Kovai 112, Maps 30, 33 
Kove-Kaliai 109, Map 30 
Kow see Asas; Sinsauru 
Kowaki 112, Map 33 
Koya 213, Map 52 
Koyaima (Coyaima) 75 
Koyi [Tibeto-Burman] see Koi
Koyo [Bantu] 320, Maps 81, 83 
Koyo [Kru] 320, Map 105 
Koyra (Koore nuuna, Kore, Kwera, 
Amarro) 288, 289, Map 72 
Koyra Boro 320, Map 104 
Koyra Chiini 320, Map 104 
Koyta [Nara dialect] 290 
Koyukon-Ingalik 8, Maps 7-9 
Koyukon 8, 17, Maps 7-9 
Kpala see Kresh 
Kpalaga Maps 104-105 
Kpalaga (Palaka) 301, 320, 
Maps 104-105 
Kpan Map 95 
Kpan 320, Map 95 
Kpara see Kresh 
Kpasam 320, Map 98 
Kpatiri 320, Maps 82, 89 
Kpelle Map 106 
Kpelle 311, 320, Map 106 
Kposo-Ahlo-Bowiri Map 102 
Ahlo-Bowiri Map 103 
Krachi 320, Maps 102-103 
Kranh 320, Maps 105-106 
Kravet [Brao dialect] 177 


Kreda Aza 320, Map 99 
Kreen-Akarore see Ipewi 
Krenak 72, Maps 18-19 
Kréol 302 
Kresh-Aja 283-4, Maps 72, 79, 89 
Kresh (Kpara, Kpala) 284-5, Map 72 
Kpala 320, Map 82 
Kri 177-8 
Kriang see Klor 
Krikati-Timbira see Timbira 
Krim 310, 320, Map 106 
Krio [English creole] 308-9, 312, 320, 
332, Map 106 
Kriol [Portuguese creole] 103, Map 36 
Krisa 112, Map 33 
Krisa 112, Map 33 
Kristang [Portuguese creole] 103, 193 
Krobu 305, Map 105 
Krobu 320, Map 105 
Krongo 285, Maps 71-72 
gli-Krongo 304 
see also Kadu (gli-Krongo) 
Kru 305-6, 332, Maps 104-106 
Kru 320, Map 106 
Krung [Brao dialect] 177 
Krymchak 234, 238 
Kryz (Qriz, Dzhek) 235, 239, Map 59 
Ktum see Tum 
Ktunaxa (Kootenai, Kutenai) 9 
Kuang 320, Map 90 
Kuangsi 173, 175, Map 46 
Kuanua see Tolai 
Kuba [Bantu: Bushong C90] see Bushong 
(Kuba) 
Kuba [Bantu: Mbosi C30] 320, Maps 81, 
83 
Kube [Huon] 110, Map 32 
Kubewa (Cuveo, Pamiwa, Hehenawa) 68, 
Maps 12-13, 16 
Kubi [Chadic] 320, Maps 97-98 
Kubi [Dravidian] see Konda 
Kuchichere (Kacicere) 318, Maps 96-97 
Kucong see Lahlu 
Kuda-Zamani Map 97 
Kuda-Chamo 320, Map 97 
Kueretu (Cueret) 68, Maps 12-13, 16 
Kugama 320, Map 98 
Kugbo Map 101 
Kugbo 320, Map 101 
Kugu Nganhcara 101, 117, Map 35 
Kugu Mangk 101, 117, Map 35 
Kugu Mu'inh 101, 117, Map 35 
Kugu Muminh 101, 117, Map 35 
Kugu Ugbanh 101, 117, Map 35 
Kugu Uwanh 101, 117, Map 35 
Kugu Yi'anh see Kugu Nganhcara, Kugu 
Mangk 
Kui (Kondh, Khond) [Dravidian] 213, 
217, Maps 51-52 
Kui [Papuan] 119, Map 38 
Kuijau 121, Map 41 
Kuiku (Kuika) 63 
Kuikuro see Amonap 
Kuit Ambo Map 81 
Kuiva (Cuiba) 65, Maps 12-13 
Kujargé 282 
Kuka 320, Maps 90, 99 
Kukata 101, 117, Map 35 
Kukatj [S.W. Paman] 101, 117, Map 35 
Kukatja [W. Desert] 100, 116, Map 35 
Kukele (Ukele) 320, Map 95 
Kuki-Chin 163, 167-8, 175, Maps 49, 51 
Kuki 175, 213-14, Map 51 
Kuku [Bari dialect] 284 
Kuku [East Nilotic] 320, Maps 79, 86 
Kuku Jangkun [Kuku Yalanji dialect] 101, 
117, Map 35 
Kuku Yalanji (Kuku Yalandji, Gugu 
Yalandyi) 101, 103, 107, 117, Map 35 
Kukur [Tabi dialect] 284 
Kukura 73 
Kukuya [Austronesian] 110, Map 31 
Kukwa [Bantu: Teke] 320, Map 83 
Kukwaya [Gumuz dialect] 285 
Kulaal see Iro Gula 
Kulango Maps 103, 105 
Kulango 320, Maps 103, 105 
Kulango-Nkuareng 320, Maps 103, 
105 
Kulere 320, Map 96 
Kulesa 320, Map 85 
Kuliak 283, 306-7, 330 
Kuliak Map 86 
Kulin 102, 118, Map 35 
Kulina (Culina, Madiha) [Arawán] 67,
Maps 15-17 
Kulino [Panoan] 70, Maps 14-17, 20-21 
Kullo 288, 289, Map 72 
Kulon 182, 184 
Kului Map 54 
Kulumali 101, 117, Map 35 
Kulung [Himalayan] 165-6, 214, 
Map 51 
Kulung (Wurkum) [Bantoid] 320, 
Maps 96-98
Kulyi 69, Map 14 
Kumak 108, Map 27 
Kumal [Indo-Aryan] 214 
Kumam 320, Maps 72, 86 
Kuman 102 
Kumana (Chayma) 76, Map 12 
Kumandi 234, 238 
Kumarchen [Orogen dialect] 187 
Kumauni 213, Maps 51, 54 
Kumba 320, Map 98 
Kumeyaay see Diegueño
Kumil 112, Map 33 


Kumokio 110, 112, Maps 32-33 
Kumu 320, Maps 79-81 
Kumyk 234, 238, Map 59 
Kumzaari (Khunsari) 211, 216, Maps 53,
70 
Kuna (Cuna) 64, Maps 12-13 
Kunaama see Kunama 
Kunabeeb 320, Map 87 
Kunama (Kunama-llit) 283, 285, 289, 
307, Map 72 
Kunama (Kunaama, Baza, Barya) 283, 
290, Map 72 
Kunbarlang 116, Map 35 
Kunda [Bisa-Lamba M50] 320, Map 77 
see also Chikunda 
Kunda [Luba L30] 320, Maps 77-78 
Kunfal (Kunfal, AgiwP) 286, Map 72 
Kungarakany 100 
Kungatutji 101, 117, Map 35 
Kungkari 101 
Kuni 111, Map 32 
Kunimaipa 111, Map 32 
Kunini see Bide 
Kunkalanya 101, 117, Map 35 
Kunsa-Kapishandé 63, Maps 17, 20-22 
Kapishana Map 21 
Kunsa (Cunza) 63, Maps 20, 22 
Kunyi 320, Maps 83, 88 
Kuo (Ebkuo) [Grassfields] 316, Map 95 
Kuo [Mbum-Day] 320, Maps 90-91 
Kuome see Kwoma 
Kuot 99, 107, 109, Map 30 
Kupto 320, Maps 98, 100 
Kurada 109, Map 31 
Kurama 320, Maps 96-97 
Kuranko 320, Maps 106-107 
Kurdish 211, 216, 233-7, 277, 280, 
Maps 53, 59-60, 70 
Kuri [Chadic] see Kakaa 
Kuri [Pama-Nyungan subgroup] 102, 118, 
Map 35 
Kuri-Dou [Puinavean] 67, Maps 12, 16
Kuria 320, Map 86 
Kuria Muria 281 
Kurike 320, Map 106 
Kurmali [Indo-Aryan] 214 
Kurmanji 211, 216, 259, 261-3, Maps 53, 
60 
Kurrama 100, 116, Map 35 
Kurripako see Ipeka-Kurripako 
Kurrtjar 101, 117, Map 35 
Kurtépkha 165, 215, Map 51 
Kuruaya (Curuaia) 75, Maps 18-19 
Kurudu 115, Map 34 
Kurukh see Kurux 
Kurumba [Dravidian] 217, Map 52 
Kurumba [Gur] 320, Map 104 
Kuruti-Kele-Ere 109, Map 30 
Kurux (Kurukh, Oraon) 213, 217, 
Maps 51-52 
Kusa (Kusu) 320, Map 80 
Kusaal 311, 320, Maps 102-104 
Kusaghe 109, Map 29 
Kushi 320, Map 98 
Kuskokwim see Kolchan; see also Tanana 
Kustenau 66 
Kusu see Kusa 
Kusunda 214 
Kutang Ghale 164, Map 51 
Kutashó (Kutaxo) 73
Kutaxo see Kutashó
Kutchin see Gwich'in 
Kuteb see Kutep 
Kutenai (Kootenai, Ktunaxa) 9, 17, 
Maps 5-9 
Kutep (Kuteb) 320, Map 95 
Kuthant 101, 117, Map 35 
Kuti Ambo 320, Map 80 
Kutin Map 91 
Kutin 320, Maps 91, 98 
Kutu 320, Map 76 
Kutubu 
East 99, 99, 110, Map 32 
West 99, 99, 111, Map 32
Kuturmi 320, Maps 96-97 
Kuuk Thaayorre 101, 117, Map 35 
Kuuk Yak 101, 117, Map 35 
Kuuku Ya'u 101, 117, Map 35 
Kuumbere 320, Map 91 
Kuurn Kopan Noot (Warrnambool) 102, 
118, Map 35 
Kuvale 320, Map 84 
Kuvi 212, 217, Maps 51-52 
Kuwaa 320, Map 106 
Kuwait see Kuwayt 
Kuwani 115, Map 34 
Kuwarra 100, 116, Map 35 
Kuwayt (Kuwait) Map 70 
Kuy (Suay) 177-8, Maps 47-48 
Kuya 320, Map 105 
Kuyani 101, 117, Map 35 
Kuyonon 119, Map 40 
Kuzamani 320, Map 97 
Kwa' [Bantoid] 320, Map 93 
Kwa [Gur-Adamawa subgroup] 305-6, 
331-2, Maps 98, 102-105 
Kwa (Ba) 320, Map 98 
Kwaatay 320, Map 108 
Kwadi 307-8, 320, 330, Map 84 
Kwadia Map 105 
Kwadia 320, Map 105 
Kw'adza 287 
Kwaio 109, Map 29 
Kwak'wala see Kwakiutl 
Kwakiutl (Kwak'wala) 8, 17, Maps 6, 
8-9 
Kwakum 320, Map 87 


Kwala 320, Maps 81, 83 
Kwalean 99, 110-11, Map 32 
Kwale 110-11, Maps 31-32 
Kwalhioqua-Clatskanie 8, 17, Maps 4, 6 
Kwaluudhu 320, Map 84 
Kwama [Nilo-Saharan] 285, Map 72 
Kwambi 320, Map 84 
Kwame 320, Map 80 
Kwamera 108, Map 27 
Kwami 320, Maps 98, 100 
Kwandi-Kwangwa 320, Map 77 
Kwandu 320, Map 77 
Kwang-Kera Maps 90, 94 
Kwanga (Gawanga, Womsak) 113, 
Map 33 
Kwangalak 320, Map 96 
Kwangali 320, Maps 74, 84 
Kwange 320, Map 80 
Kwangwa Maps 74, 77, 84 
Kwanha [Mnong dialect] 177 
Kwanja see Konja 
Kwanka 320, Maps 96-97 
Kwanthar 101, 117, Map 35 
Kwanyama 320, Map 84 
Kwanyoka 320, Map 84 
Kwara'ae 102, 106, 109, Map 29 
Kware 110, Map 32 
Kwasengen 113, Map 33 
Kwasio 320, Maps 87-88, 92 
Kwato (Waupe) 112, Map 33 
Kwaya 320, Map 86 
Kwegu 284 
Kwena 320, Map 74 
Kwengo see Hukwe 
Kwera see Koyra 
Kwerba 115, Map 34 
Kwerba 115, Map 34 
Kweso 320, Map 87 
Kwesten 115, Map 34 
Kwetjman 101, 117, Map 35 
Kwezo 320, Map 81 
Kwikateko (Cuicateco) 53, Maps 10-11 
Kwinti Creole English Map 13 
Kwitatleko 54 
Kwitlateko (Cuitlateco) 50, Map 10 
Kwo (Meru) [Kilimanjaro] 320, Map 86 
see also Meru [Thagicu] 
Kwoireng see Liangmei 
Kwoma (Kuome, Washkuk) 113, Map 33 
Kwomtari-Leftmay 99, 112, Map 33 
Kwomtari 112, 115 
Kwomtari 99, 108, 112, Map 33 
Kwongai see Sawos 
Kxoe 320, Map 74 
Kyaw see Chawte 
sKyergumdo 164 
Kyibaku (Cibak) 320, Map 94 
sKyidgrong 164 
Kyrgyz (Kirghiz) 186-7, 211, 216, 234-6, 
238, Maps 45, 53, 57 


La'u [Ngwi] see Lang'e 
La Wa 176 
Laadi see Ladi 
Labi 320, Map 90 
Labu (Laua) [S.E. Papuan] 110-11, 
Maps 31-32 
Lac [Koho dialect] 177 
Lacandon see Lakantin 
Lachi 181, 184, Map 47 
see also Laji 
Lachik see Lashi 
Ladakhi 164, 213, Map 54 
Ladi (Laadi) 320, Map 83 
Ladin [Rhaeto-Romance] 259, 261, 
Maps 65, 67 
Ladino (Judezmo, Judeo-Spanish) 233, 
237, 260 
Lae 111, Map 32 
Laeko-Libuat 113, Map 33 
Laemae (Lama) 170, 175, Map 46 
Laganyan 115, 118, Maps 34, 37 
Laghu [Santa Ysabel] 109, Map 29 
Laghuu (Xapho) [Ngwi] 174-5, Map 47 
Laguna (Acoma-Laguna) 9-10, Maps 3, 5 
Lagwan (Logone) Maps 90, 94, 99 
Laha 181-2, 184, Map 47 
Lahanan 120, Map 41 
Lahauli 213 
Lahlu (Kucong, Cosung) 173, 175, 
Maps 46-47 
Lahnda 213, Maps 53-54 
Lahoid 173, 175, Map 46 
Lahu 173, 175-6, 180-1, Maps 46-47, 
49 
Lahuli 164-5, Map 54 
Lai see Paliu 
Lailen 168 
Laimon 13, Map 3 
Laitong 166 
Laizo (Zahao) 168, 175, Map 51 
Laji 182, 184, Map 45 
see also Lachi 
Lajia (Lakkia) 179, 184, Map 45 
Lak-Dargi 239, Map 59 
Lak 234-5, 239, Map 59 
Laka [Mbum: Karang] 320, Maps 91, 98 
Laka [Ngwi] see Aluo 
Laka [Sara-Bagirmi] 320, Map 90 
Laka-Kaba 320, Map 90 
Laka-Mbum 320, Map 90 
Lakalei 119, Map 38 
Lakamad'i [Moru dialect] 284 
Lakantún (Lacandón) 52, Map 10 
Lake Iroquoian 9 
Lake Miwok see Miwok, Western 
Lake, Upper [Kainji subgroup] Map 102 
Lakes District see Babine 
Lakes Plain 99, 108, 114-15, Map 34 
Central 115 
East 115 
Lakha see Mara (Lakher) 
Lakkia see Lajia 
Lakona 108, Map 27 
Lakota [Kru] 320, Map 105 
Lakota [Siouan-Catawban] see Sioux 
Lala [Austronesian] 111, Map 32 
Lala [Bantu] 320, Map 77 
Lalaki 120, Map 39 
Lalana Chinanteko 53 
Lali 320, Map 83 
Laloid 175, Map 46 
Lalo 172-3, 175, Map 46 
Lalok see Bom 
Lalung (Tiwa) 166, 213, Map 51 
Laluo see Naluo 
Lama see Laemae 
Lamaholot 119, Map 38 
Lamalamic 101, 117, Map 35 
Lama-Lama 101, 117, Map 35 
Lamang 308 
North 320, Map 94 
South 320, Map 94 
Lamba [Bantu: M50] 312, 320, Map 77 
Lamba [Gur] 312, 320, Maps 102-104 
Lambaya 118, Map 38 
Lambayeque Kechua (Kichua, Quechua) 
70 
Lambia 320, Maps 75-77 
Lambichong (Mugali) 166 
Lame [Bantoid] 320, Map 97 
Lame [Chadic] 320, Map 90 
Lamen 108, Map 27 
Lamet 176, 178, Map 47 
Lamgang 168 
Lamista Kechua (Kichua, Quechua) 70, 
Map 15 
Lamja 320, Maps 91, 98 
Lamma 119, Map 38 
Lamnso' 320, Maps 93, 95 
Lamogai 109, Map 30 
Lampung 121 
Lampung 121, Map 43 
Lamu 172-3, 175, Map 46 
Lamut see Even 
Landuma 320, Map 107 
Lang’ 320, Map 93 
Lang'e (La'u) 173, 175, Map 46 
Langa 320, Maps 80-81 
Langalanga 109, Map 29 
Langam 113, Map 33 
Langbasi 320, Maps 79, 82, 89 
Lango [Central Nilotic: Lotuko dialect] see 
Logiri 
Lango [West Nilotic] 312, 320, Maps 72, 
86 
Langrong (Rangkhol, Hallam) 168, 
Map 51 
Langsu see Maru 
Langtang 164, Map 51 
Langue d’Oil see French, Langue d’Oil 
Languedocien see French, Langue d’Oc 
Lanoh 121, 178, Map 44 
Lao (Isan) [Tai-Kadai] 179-81, 184, 194, 
Maps 47-48 
Laomian 174-5, 180, Maps 46, 49 
Laopan 174 
Lapachu 67, Map 20 
Lappic 259 
Lappish see Saami 
Lapuyan Subanun Map 40 
Laqua see Pupeo 
Lara’ 120, Map 41 
Laragiyan 116, Map 35 
Laragiya 116, Map 35 
Larantuka 119, Map 38 
Lardil 116, Map 35 
Larevat 108, Map 27 
Larike-Wakasihu 118, Map 37 
Laro Map 72 
Larrakiyan 100 
Larteh 320, Maps 102-103 
Laru 320, Maps 101-102 
Lashi (Achik Garo, Lachik, Leqi) 
[Burmish] 166-7, 171, 175, Maps 46, 
49 
Lassik see Eel River Athabaskan 
Lasta-Agaw [Khamtanga dialect] 286 
Latgalian [Latvian variety] 232 
Latha see Zayein 
Lati see Lachi; Laji 
Latin 261 
Latium (Italian) 261, Map 67 
Latvian 232, 234, 236-7, Maps 58, 62 
Lau [Jieng dialect] 284 
Lau [Solomonic Chain] 109, Map 29 
Laua see Labu 
Laurentian 9, 17, Maps 1, 7 
Lautu see Lawtu 
Lavatbura-Lamusong 109, Map 30 
Lave see Brao 
Lavongai-Nalik 109, Map 30 
Lavongai 109, Map 30 
Lavrung (S. Jiarong) 170 
Lavua 176, 178, Map 47 
Lavukaleve 99, 109, Map 29 
Lawa [Mon-Khmer] 181 
Lawngwaw see Maru 
Lawseng 174 
Lawtu (Lautu) 168 
Layolo 120, Map 39 
Laz (Chan) 233-5, 238, Maps 59-60 
Lazujap see Nocte 
Lebanese see Arabic 


Lebanon dialects 
North Map 70 
North-Central Map 70 
South Map 70 
South-Central Map 70 
Leco see Leko 
Leemo [Hadiyya dialect] 287 
Leeward Creole (English) 26, 27 
Lefa' 320, Map 93 
Leftmay 
Left May 99 
see also Arai; Kwomtari-Leftmay 
Lega-Kalanga Maps 78-80 
Lega 320, Map 80 
Legbo Map 101 
Legbo 320, Maps 95, 101 
Lehali 108, Map 27 
Lehalurup 108, Map 27 
Lehar 320, Map 108 
Leipon 109, Map 30 
Leisan 320, Map 85 
Leke 320, Map 87 
Lekeh [Karen] 174 
Leko (Leco) [Katakáoan] 70, Maps 20-21 
Leko (Samba Leko) [Dakoid] Map 98 
Leko-Nimbari [Gur-Adamawa] 305, 
Maps 90, 93, 95, 98 
Leko Map 98 
Leko (Samba) 326, Maps 91, 95, 98 
see also Daga-Nyonga 
Lekongo 320, Map 93 
Leku 320, Maps 82, 86 
Lela 320, Map 100 
Lelak 120, Map 41 
Lelau 320, Map 98 
Lele [Bantu: Bushong C90] 320, Map 81 
Lele [Bantu: Kuba] 302 
Lele [Chadic] 302, 320, Map 90 
Lele [Gur] 302 
Lele [Mande] 302, 320, Map 106 
Lelemi-Akpafu 305, Maps 102-103 
Lelemi (Bwem) 320, Maps 102-103 
Lembena 112, Map 33 
Lemérig 108, Map 27 
Lemio 112, Map 33 
Lemoro 320, Map 97 
Lempu 320, Map 84 
Lenakel 108, Map 27 
Lenape see Delaware; Unami 
Lenca see Lenka 
Lendamboi 108, Map 27 
Lendo 320, Map 81 
Lendu [Central Sudanic] 283, 307, Map 79 
Lendu (Kebu) 320, Map 79 
see also Baledha; Kebu [Kwa: 
Ka-Togo] 
Lenge [Bantu: Bobangi-Ntomba C40] 
320, Map 82 
Lenge [Bantu: Maka-Njem A80] 320, 
Map 88 
Lenge [Bantu: Umbundu R10] 320, 
Map 84 
Lengilu 120, Map 41 
Lengo 109, Map 29 
Lengola 320, Maps 79-81 
Lengua (Enxet) 72, Map 23 
see also Media Lengua 
Lenje-Tonga Maps 74, 77 
Lenje 320, Map 77 
Lenkan 50, 54, 61 
Lenka (Lenca) 54, Maps 10-11 
Lenkau 109, Map 30 
Lenyima 321, Maps 93, 95 
Leonese 262, Map 68 
Leonhard Schultze 113, Map 33 
Lepcha 163, Map 51 
Lepcha 167-9, 213-15, Map 51 
Leqi see Lashi 
Lese 321, Map 79 
Lesser Antillean Creole French 26, 27 
Leti 321, Maps 92-93 
Letri Lgona 119, Map 38 
Lettish see Latvian 
Letzeburgesch 259, 262, Maps 65, 69 
Leun [Bru dialect] 177 
Levei-Tulu 109, Map 30 
Lewada-Dewara 111, Map 32 
Lewo 108, Map 27 
Leya 321, Maps 74, 77 
Leyigha 321, Maps 93, 95 
Lezgic 239, Map 59 
Lezgian 234-6, 239, Map 59 
Lhanima 101, 117, Map 35 
Lhasa (hLasa, U, Central dBus) 164 
Lhokpu 165, 215, Map 51 
Lhomi (hLomi, Kath Bhotia) 164, 214, 
Map 51 
Lhota 167, Map 51 
Li (Hlai) 179, 184, Map 45 
Li-Tamare-Niende 321, Maps 103-104 
Li-Tamare 321, Map 102 
Liangmei (Liang, Kwoireng) 167-8, 213 
Liberian Pidgin English 309, 321, 332 
Liberian Settler English 308-9, 321, 332, 
Map 106 
Libido 287, Map 72 
Libo Map 98 
Libo 321, Map 98 
Libobi-Likata-Lifonga 321, Map 82 
Libya [Arabic subgroup] Map 71 
Libyan Bedouin dialects Map 71 
Lifonga 321, Maps 82, 87 
see also Libobi-Likata-Lifonga 
Ligbi 321, Map 104 
see also Jogo 
Ligri 321, Map 98 
Liguri [Daju dialect] 284, Maps 71-72 


Ligurian 261-2, Map 67 
Lihir 109, Map 30 
Lijili see Migili 
Likata see Libobi-Likata-Lifonga 
Likile (Likila) 321, Maps 79, 82 
Liko 321, Map 79 
Likoka 321, Map 82 
Likola Map 87 
Likpa (Dompago) 321, Maps 102, 104 
Likpe see Sekpela 
Likum 109, Map 30 
Likungu 321, Map 82 
Lila [Ngwi] see Aluo 
Lilau (Ngaimbom) 113, Map 33 
Lillooet (St'at'imcets) 8, 17, Maps 6-9 
Lima 321, Map 77 
Limba 332, Maps 106-107 
Limba 321, Maps 106-107 
Limbu [Tibetan] 164-5, 213-14, Map 51 
Limbum 321, Map 95 
Limi 321, Map 80 
Limilngan 100, 116, Map 35 
Limousin (French) Map 69 
Linda 314, 321, Map 89 
Lindrou 109, Map 30 
Lindu 120, Map 39 
Lingala 302, 310, 311, 321, 330, Maps 82, 
87 
Lingao see Be 
Linghkim 214 
Lingithigh (Linngithigh) 101, 117, 
Map 35 
Lingkabau 121, Map 41 
Linngithigh see Lingithigh 
Lino 321, Map 87 
Liombo 321, Map 79 
Lionje 321, Maps 79-82 
Lipan 8, 17, Maps 2-3 
Lipo 172-3, 175, Map 46 
Liptako Ful 321, Maps 102, 104 
Lipulio see Lachi; Laji 
Lisabata-Noniali 118, Map 37 
Lisoid 175, Maps 45-46 
Lisu (Yobin) 167, 169-70, 172-3, 175, 
181, Maps 45-49, 51 
Lithuanian 232, 234-7, Maps 58, 62 
Little Nicobar [Nicobarese dialect] 178 
Liude 173, 175, Map 46 
Livonian 238 
Livvi (Olonets, Livvikovskij) 238, 259, 
263 
Livvikovskij see Livvi 
Lo (gLo) [Tibetan] 164, Map 51 
Lo (Loo) [Gur-Adamawa] 321, Map 98 
Lobir(i) 321, Maps 103-105 
Lobo 321, Maps 82, 87 
Lobola 321, Maps 82, 87 
Lobr see Birifor 
Lobu 121, Map 41 
Lodhanti Map 54 
Lodiya 321, Map 80 
Logba see Ekpanu 
Logbara 283 
Logiri (Lango) [Lotuko dialect] 284 
Logo 321, Map 79 
Logon (Loghon) 321, Maps 103-105 
Logone see Lagwan 
Loh 108, Map 27 
Lohiki 111, Map 32 
Lohorong (Lorung) 165-6, 214, Map 51 
Loi [Mon-Khmer] 176, 178, Maps 47, 49 
Loi (Ngiri) [Bantu] 321, Maps 82, 87 
Loka (Lokaa, Yakur) 321, Maps 93, 95 
Lokai [Madi dialect] 284 
Loko [Mande] 321, Maps 106-107 
Loko(-Ikom) [Cross subgroup] Maps 93, 
95 
Lokoli 321, Maps 93, 95 
Lokono see Arawakan, Arawak 
Lokoutka 71 
Lokoya [Lotuko dialect] 284 
Lokuto-Maa Map 85 
Lolo [Vietnamese Nisu] see Nisu 
Loloan see Malayo-Polynesian, Malayan, 
Malay, South-Western Bali 
Loloda 118, Map 37 
Loloid Map 46 
Lolo Maps 46-47 
Loloish see Ngwi 
Lolopo (Xiangtang) 172-3, 175 
Lolue 321, Map 93 
Lom 121, Map 43 
Loma [Mande] 311, 321, Map 106 
Lombard (Italian) 261, Map 67 
Lombe [Bantu] 321, 330, Maps 92-93 
Lombi [Nilo-Saharan] 321, Maps 79-80 
Lombo [Bantu] 321, Map 79 
Lomoto 321, Map 79 
Lomotwa 321, Maps 77-78 
Lomwe 312, 321, Map 75 
Lomya [Lotuko dialect] 284 
Londo (Borondo) 321, Map 92 
Longa 109, Map 30 
Longgu 109, Map 29 
Longmi see Lungmi 
Longo 321, Map 81 
Longto 321, Map 91 
Longuda 308, Maps 91, 98 
Longuda 321, Maps 91, 98 
Loniu 109, Map 30 
Lonwolwol 108, Map 27 
Lonzo 321, Maps 81, 83 
Looliya (Dadiya) 321, Map 98 
Lopa 321, Maps 100-102 
Lopit [Lotuko dialect] 284 
Lopnur [Uighur dialect] 186 
Lorediakarkar 108, Map 27 


Lorrain (French) Map 69 
Lorung see Lohorong 
Losakanyi 321, Map 81 
Lotapa 321, Map 80 
Lotha 213 
Lotsu-Piri 321, Map 98 
Lotud 121, Map 41 
Lotuko-Maa (Lotuko-Mafro-Asiatic) 284, 
307, Map 86 
Lotuko-Maa 307, Maps 76, 86 
Lotuko 284, 321, Maps 72, 86 
Lou (Torricelli) 113, Map 33 
Loucheux see Gwich'in 
Louisiades 110, Map 31 
Louisiana Creole (Gombo, Négue) 27, 
Maps 8-9 
Loun 118, Map 37 
Loup 7, 17, Map 1 
Love see Brao 
Loven (Jru', Boloven) 177-8, Maps 47-48 
Low Country Semang 121, Map 44 
Lower Murray 102, Map 35 
Lowland East Cushitic 286 
Lowland Kechua (Kichua, Quechua) 70, 
Maps 15, 17, 21 
Lowudo [Lotuko dialect] 284 
Lowudu 284 
Lozi 302, 321, 329, Maps 74, 77 
Lozwa 321, Map 105 
Luangiua 107, 109, Maps 26, 29 
Luano 321, Map 77 
Luba 311, Maps 78, 80-81 
Ciluba 310 
Luba Katanga (Luba-Shaba) 321, 329, 
Maps 77-78 
Luba Lulua 321, 329, Maps 78, 80-81 
Luba Shankali 321, Map 78 
Luba Shaba see Luba, Luba Katanga 
Lubila 321, Map 93 
Lucanian (Italian) 261, Map 67 
Luchazi 321, Maps 77, 84 
Lüdian 238 
Ludima 321, Maps 83, 88 
Lue (Ly) 180, Maps 47, 49 
Luganda (Ganda) 302, 312, 321, 
Maps 79-80, 86 
Lugbara 307, 321, Maps 79, 86 
Lugoli (Ragoli, Maragoli) 321, 330, 
Map 86 
Luguru (Lugulu, Ruguru) 301, 321, 
Maps 76, 85 
Luhu 118, Map 37 
Luhupa see Tangkhul 
Luhya Map 86 
Luhya 311, 321, 330, Map 86 
Luimbi 321, Map 84 
Luiseño 9, 17, Maps 3-4, 8-9 
Luish see Sak 
Lukep 113, Maps 30, 33 
Lule Saami 263 
Lule-Vilelan 72, Maps 20, 22 
Lule 72, Maps 20, 22 
Lulua 321, Map 78 
Lumba [Himalayan] 165-6, Map 51 
Lumbu [Bantu: Lenje-Tonga M60] 321, 
Maps 74, 77 
Lumbu [Bantu: Songye L20] 321, 
Maps 78, 80 
Lumbu (Luumbu) [Bantu: Shira-Punu] 
321, Maps 83, 88 
Lummi [Northern Straits Salish] 20 
Lun Daye 120, Map 41 
Luna 321, Maps 78, 80-81 
Lunda-Ndembu 321, Maps 77, 84 
Lunda-Ruund 321, Maps 77, 84 
see also Ruund 
Lundu-Balong Maps 92-93 
Lundwe 321, Maps 74, 77 
Lungga [Santa Ysabel] 109, Map 29 
Lunggar [Tibetan] 164, Map 45 
Lungmi (Longmi) 169, 175, Map 49 
Lungu (Idun) 321, Maps 96-97 
hLunrtse 164 
Luntu 321, Maps 78, 80-81 
Lunu 321, Maps 78, 80 
Lunwinja 321, Map 80 
Luo 311, 321, Maps 79, 86 
Luo (Lwo(o), Dhe Lwo, Jur Lwo) 284, 
318, Maps 72, 79 
North Maps 79, 86 
South Maps 79, 86 
Luri [Chadic] 321, Maps 96-97 
Luri (Luri) [Iranian] 211, 216, 277, 
Map 70 
Luri-Bakhtiari Map 53 
Luri-Feyli Map 53 
Luri-Mamasani Map 53 
Lushai see Mizo 
Lushootseed 8, 17, Maps 6, 8-9 
Lüsu see Ersu 
Luthigh 101, 117, Map 35 
Luunda 321, Map 77 
Luvale see Lwena 
Luvelu 321, Maps 74-75 
Luwo see Luo 
Luyana 321, 329, Maps 74, 77, 84 
Lüzi see Ersu 
Luzi see Nungish 
Lwel 321, Map 81 
Lwena (Luvale) 321, 330, Maps 74, 77, 84 
Lwo [W. East Sudanic] see Luo 
Ly (Lyy) see Lue 
Lyele (Lele) 302, 321, Map 104 
Lyngnam [Khasi dialect] 178 


M'kang (Kaang) 168 
Marya see Yukatekan/Maya 


Ma'a (Mbugu) [E. Cushitic] 304, 321, 
Map 86 
Ma'a [S. Cushitic] 287 
Ma'anyan see Barito, East, South-East 
Barito 
Ma'da [Chadic] 321, Map 94 
Ma (Maa) [Koho dialect] 177 
Maa [E. Sudanic] Map 86 
Maasai (Masai) 284, 301, 307, 310, 321, 
330, Maps 72, 76, 85-86 
Maay 321, Map 85 
Ma‘aza Map 71 
Maba [Austronesian] 118, Map 37 
Maba Fur [Nilo-Saharan] 283, Maps 71, 
73 
Maban (Maba) 283, 307, Maps 71, 73, 
89 
Maba (Mabang) 283, 307, 311, 321, 
Maps 71, 73, 89 
Mabaan (Jumjum) 284, Map 72 
Mabale 321, Maps 82, 87 
Maban see Maba Fur 
Mabang see Maba Fur, Maba 
Mabas 321, Map 94 
Mabi 321, Maps 88, 92 
Mabo-Barkul 321, Map 96 
Mabuso 112, Map 33 
Maca see Maka 
Macedo-Rumanian (Aromanian) 
259-60 
Macedonian 232-3, 236, 259-60, 
Maps 61, 66 
in Australia Map 36 
Machakos see Masaka 
Machame 321, Map 86 
Machinga 321, Map 76 
Machongrr see Chang 
Machoto see Itonama 
Maci 173, 175, Map 46 
Macro-Algonquian 9 
Macro-Andean 69 
Macro-Arawakan 65-7 
Macro-Je 62, 72-3 
Macro-Katembri-Taruma 73-4 
Macro-Kulyi-Cholonan 69 
Macro-Lekoan 69-70 
Macro-Otomakoan 65 
Macro-Paesan 63-4 
Macro-Panoan 70-1 
Macro-Puinávean 67-8 
Macro-Siouan 9 
Macro-Tekiraka 68 
Macro-Tupi-Karibe 74-6 
Macro-Waikuruan 72 
Macro-Warpean 71 
Macu see Maku 
Maculai 54 
Macuxi see Makushi 
Mada [Central Nigerian] 321, Map 96 
Madak 109, Map 30 
Madak 109, Map 30 
Madang (Madang-Adelbert Range) 99, 
111, Map 32 
Madeira-Azores Map 68 
Maden 115, 118, Maps 34, 37 
Madi 284, 321, Maps 72, 79, 86 
Madige 116, Map 35 
Madiha see Kulina 
Madik 115, Map 34 
Madimba 321, Map 83 
Madingu 321, Map 83 
Madurese 103, 121, Map 42 
Maele (Malele) 321, Map 79 
Maew see Hmong 
Maewo, Central 108, Map 27 
Mafa-Matakam 
North Map 94 
South A-D Map 94 
D (Cuvak-Mafa) Map 94 
Mafa 321, Map 94 
Mafea [N. Vanuatu] 108, Map 27 
Mafia [Bantu: Sabaki] 321, Map 76 
Mag-Anchi Ayta 119, Map 40 
Mag-Indi Ayta 119, Map 40 
Magahi 214, Map 51 
Magar 165, 214, Map 51 
Magh see Arakanese 
Magi see Mailuan, Mailu 
Magindanao 103, 119, Map 40 
Magori 110, Map 31 
Magori 110, Map 31 
Magyar see Hungarian 
Mah Meri see Besisi 
Maha 321, Maps 98, 100 
Mahakiranti 165 
Mahas 284 
Mahasui see Kiunthali 
Mahi 321, Map 102 
Mahican 7, 17, Map 1 
Mahongwe 321, Map 87 
Mahra see Mehri 
Mahwa 321, Maps 89-90 
Maiak [Burun dialect] 284 
Maiduan 7-8, 10, 17, Maps 4, 8-9 
Maidu 17 
Southern see Nisenan 
Maii 108, Map 27 
Mailuan 99, 110, Map 31 
Mailu (Magi) 110, Map 31 
Maimai Map 33 
Main [Angan subgroup] 111, 
Map 32 
Main Kenyah 120 
Maingtha see Achang (Ngochang) 
Mainland Comox 13 
Maipure 66, Map 12 
Maipurean 62, 65-7, Maps 12-21 
Mairasi 99, 115, Map 34 
Mairasi 99, 115, Map 34 
North-Eastern Mairasi 115, Map 34 
Maisin 110, Map 31 
Maithili 212-14, Map 51 
see also Angika 
Maiwa 110, Map 31 
Maiya Map 53 
Maja 321, Map 93 
Majang (Masongo) 284, 289 
Majeerteen see af-Majeerteen 
Majera 321, Map 99 
Majhi [Indo-Aryan] 214 
Maji see Dizi 
Majingay (Sar) 321, Maps 89-90 
Majoruna-Matsés (Mayoruna) 71, 
Maps 14-17 
Majung see Chang 
Mak (Mo) 179, 184, Map 45 
Maka (Maca) [Matákoan] 72, Maps 20, 
22-23 
Maka-Njem [Bantu] Maps 87-88, 91-92 
Makah [Nootkan] 8, 17, 19, Maps 6, 8-9 
Makalero 119, Map 38 
Makanya (Mankanya, Bola) 322, 
Maps 107-108 
Makasai 119, Map 38 
Makassarese 120, Map 39 
Makatao 182 
Makere 321, Map 79 
Makian 
East 118, Map 37 
West 118, Map 37 
Makina 321, Maps 87-88 
Makiri 74 
Makiritare (Maquiritari) 76, Maps 12-13 
Maklew 115, Map 34 
Mako 77 
Makoma 321, Map 77 
Makonda [Bantu: Umbundu] 321, 
Map 84 
Makonde [Bantu: Rufiji] 312, 321, 329, 
Maps 75-76 
Maku'a [Austronesian] 119, Map 38 
Maku (Macu) [Kaliánan] 68, Maps 12-13 
Makua [Bantu] 329, Maps 75-76 
Makua 321, Maps 75-76 
Makua-Lomwe 312 
Makuna-Erulia 68, Maps 12-13, 16 
Makunduchi 321, Maps 76, 85 
Makurap Maps 16-17, 20-21 
Makushi (Macuxi, Teweya, Teueia) 76, 
Maps 12-13 
Mal (Pray, Phay, T'in) 176, 178, Map 47 
Mala see Rumaya 
Malaba 321, Maps 75-76 
Malagasy 310, Maps 41, 75 
Andamanese 217, Map 52 
Malaita-San Cristobal 109, Map 29 
Malaita 109, Map 29 
North Malaita 109, Map 29 
South Malaita 109, Map 29 
Malakhel 211, 216, Map 53 
Malakmalak 116, Map 35 
Malali Map 18 
Malalulu 321, Map 85 
Malamalai (Bonga) 113, Map 33 
Malango 109, Map 29 
Malas 112, Map 33 
Malasanga 111, 113, Maps 32-33 
Malaweg 119, Map 40 
Malayalam 212-13, 214, 217, Map 52 
Malayo see Wamaka 
Malayo-Polynesian 121, 184, Maps 41-44 
Malayan 184 
Malay 119, 121, 162-3, 182-3, 
193-4 
Aboriginal Malay 103, Map 44 
Ambonese (Malayu Ambon) 97, 
103, 104, 109, 118, 262, 
Maps 37-38, 43-44, 48 
Baba (Straits, Chinese) Malay 
103-4, 162, Map 25 
Bahasa Malayu 119, Map 38 
Bahasu Melayu Loloan (S.W. 
Bali) 121, Map 42 
Bazaar (Pasar, Bazar) Malay 103, 
Map 25 
Borneo Coastal 120, Map 41 
Coastal Map 44 
Cocos 216, Map 52 
Jakarta Malay 121, Map 42 
Local 121, Map 44 
Manado Malay 120, Map 39 
North-eastern 183-4, Map 48 
North-western 183-4, Maps 44, 48 
Para-Malay 103, 121, 183, 
Map 44 
Sri Lankan Creole 214, Map 52 
Sumatra Malay 121, Map 43 
Western Coastal 120, Map 41 
see also Indonesian 
Malayic Dayak 120, Map 41 
Northern 121 
see also Austronesian 
Malchin Tuva see Tannu Uriankhai 
Maldivian (Divehi) 215, 217, Map 52 
Male (Malo) [Ometo] 288, 289, Map 72 
Male [Papuan: Rai Coast] 112, Map 33 
Malele see Maele 
Maleng 177-8, Map 47 
Maleu 109, Map 30 
Malgana 100, 116, Map 35 
Malgbe 321, Maps 90, 99 
Malila 321, Maps 76-77 
Malin 167, 175 
Malinaltepec Tlapaneko 53 


Malinke 302, 312, 321 
Maliseet-Passamaquoddy 7, 17-18, 
Maps 1, 7-9 
Malmariv 108, Map 27 
Malo see Male [Ometo] 
Maltese 261, 281, Maps 67, 73 
in Australia Map 36 
Malto 213, 217, Map 51 
Malua Bay (Petarmul) 108, Map 27 
Maluku [subgroups] Maps 34, 36 
Central 114-15, 118, Maps 34, 37 
East Central 115, 118, Map 34 
West Central 118 
South-East 115, 119, Maps 34, 38 
Matlila (Malila) 278, Map 70 
Malvi 213, Map 54 
Malwal [Jieng dialect] 284 
Malyangapa 102, 117, Map 35 
Mama [Bantoid] 321, Map 96 
Mamaa [Trans-New Guinea] 110, Map 32 
Mamai [Trans-New Guinea subgroup] 
113 
Mamanwa 119, Map 40 
Mamba [Bantu: Congo] 321, Maps 78, 80 
Mambai [Austronesian: Timor] 119, 
Map 38 
Mambangura 101, 117, Map 35 
Mambay [Mbum-Day] 321, Maps 91, 94 
Mambiloid 305, 306, 331, Maps 87, 91-93, 
95 
Mambila 321, Map 95 
Mamboru 118, Map 38 
Mambwe 321, Maps 76-78 
Mamean 52, Maps 10-11 
Mam 50, 52, Map 10 
Mamenyam (Bamenyam) 321, Maps 93, 
95 
Mamfe Maps 93, 95 
Mampruli 321, Maps 102-104 
Mamuju 120, Map 39 
Mamulique see Comecrudan 
Mamusi 109, Map 30 
Mamvu see Tengo 
Man see Yao [Miao-Yao] 
Mana Map 89 
Manado Malay 120, Map 39 
Managalasi 110-11, Maps 31-32 
Manam 113, Map 33 
Manam 113, Map 33 
Manambu 113, Map 33 
Manang (Nyishang, Nyeshang) 164, 
Map 51 
Manao 66, Maps 12, 16 
Manasir Map 70 
Manchati see Pattani 
Manchu-Tungus 187, Map 45 
Manchu 187 
Manchu 187, 191, Map 45 
Manchurian Reindeer Ewenk see Evenki 
(Ewenke) 
Manda [Bantu subgroup] 329, 
Maps 75-77 
Manda 321, Map 76 
see also Siska (Tonga) 
Manda [Dravidian] 217, Maps 51-52 
Mandage Maps 90, 99 
Mandahuaca see Mandawaka 
Mandaic [Aramaic subgroup] 278, 
Map 70 
Mandaic Map 70 
Mandan [Siouan] 9, 18, Maps 5, 8-9 
Mandandanyi 101, 117, Map 35 
Mandar [Sulawesi] 120, Map 39 
Mandara (Amandara) [Chadic subgroup] 
Maps 95, 99 
Mandarin see Chinese (Sinitic), Mandarin 
Mandawaka (Mandahuaca) 66, 
Maps 12-13 
Mandaya 119, Map 40 
Mande 306, 309, 332, Maps 73, 100-108 
East 332, Maps 103-108 
West Maps 103-108 
Mandeali Map 54 
Mander 115, Map 34 
Mandi 113, Map 33 
Manding [Mande subgroup] 302, 302, 332 
Mandinka 302, 311-12, 321, 
Maps 107-108 
Maninka 302, 302, 312, 322, 
Maps 106-107 
Manding-Cono Map 105 
Kono-Vai Map 106 
Manding(-Kuranko) Maps 106-108 
Manding 312, Maps 105-108 
Manding-Jogo Map 104 
Mandinka see Manding, Mandinka 
Manem (Jeti, Skofro, Wembi) 112, 115, 
Maps 33-34 
Manemgi 115, Map 34 
Manengabo 321, Map 93 
Mang 176, 178, Maps 46, 47 
Mangalé 73 
Manganja (Malawian Nyanja-Chewa) 
321, Map 75 
Mangap 113, Maps 30, 33 
Mangarayi 116, Map 35 
Mangareva 107, Map 26 
Mangarla 100, 116, Map 35 
Mangas (Mangas-Margi) 322, 
Maps 97-98 
Mangba 322, Maps 82, 87 
Mangbetu-Asua 283, 307, Maps 79-80 
Mangbetu 284, 322, 330, Map 79 
Mangbetu-Efe (Mangbutu-Efe) 283, 307, 
Map 79 
Mangbutu 322, Map 79 
Mangbutu-Efe 284, Map 79 


Mangean 53 
Mange (Mangue) 45, 49, 53 
Mangerri 116, Map 35 
Mangga Buang 111, Map 32 
Manggarai 118, Map 38 
Mangkong 177-8, Map 47 
Mangseng 109, Map 30 
Mangue see Mangean, Mange 
Mangyan 119, Map 40 
Manhthe 116, Map 35 
Manihiki-Rakahanga 107 
Manimo see Vanimo 
Maningrida 100, 116, Map 35 
Maninka see Manding, Maninka 
Maninkaxanwo 322, Map 107 
Manipa 118, Map 37 
Manipuri 212-13, 215, 217, Map 51 
see also Meithei 
Maniq (Tonga’, Mos, Ten'en) 121, 177-8, 
Maps 44, 48 
Manitereni see Piro 
Manitsawa (Maritsauá) 75 
Manjaku [Atlantic] 311 
Manjanku 322, Map 108 
Manjiljarra 100, 116, Map 35 
Manjung see Chang 
Mankanya see Makanya 
Mankon 322, Maps 93, 95 
Mannka Kankan [Mande] 311 
Mano 311, 322, Map 106 
Manobo 119, Map 40 
Mansa* 287 
Mansi (Vogul) 231, 234, 238, Maps 56, 58 
Manta 322, Maps 93, 95 
Mantharta (Mantharda) 100, 116, 
Map 35 
Mantion 115, Map 34 
Mantjintja 100, 116, Map 35 
Manu (Manumanaw, Punu) 174-5, 
Map 49 
Manubaran 99, 110-11, Maps 31-32 
Manus [Admiralty subgroup] Map 30 
East 109, Map 30 
West 109, Map 30 
Manus [Bima-Sumba] 118, Map 38 
Manusela 118, Map 37 
Manx 263 
Manya 322, Map 106 
Manyag see Muya 
Manyanga 322, Map 84 
Manyika 322, Maps 74-75 
Manz [Amharic dialect] 288 
Manza Map 90 
Manza 322, Maps 89-90 
Mao (Sopvoma) [Tibeto-Burman] 167-8, 
180, 213, Map 51 
Mao [Omotic subgroup] 288, 289, 
Map 72 
Mao of Bambeshi 288, Map 72 
Mao of Didessa 288, Map 72 
Mao, Southern [Omotic: Gonga] see 
Anfillo 
Mao Shan see Dai, Dai Mao 
Maonan 179, 184, Map 45 
Maore [Bantu: Swahili] 322, Map 75 
Maore [Comorian variety] 216 
Maori 107, Map 26 
Cook Islands 107, Map 26 
Mape 110, Map 32 
Mapena 110, Map 31 
Mapeo 322, Map 91 
Mapi [Awyu-Dumut] 114, Map 34 
Mapia [Micronesian] 108, 115, Maps 28, 
34 
Mapopoi see Popoi 
Mapos Buang 111, Map 32 
Mapoyo-Yavarana (Wanai) 76, 
Maps 12-13 
Mapudungu 71, Maps 22-23 
Ma‘qili 281 
Maquiritari see Makiritare 
Mara [Austronesian] see Maran, Mara 
Mara (Miram, Lakher) [Tibeto-Burman] 
168, 175, 213, 215, Map 51 
Marachi 322, Map 86 
Maragoli see Lugoli 
Marakhus (Tape) 108, Map 27 
Marakwet 322, Maps 72, 86 
Maralango 111, Map 32 
Maram 168, 213, Map 51 
Maramba 113, Map 33 
Maran [Austronesian] 100, 116, Map 35 
Mara 116, Map 35 
Maranao 103, 119, Map 40 
Maranunggu 116, Map 35 
Mararit Maps 71-72 
Marathi Map 51 
Marathi 212-13, 214, 216-17, 
Maps 51-52, 54 
Maratino Map 3 
Marau 109, 115, Maps 29, 34 
Marawa 66, Map 12 
Marawan-Karipura 66 
Marawara 102, 118, Map 35 
Marba 322, Maps 90, 94 
Marches (Italian) 261, Map 67 
Marchha see Rangpa, Rangpa 
Marda [Kunama dialect] 290 
Mardin Map 70 
Mare 322, Map 88 
Margany 101, 117, Map 35 
Margi 
North 322, Map 94 
South 322, Map 94 
Margu 116, Map 35 
Mari [Austronesian: Huon Gulf] 111, 113, 
Maps 32-33 


Mari (Cheremis) [Finno-Ugric] 231, 234, 
236, 238, Maps 56, 58 
Eastern Meadow Mari 234n. 
Hill Mari 234n. 
Mari [Papuan: Sepik Hill] 113, Map 33 
Maria [S.E. Papuan] 110-11, Maps 31-32 
Mariaté 65 
Maric 101, Map 35 
Maricopa 8, 18, Maps 3-5, 8-9 
Marienberg 113, Map 33 
Marin Miwok see Miwok, Western, Coast 
Miwok 
Marind 99, 111, 114, Maps 32, 34 
Bian Marind 111, 114, Maps 32, 34 
Marind 99, 111, 114, Maps 32, 34 
Maring [Tibeto-Burman] 168, 175, 213, 
Maps 49, 51 
Maring [Trans-New Guinea] 112, Map 33 
Maringe [Austronesian] 106, 109, Map 29 
Marino 108, Map 27 
Maripti see Mukuchi 
Maritime Maipurean 66, Maps 12-13 
Maritime Tsimshian see Tsimshianic 
Maritsaua see Manitsawa 
Marka [Mande] 322, Map 104 
Marke 322, Map 91 
Markham 111, 113, Maps 32-33 
Marlé [Arbore variety] 287 
Marma 215 
Marngu 100, 116, Map 35 
Maro see Morehead-Upper Maro 
Marovo 102, 106, 109, Map 29 
Marpha Thakali 164 
Marquesas 
North 107, Map 26 
South 107, Map 26 
Marri Ammu 116, Map 35 
Marridan 116, Map 35 
Marrimanindji 116, Map 35 
Marringarr 116, Map 35 
Marrithiyel 116, Map 35 
Marrityabin 116, Map 35 
Marrngu see Marngu 
Marrulha 101, 117, Map 35 
Marshallese 108, Map 28 
Martu Wangka 101, 117, Map 35 
Martuthunira 100, 116, Map 35 
Maru (Lawngwaw, Langsu) [Burmish] 
171, 175, Maps 46-47, 49 
Marubo 70, Maps 14-16, 20 
Maruhia see Isabi 
Marwari 211, 213-14, Maps 51, 54 
Masa 308, 331, Maps 90-91, 94 
East Map 90 
North Maps 90-91, 94 
Masa Maps 90-91, 94 
South Maps 90-91, 94 
Masaba 322, 330, Map 86 
Masai see Maasai 
Masaka (Machakos) 322, Map 85 
Masakara 73, Map 18 
Masakin Map 72 
Masalit (Kaana Masala) 283, 285, Map 71 
Masama 322, Map 86 
Masateko (Mazateco) 49-50, 53 
Masawa (Mazahua) 45, 53, Maps 10-11 
Masbate see Sorsogon (Masbate) 
Masbateño 119, Map 40 
Mascouten 7 
Mascoy see Maskoi 
Masela-South Babar 119, Map 38 
Masenzu 322, Map 83 
Mashakalian 73, Maps 18-19 
Mashakali (Maxakali) 73, Maps 18-19 
Mashi [Bantu: Kwangwa] 322, Map 77 
Mbwela Map 84 
Mashi [Bantu: Nkoya] see Mbwela 
Mashi [Omotic] see Zilmamu 
Mashile 286 
Mashubi 73 
Masiwang 115, 118, Maps 34, 37 
Maskelynes 108, Map 27 
Maskoian 72, Maps 18-19 
Maskoi (Mascoy, Emok) 72, Maps 18, 
20, 22 
Maslam 322, Maps 90, 99 
Masmida 282 
Masoanka (Sua) 322, Map 108 
Masongo see Majang 
Masqaén 288, Map 72 
Massachusett-Narragansett 7, 18, Map 1 
Massenrempulu 120, Map 39 
Massep 115, Map 34 
Massina Ful 
East 316, Map 104 
West 328, Maps 104, 107-108 
Matabeng (Matjanchi) 166 
Mataco see Matako 
Matagalpa 61, 65, Map 12 
Matakam 322, Map 94 
Matakoan 72, Maps 20-23 
Matako (Mataco, Wichi) 72, 
Maps 20-23 
Matal 322, Map 94 
Matanawi 71, Maps 16-17, 20 
Matbat 115, 118, Maps 34, 37 
Matchi [Achik subdialect] 166 
Matengo 302, 322, Maps 75-76 
Matepi 112, Map 33 
Mathira 322, Map 86 
Matipu see Amonap 
Matjanchi see Matabeng 
Matlatzinkan 45, 50, Maps 10-11 
Matngele 116, Map 35 
Matsigenga 67, Map 21 
Mattole 8, 18, Map 4 
Matu see Ngala 


Matukar 113, Map 33 
Matumbi 322, Map 76 
Mau (Mauka) 322, Maps 105-106 
Mauritius Creole French 216, Map 52 
Mawae 111, Map 32 
Mawahib Map 70 
Mawak 112, Map 33 
Mawake see Ulingan 
Mawali 279, Map 70 
Mawan 112, Map 33 
Mawanda see Ndonde 
Mawé (Sateré-Mawé) 74, Maps 17-19 
Mawenzi 322, Map 86 
Mawes 115, Map 34 
Mawia 322, Maps 75-76 
Mawng 116, Map 35 
Mawshang see Tangsa 
Maxakali see Mashakali 
Maxineri see Piro 
May [Cushitic dialect group] 287, 
Map 72 
May [Vietic] 176-8, Map 47 
Maya [Austronesian: Raja Empat] 115, 
118, Maps 34, 37 
Maya [Pama-Nyungan] 100, 116, Map 35 
Maya [Papuan: Madang] see Saki 
Mayan [Meso-America] 46, 51-2, 
Maps 10-11 
Maya 52 
Mayawarli 101, 117, Map 35 
Mayi 101, 117, Map 35 
Mayi-Kulan 101, 117, Map 35 
Mayi-Kutana 101, 117, Map 35 
Mayi-Thakurti 101, 117, Map 35 
Mayi-Yapi 101, 117, Map 35 
Mayo (Mayo-Yesan, Yasyin, Yasi, Yessan- 
Mayo) [Papuan] 113, Map 33 
Mayo [Uto-Aztecan] 54, Maps 3, 9 
Mayogo 322, Map 79 
Mayol (Moyon, Mon) 168, Map 51 
Mayoruna see Majoruna-Matsés; Morike 
Mayu [Berta dialect] 285, 290 
Mazahua see Masawa 
Mazanderani 211, 216, Map 53 
Mazateco see Masateko 
Mba [Bantoid: Mbam-Nkam] 322, 
Map 95 
Mba [Ubangi: Ngbaka-Mba] Map 79 
Mba 322, Map 79 
Mbabaram 101, 117, Map 35 
Mbagani 322, Map 81 
Mbai 322, Maps 89-90 
Mbai-Diabe 322, Map 90 
Mbala [Ubangi: Banda] 322, Map 89 
Mbala [W. Bantu: Mbala] Maps 81, 83 
Mbala 322, Maps 81, 83 
Mbalantu 322, Map 84 
Mbaloh 120, Map 41 
Mbam 306, 331, Maps 92-93 
West Maps 92-93 
Mbam-Nkam 306, Map 95 
Mbamba 322, Maps 81, 83 
Mbangala 322, Map 84 
Mbangi 322, Maps 79, 82, 89 
Mbangwe Maps 83, 88 
Mbanja [Bantu: Ovambo-Kwanyama] 
322, Map 84 
Mbanjo [Bantoid: Mambiloid] 322, 
Maps 92-93 
Mbanza 322, Maps 82, 87, 89 
Mbanza-Manteke 322, Maps 83-84 
Mbara 101, 117, Map 35 
Mbariman-Gudhinma 101, 117, Map 35 
Mbata [Bantu: Kongo] 322, Map 84 
Mbati [Bantu: Ngundi] 322, Map 87 
Mbato [Kwa: Potu] 322, Map 105 
Mbause 322, Map 81 
Mbaya see Kadiwéu 
Mbe 322, Map 95 
Mbedam 322, Map 94 
Mbeere 322, Maps 72, 85 
Mbeku 322, Map 84 
Mbele [Ubangi: Banda] 322, Maps 89-90 
Mbelo [Bantu: Mongo-Nkundo] 322, 
Map 81 
Mbembe [Central Jukunoid] 322, Map 95 
Mbembe [Upper Cross] Map 95 
Mbembe 322, Maps 93, 95 
Mbere 311, 322, Maps 81, 83, 87 
Mbere Maps 81, 83, 87 
Mbesa 322, Maps 79, 82 
Mbeti (Beti) 303, 322, Map 92 
Mbi 322, Map 89 
Mbidiankamba [Mongo dialect] 330 
Mbilia-Nkamba 322, Map 81 
Mbiywom 101, 117, Map 35 
Mbo [Bantu: Bira-Huku D30] 322, 
Map 79 
Mbo [Bantu: Lundu-Balong A10] 322, 
Maps 92-93 
Mbodomo 322, Map 87 
Mbogno 322, Map 95 
Mboi [Waja-Jen subgroup] Map 98 
Mboi 322, Maps 94, 98 
Mboi [Waja-Jen: Yungur] Map 98 
Mboka [Bantu: Kongo] 322, Map 83 
Mboko [Bantu: Mbosi] 322, Maps 83, 87 
Mbokomu see Ubu-Mbokomu-Moshi- 
Kirua-Kilema-Mwika 
Mbole 322, Maps 79-81 
Mbombeleng 322, Map 87 
Mbombo 322, Map 83 
Mbong'a 322, Map 91 
Mbonge 322, Map 93 
Mbonji 322, Maps 82, 87 
Mbonza see Mbonzo 
Mbonzi [Bantu: Mbosi] 322, Map 83 
Mbonzo (Mbonza) [Bantu: Bobangi- 
Ntomba] 322, Maps 82, 87 
Mbosi Maps 81-83, 87, 89 
Mboshi 322, Maps 81, 83 
Mbowe-Milonga-Liyuwa 322, Map 77 
Mbü'a see Mbya Guarani 
Mbugu see Ma'a 
Mbugwe 322, Map 86 
Mbui 322, Map 84 
Mbuko 322, Map 94 
Mbukushu [Bantu: Kwangwa] 322, 
Maps 74, 77, 84 
Mbukushu [Kwangari variety] 302 
Mbukushu [Mashi dialect] 302 
Mbula-Bwaza 322, Maps 91, 98 
Mbulu [Cushitic] see Iraqw 
Mbulu [Ubangi] 322, Maps 82, 89 
Mbulungish 322, Map 107 
Mbum-Day 305, Maps 87, 89-91, 94, 98 
Mbum 331, Maps 89-91, 94, 98 
Central Map 91 
Laka-Mbum Maps 89-90 
South Maps 87, 91 
East Mbum 316, Maps 89-90 
West Mbum 328, Map 91 
Mbun (Mbuun) 322, Map 81 
Mbunda 322, Maps 77, 84 
Mbunga 322, Map 76 
Mbuun see Mbun 
Mbwale 322, Map 93 
Mbwas 322, Map 87 
Mbwela [Bantu: Nkoya L50] 322, Map 77 
Mbwela (Mashi) [Bantu: Kwangwa K40] 
322, Map 84 
Mbya Guarani (Mbü'a) 74, Maps 19, 23 
Meakambut 113, Map 33 
Meax 115, Map 34 
Mebeza 322, Maps 87-88 
Mebu 112, Map 33 
Mecca Hijazi Map 70 
Mecha see Walligga 
Mech(e) see Bodic, Bodo 
Medebur 113, Map 33 
Media Lengua [mixed language] 27 
Medina Hijazi Map 70 
Mednyj Aleut [mixed language] 7, 26, 
239 
Medumba 322, Map 93 
Me'en (Megan, Bodi) 284, 289 
Mefele 322, Map 94 
Megaka 322, Map 93 
Megiar 113, Map 33 
Megleno-Rumanian 259-60, Map 66 
Megrelian see Mingrelian 
Mehek (Driafleisuma) 113, Map 33 
Meherrin Maps 1-2 
Mehri 281-2, 287, Map 70 
Meidob 283, Map 71 
Meifu [Li subvariety] 179 
Meithei (Manipuri) 167, 213, Map 51 
see also Manipuri 
Meje 322, Map 79 
Mejicanero see Nawa, Nawal 
Mek (Goliath) 99, 99, 114, Map 34 
Eastern 114, Map 34 
Western 114, Map 34 
Meka (Mekaa) 322, Maps 87, 91 
Mekambo 322, Map 87 
Mekan see Me'en 
Meke 322, Maps 87-88 
Mekeo 111, Map 32 
Mekitelyu [creole] 27 
Mekmek 113, Map 33 
Mekongka 120, Map 39 
Mekwei-Gresi-Kwansu 115, Map 34 
Mekwob 322, Map 87 
Mel/Sua-Gola Maps 106, 108 
Mel Maps 106-107 
Melanau 120, Map 41 
Melanesian, Meso- 109, Map 30 
Mele-Fila 107, Map 26 
Meluri see Ntenyi 
Mena 111, Map 32 
Mena see Turama-Kikorian, Turama- 
Omatian 
Menba see Monpa 
Menchum-Momo 306 
Menchum 306, Map 95 
Mendalam Kayan 120, Map 41 
Mende-Loma Maps 106-107 
Mende 309, 312, 322, 332, Map 106 
Mendi see Angal 
Meneka-Witoto (Minica) 69, 
Maps 12-14 
Mengen 109, Map 30 
Mengen 109, Map 30 
Menia see Muya 
Meninggo 115, Map 34 
Menka 322, Maps 93, 95 
Menominee 7, 18, Maps 1, 5, 8-9 
Menri' (Mendriq, Menraq) 178 
Mentawai 121, Map 43 
Menya 111, Map 32 
Menyén 73, Map 18 
Meo see Hmong 
Meoswar 115, Map 34 
Meqan see Me'en 
Meramera 109, Map 30 
Merei 108, Map 27 
Merey 322, Map 94 
Mergel [Evenki dialect] 192 
Merlav 108, Map 27 
Mern(y)ang 322, Map 96 
Meroitic 282 
Meru [Thagicu] 311, 322, 330, Maps 72, 
85-86 
see also Kwo [Kilimanjaro]


Mescalero-Chiricahua 8, 18, Maps 3, 8-9 
Mescalero 8 
Mese 322, Map 93 
Mesem 110, Map 32 
Meéseén 108, Map 27 
Meskwaki see Sauk-Fox 
Mesme 322, Maps 90, 94 
Meso-Melanesian 109, Map 30 
Meso-Philippine 119 
Mesopotamian Arabic 279 
Mesquakie see Sauk-Fox 
Messiriyya Map 72 
Meta’ 322, Maps 93, 95 
Metmata Map 73 
Metomka Kati see Kati 
Mewahang (Newahang) 165-6, 214, 
Map 51 
Mewari 213, Map 54 
Mewuk see Miwok 
Mezime 322, Map 87 
Mfinu 322, Maps 81, 83 
Mfumte 322, Map 95 
Mgbo 322, Map 101 
Mi'kmaq see Micmaq 
Mi-Gangam-Dye 322, Map 104 
Mi-Gangam 322, Maps 102-103 
Miami [New Guinea] see Tani 
Miami-Illinois [Algonquian] 7, 18, 
Maps 1, 5 
Miami 7, 18 
Mianmin 112, 114, Maps 33-34 
Miao-Yao (Hmong-Mien) 179, 183-4,
Maps 45-48 
Miao (Hmong) 179, 181, 183-4, 
Maps 45-48 
Miccosukee see Mikasuki 
Micha (Mielang, Miqie) 173, 175, Map 46 
Michif [mixed language] 7, 18, 26, 
Maps 8-9 
Michigamea Maps 2, 5 
Micmaq (Mi'kmaq, Mi'gmaq, Miikmaq)
7, 18, 26, Maps 1, 7-9
Micronesian 107-8, 115, Maps 28, 34 
Midsivindi 113, Map 33 
Mielang see Micha 
Migabac 110, Map 32 
Migili (Jijili, Lijili) 322, Maps 95-96 
Mi'gmaq see Micmaq 
Migueleno [Salinan] 22 
Miikmaq see Micmaq
Miini see Mwini 
Mije-Sokean (Mixe-Zoquean) 46, 50-1, 
Maps 10-11 
Mije (Mixe) 51, Maps 10-11 
Miji [Bugun-Khowa] see Dhammai 
Miji Kenda (Mijikenda) 302, 306, 309, 
311, 330, Maps 72, 85 
Mijikenda see Miji Kenda 
Mijuish Mishmi (Keman) 169, 175, 
Maps 45-46, 49 
Keman (Miju) Maps 45-46 
Miju ‘Mishmi’ (Geman, Kaman, 
Geman Deng) 169, 175, 213, 
Maps 45-46, 49 
Mikarew (Ariawia, Gumasi) 113, Map 33 
Mikaru see Dadibi 
Mikasuki (Hitchiti) 9, 18, Maps 2, 8-9 
Mikaya 322, Map 87 
Mikeyir (Shabo) 285, 289, Map 72 
Mikir see Arleng 
Mi'kmaq see Micmaq 
Milang 169 
Mileeri 284 
Millcayac see Milykayak 
Milo [Nicobarese dialect] 178 
Miltu 322, Maps 89-90 
Miluk (Coos) 8, 15, Map 4 
see also Hanis 
Milykayak (Millcayac) 71, Map 22 
Mimi (Mima) [Fur] 283 
Min (Hokkien) see Chinese (Sinitic) 
Mina see Ge 
Minahasan 120, Map 39 
Minangkabau 121, Maps 43-44 
Minanibai 111, Map 32 
Minbei [Northern Min dialect] Map 45 
see also Chinese (Sinitic), Southern, 
Hokkien (Min) 
Mindanao, East 119, Map 40 
Mindanao, South 119 
Mindi 116, Map 35 
Mindi 100 
Mindiri 113, Map 33 
Mindong [Eastern Min dialect] Map 45 
see also Chinese (Sinitic), Southern, 
Hokkien (Min) 
Mingabari 101, 117, Map 35 
Mingat see Myangad 
Mingin 116, Map 35 
Mingo see Seneca 
Mingrelian (Megrelian) 234-5, 238,
Map 59 
Minhe (San ch'uan) [Monguor dialect] 
191-2, Map 45 
Mini [Bantoid-Cross] 322, Map 101 
Mini [Trans-New Guinea] see Omati
Minianka 322, Maps 104-105, 107 
Minica see Meneka-Witoto 
Miniferi [Saho dialect] 286 
Minihiki-Rakahanga Map 26 
Minjim 112, Map 33 
Minnan [Southern Min dialect] Map 45 
see also Chinese (Sinitic), Southern, 
Hokkien (Min) 
Mintil [Batek dialect] 178 
Minungu 322, Map 84 
Minyong 169 


Minzhong [Central Min dialect] Map 45 
see also Chinese (Sinitic), Southern, 
Hokkien (Min) 
Miqie see Micha
Mirafab Maps 71-72 
Mirandese (Portuguese) 262, Map 68 
Mirembe 322, Map 78 
Miri [Kadugli-Krongo] 285 
Miri [Tibeto-Burman] see Hill Miri;
Mising 
Miriam 111, Map 32 
Miriwungic 116 
Miriwung 116, Map 35 
Mirniny 101, 117, Map 35 
Mirniny 101, 117, Map 35 
Mirnong 101, 117, Map 35 
Mirripú see Mukuchi
Misegian 113, Map 33 
Mishing see Mising 
‘Mishmi’ Maps 49, 51 
see also Digarish; Mijuish 
Misima 110, Map 31 
Mising (Miri, Mishing) 169, 213, Map 51 
Miskitu (Mosquito) 61, 65, Maps 12-13 
Misong (Missong) 322, Map 95 
Mississauga [Ojibwe dialect] 14 
Missouri River Siouan 9, Maps 5, 8-9 
Missouria see Chiwere 
Mistekan 52-3 
Misteko (Mixteco) 46, 49-50, 53, 
Maps 10-11 
Misumalpa 61, 65, Map 10 
Mitchif see Michif 
Mithaka 101, 117, Map 35 
Mituku 322, Maps 80-81 
Miutini 323, Map 85 
Miwa 116, Map 35 
Miwok 8, 18, Map 4 
Eastern 
Sierra Miwok 
Central 8, 14, Maps 4, 9 
Northern 8, 14, Maps 4, 9 
Southern 8, 14, Maps 4, 9 
Western 25 
Coast 8, 12-13, 25, Map 4 
Lake 8, 25, Maps 4, 8-9 
Mixe-Zoquean see Mije-Sokean 
Mixteco see Misteko 
Miya 323, Map 97 
Miyak 113, Map 33 
Miyan 101, 117, Map 35 
Miyengen 323, Map 93 
Miyobe 323, Maps 102-104 
Miza [Moru dialect] 284 
Mizo (Lushai, Hualngo) 168, 175, 
213-15, Maps 49, 51 
M'kang (Kaang) 168 
Mlabri 176, 178, Map 47 
Mlomp 323, Map 108 
Mmaala 323, Map 92 
Mmani 323, Map 106 
Mmem 323, Map 95 
Mmetho 323, Map 75 
Mnong (Phnong) 178 
Central 177, Map 48 
Eastern 177, Map 48 
Southern 177, Map 48 
Mo'ang 174-5, Map 47 
Mo [Gur] see Deg 
Mo [Kam-Sui] see Mak 
Moba (Bimoba) [Gur] 323, 
Maps 102-104 
Mobe (Guaymi, Move, Ngobere) 64, 
Maps 12-13 
Mober 323, Map 99 
Mobile Map 2 
Mobilian Jargon 26 
Mocha (Moéa) 285, 288, 289, Map 72 
Mochika 69, Maps 14, 20 
Moché (Motozintleco) 52 
Mocovi see Mokovi 
Modang 120 
Modang 120, Map 41 
Modea [Gumuz dialect] 285 
Modele 323, Map 95 
Modo-Baka Map 79 
Modoc see Klamath-Modoc 
Modole 118, Map 37 
Modra [Todrah dialect] 177 
Moere 112, Map 33 
Moewehafen 109, Map 30 
Mofu 
North 323, Map 94 
South Map 94 
Mogareb [Nara dialect] 290 
Mogés see Wambiano 
Mogh see Arakanese 
Moghamo (Widekum) 323, Maps 93, 95 
Moghol (Moghul) 192 
Mogododo see Yaaku 
Mohave (Mojave) 8, 18, Maps 3-5, 8-9 
Mohawk 9, 18, Maps 1, 8-9 
Mohegan-Pequot 7, 18, Map 1 
Mohere see Mwali 
Mohican 7 
Moho (Mojo) 67, Map 20 
Moi 115, 118, Maps 34, 37 
Mojave see Mohave 
Mojo see Moho 
Mojung see Chang 
Mok 176, 178, Map 47 
Mok-Aria 109, Map 30 
Moken 183-4, Map 48 
Mokerang 109, Map 30 
Mokiba 323, Maps 83, 87 
Mokilese 108, Map 28 
Moklen 183-4, Map 48 


Mokochi see Mukuchi 
Mokovi (Mocovi) 72, Maps 22-23 
Mokpwe 323, Maps 92-93 
Moksha Mordvin see Mordvin 
Mokulu Maps 89-90 
Mokulu 323, Maps 89-90 
Molala 8, 18, Map 4 
Molbog 119, Map 40 
Moldavan see Romanian; see also 
Moldovan 
Moldovan 234-7 
Moliba 323, Maps 82, 87 
Molima 110, Map 31 
Molo 284, Map 72 
Molof 114, Map 34 
Molokwo 323, Map 94 
Momare 110, Map 32 
Mombasa 323, Map 85 
Mombo 323, Map 104 
Mombum 114 
Mombum 99, 99, 114, Map 34 
Momo 306, Maps 93, 95 
‘Momo’ [now Mundani] 306 
‘West Momo’ see Momo
Momzombo Map 82 
Mon Karen 174 
Mon-Khmer 175, Maps 45-49, 51-52 
Monic 175, 177-8 
Mon 171, 177-8, 181, Maps 47-49 
Mon Ro/Mon Rao 177
Northern 176 
Monachi see Mono 
Mondé (Sanamai) 75, Maps 17, 21 
Mondropolon 109, Map 30 
Mong see Hmong 
Mongo [Daju dialect] 284 
Mongo-Nkundo 310, 330, Maps 79-83, 87 
Mongo 310-11, 313-16, 323, 
Maps 79-83, 87 
as lingua franca 330 
Mongol-Langam [Papuan: Sepik] 113 
Mongol 113, Map 33 
Mongolic 187, 238
Mongolian 188-90, Maps 45, 50, 55, 
58-59 
Inner Mongolian 191, Maps 45, 50 
Mongol(ian) 187-91, 234, Map 45 
see also Bargu-Buriat; Oirats 
Mongondow-Gorontalo 109, 120, Map 39 
Mongondow 120, Map 39 
Mongsen [Ao dialect] 167 
Monguor (Tsagaan) 191, Map 45 
Minhe Monguor (Mangghuer, White 
Mongol) 192 
Tu (Tukun, Tuhuija) 192 
Moni (Mwa, Mwan) [Mande] 323, 
Map 105 
Moni [Trans-New Guinea] 114, Map 34 
Monic see Mon-Khmer: Monic 
Monja 323, Map 80 
Monnepwa 174 
Mono [Gur-Adamawa] 323, Map 91 
Mono [Ubangi] 323, Maps 82, 89 
Mono [Uto-Aztecan] 8, 18-19, Maps 4, 
8-9 
Mono-Alu [Austronesian] 108, Map 29 
Monogoy 323, Maps 90, 94 
Monom 177-8, Map 48 
Monpa (Menba, Tshangla) 165, 213, 
Maps 45, 51 
Eastern (Motuo Menba) 165, 175, 
Maps 45, 51 
Western (Central, Cona Menba) 
164-5, 175, Map 45 
Monqui-Diditi (Monqui) 9, 13, 15, 
Map 3 
Montagnais-Naskapi 7 
Montagnais (Innu, Innu-Aimun) 7, 13, 
15, 19, Maps 1, 7-9 
Montana Salish see Spokane-Kalispel- 
Flathead 
Montol 323, Maps 96, 98 
Monumbo 113 
Monumbo 113, Map 33 
Monyton see Tutelo 
Monzombo 323, 330, Maps 82, 87 
Moore see More [Gur] 
Moose Cree 7, 19 
Mopan 52, Maps 10-11 
Mopo [Burun dialect] 284 
Mopwa (Palaychi) 174-5, Map 49 
Mor [Austronesian: Cenderawasih Bay]
115, Map 34 
Mor [Papuan: Trans-New Guinea] 99, 99, 
114, Map 34 
Morafa 112, Map 33 
Moraid 115, Map 34 
Moraori 99, 114, Map 34 
Morawa 110, Map 31 
Mordvin (Volga) 231, 234-6, 238, Map 58 
Erzya 234n., 238, Map 58 
Moksha 234n., 238, Map 58 
More [Chapakuran] see Itene
More (Moore) [Gur] 311, 323, 332, 
Maps 102-104 
Morehead-Upper Maro 99, 111, 115, 
Maps 32, 34 
Lower Morehead (Peremka) 111, 
Map 32 
Upper Morehead (Rouku, Kamindjo, 
Tjokwai, Tokwasa) 111, Map 32 
Moresada 111, Map 33 
Mori 120, Map 39 
Morigi (Turama River Kiwai) 111, Map 32 
Morike (Mayoruna) 62, 67 
Morin Davaa [Daur subdialect] 192 
Moriori 107, Map 26 


Morisyen 302 
Moroccan 281 
Morouas 108, Map 27 
Mortlockese 108, Map 28 
Moru-Logbara-Madi 283 
Moru-Madi 284, 307, Maps 72, 79 
Moru 323, Maps 72, 79
Moru-Madi 284, 323, Map 79 
Moruba 323, Maps 89-90 
Moruwad'i [Moru dialect] 284 
Morwap 115, Map 34 
Mos see Maniq 
Mosa 323, Map 85 
Mosetén-Chon 71, Maps 20-21, 23 
Mosetén 71, Maps 20-21 
Moshang see Tangsa 
Moshi see Ubu-Mbokomu-Moshi-Kirua- 
Kilema-Mwika 
Mosi-Gurma Ful 323, Maps 102, 104 
Mosimo 112, Map 33 
Mosiya see Bussa 
Moso [Bantu] 323, Map 80 
Moso (Na) [Tibeto-Burman] 170, 175, 
192, Map 46 
Mosquito see Miskitu 
Mota 102, 106, 108, Map 27 
Motembo 323, Maps 82, 87 
Motilón see Bari
Motlav 108, Map 27 
Motozintleco see Moché 
Motu 105, 111, Map 32 
Motuna 99 
Motuo Menba see Monpa, Eastern 
Mountain Koiari Map 31 
Mountain Ok 112, 114, Maps 33-34 
Mountain Pima 54 
Mountain Slavey 8, 11, 14, 19, 23, 
Maps 7-8 
Move see Mobe 
Movima 74, Maps 17, 20-21 
Mowea 323, Map 82 
Moy 323, Maps 81, 83 
Moyoko 323, Map 83 
Moyon see Mayol 
Mozi (Southern Tujia) 170, 175, Map 45 
Mpada 323, Maps 90, 99 
Mpakwithi 101 
Mpalityanh 101, 117, Map 35 
Mpama 323, Map 81 
Mpee [Bantu] 323, Map 81 
Mpese 323, Map 84 
Mpezeni 329 
Mpi [Burmese: Ngwi] 173, 175, Map 47 
Mpiem (Mpiemo) 323, Map 87 
Mpo 323, Map 87 
Mpoman 323, Map 87 
Mpondo 323, Map 74 
Mpongwe 323, Map 88 
Mpopon 323, Map 87 
Mpoto 323, 329, Maps 75-76 
Mpotovoro 108, Map 27 
Mputu 323, Maps 78, 80-81 
Mrabri 176 
Mro 215 
Mru 167, 171, Maps 49, 51 
Mru 167, 171, 175, Maps 49, 51 
Mrung see Kokborok 
Mser 323, Maps 90, 99 
Mtang'ata 323, Maps 76, 85 
Mubi-Toram Map 89 
Muchú (Tuzanteco) 52
Muda 174-5, Map 46 
Mudburra 100, 116, Map 35 
Muellama see Muelyama 
Muelyama (Muellama) 63, Map 12 
Mufwa [Burun dialect] 284 
Mufwian see Arapesh, Southern 
Mugali see Lambichong 
Mughaja [Burun dialect] 284 
Mugil (Saker) 112, Map 33 
Muguji 284 
Muhiang see Arapesh, Southern 
Muhisu 172-3, 175, Map 46 
Muisca (Muiska) see Chibcha- 
Misumalpan, Chibcha 
Muji 172, 174-5, Map 46 
Mujum 323, Map 93 
Muk-Thang 102, 118, Map 35 
Mukalla 281 
Mukawa see Are 
Mukuchi (Mirripú, Maripú) 63, Map 12
Mukulu 323, Map 77 
Mulaha 111, Map 32 
Mulam (Mulao) [Kam-Tai] 179, 184, 
Map 45 
Mulao (Ayo) [Kadai] 182, 184, Map 45 
Muller-Schwaner ‘Punan’ 120, Map 41
Mulung see Konyak 
Muluriji [Kuku Yalanji dialect] 101, 117, 
Map 35 
Mumbake 323, Map 98 
Mumeng 111, Map 32 
Mumoni 323, Map 85 
Mumuye-Yendang Maps 91, 98 
Mumuye 323, Maps 91, 98 
Mun [Omotic] see Mursi 
Muna-Buton 120, Map 39 
Muna 120, Map 39 
Munda 175, 213-15, 217, Map 54 
North 217, Map 51 
South 217, Map 51 
Mundang 323, Maps 90-91, 94 
Mundani 306, Map 93 
Mundani 323, Map 93 
Mundari-Ho 217, Maps 51-52 
Mundari 213, 217, Maps 51-52 
Mundat 323, Map 96 
Mundokuma see Biwat
Mundu 323, Map 79 
Mundugumor see Biwat 
Mundum 323, Map 95 
Munduruku 75, Maps 16-19 
Munga 323, Map 98 
Mungaka 323, Maps 93, 95 
Munggui 115, Map 34 
Mungkan see Wik Mungkan 
Munichi (Muniche) 68, Maps 14-15 
Munit 112, Map 33 
Muniwara (Tumara) 113 
Munji 211, 216, Map 53 
Munjuk Maps 90, 94 
Munjuk 323, Maps 90, 94 
Munkip 110, Map 32 
Munsee (Munsi, Canadian Delaware) 7, 
17, 19, 24, Maps 1, 8 
Munukutuba [Bantu: Kongo] 311 
Munyo 323, Map 85 
Muong 176, 178, Map 47 
Mura-Matanawian 71, Maps 13, 15-18, 
20 
Muran 71, Maps 15-18 
Mura 71, Maps 13, 15-18 
Murang'a 323, Map 86 
Murik Kayan 120, Map 41 
Murik (Nor) 113, Map 33 
Muringam 102, 118, Map 35 
Murle 284 
Murmi see Tamang 
Murray, Lower 102, 118, Map 35 
Murrinh-patha 107, 116, Maps 35-36 
Mursi (Mun) 284, 289 
Murui-Witoto (Huitoto) 69, Maps 12-15, 
17 
Murulle 323, Map 85 
Murunitja 101, 117, Map 35 
Murupi 112, Map 33 
Murutic 120, Map 41 
Murut Map 41 
Muruwari 102, 117, Map 35 
Musa see Musian 
Musak 111, Map 33 
Musar 112, Map 33 
Musey 323, Maps 90, 94 
Musgu 323, Maps 90, 94 
Mushuaau Innu (Eastern Naskapi) see 
Naskapi 
Mushungulu (Kimushungulu) 285, 290 
Musian (Musa) 113, Map 33 
Muskogean 7, Maps 1, 8-9 
Muskogee (Creek) 9, 13, 26, Maps 2, 
8-9 
Muslung 166 
Musom 111, Map 32 
Mutayr Map 70 
Muthambi 323, Map 85 
Mutonia see Wancho 
Mutsun see Costanoan 
Mutu [Austronesian] 113, Maps 30, 33 
Mutú [Timótean] 63
Mutum (Paswan) [Trans-New Guinea]
111, Map 32 
Mutyanthi [Uradhi dialect] 101, 117, 
Map 35 
Muumingan [Ulanqab subdialect] 191 
Müün see Kcho
Muungo 323, Map 92 
Muure see Peere 
Muxir 288, Map 72 
Muya (Manyag, Menia) 170, 175, 
Map 45 
Muyang-Ma'da (A) [S. Mafa-Matakam 
subgroup] Map 94 
Muyang 323, Map 94 
Muyonzi 323, Map 83 
Muyuw 110, Map 31 
Mvano 323, Map 95 
Mvedere 323, Map 89 
Mvele 323, Maps 87, 92-93 
Mveny 323, Maps 87-88 
Mvtwang 169, 175, Map 49 
Mvuba 323, Map 79 
Mvumbo 323, Maps 88, 92 
Mvumbu (Wuumu) 323, Maps 83, 88 
Mwa-Gan Map 105 
Mwa (Mwan) see Moni 
Mwahed 323, Map 93 
Mwali (Mohere) 323, Map 75 
Mwaneko 323, Map 93 
Mwani 323, 329, Maps 75-76 
Mwatebu 110, Map 31 
Mwele 323, Maps 83, 88 
Mwenshi 323, Maps 77-78 
Mwenyi-Simaa-Mbumi-Imilangu 323, 
Map 77 
Mwera 323, Maps 75-76 
Mweri see Sumbwa 
Mwika see Ubu-Mbokomu-Moshi-Kirua- 
Kilema-Mwika 
Mwimbi 323, Map 85 
Mwina 323, Map 85 
Mwini (Miini, Barawa) [Swahilioid] 323, 
330 
Myaing see Hmong 
Myangad (Mingat) 188, Map 45 
Myanmar see Burmese 
Mzab Map 73 
Mzieme 167-8 


Na [Nilo-Saharan] see Denje
Na [Tibeto-Burman] 169-70, 175
Na see Moso
Na-Dene 8-10, Maps 2-4, 5-9
Na-Togo 305, 332, Maps 102-104
N'aym 279, Map 70
Naadh see Naath
Naath (Naadh, Nuer) 284-5, 323, 
Maps 72, 79 
Naati 108, Map 27 
Nabak 110, Map 32 
Nabarlgu 102, 118, Map 35 
Nabi 115, Map 34 
Nabit 323, Maps 102-104 
Nabulu see Abulu 
Nachering (Nachhiring) 165-6, 214, 
Map 51 
sNadkarrtse 164 
Nadéb 67, Map 13 
Nafaanra Map 105 
Nafaanra 323, Maps 103, 105 
Nafara 323, Maps 104-105 
Nafri 115, Map 34 
Naga Map 51 
Northern (Konyak) 166-7, Map 51 
Naga [Omotic] 289 
Nagamese (Naga Pidgin) 214, 217, 
Map 51 
and Nepali Map 51 
Nagatman 113, Map 33 
Nagchu 164 
Nage-Keo 118, Map 38 
Nagovisi 99, 109, Map 29 
Nagumi 323, Map 91 
Naha'ai 108, Map 27 
Nahara 323, Map 75 
Nahu 110, 112, Maps 32-33 
Nahua see Nawa 
Nahuatl, Durango Maps 8-9 
Naiki 217, Maps 52, 54 
Naikri 217, Map 52 
Naiman [Ju Ud subdialect] 191 
Naisu 175, Map 46 
Western (Red Yi) 172 
Najdi 279 
Naka'ela 118, Map 37 
Naka [Kresh dialect] 284 
Nakaku 100, 116, Map 35 
Nakama 110, Map 32 
Nakanai 109, Map 30 
Nakanmanga 108, Map 27 
Nake 112, Map 33 
Nakh-Daghestanian see Caucasian, North, 
East 
Nakh [subgroup] Map 59 
Naki 323, Map 95 
Nakkara 116, Map 35 
Nakon see Assiniboine 
Nakrehé 72, Map 18 
Nalca 114, Map 34 
Nali [E. Admiralty] 109, Map 30 
Nalik [New Ireland] 109, Map 30 
Nalu 332, Maps 107-108 
Nalu 323, Maps 107-108 
Naluo (Naruo, Laluo) 173, 175, Map 46 
Nama [Khoekhoe] Map 74 
Nama-Damara [Khoekhoe] 307, 323, 329, 
Maps 74, 84 
Namakir 108, Map 27 
Naman 108, Map 27 
Namau see Purari 
Nambe see Tewa 
Nambi 113, Map 33 
Nambikuaran 74, Maps 16-21 
Nambikuara (Nambikwara) 74, 
Maps 16-21 
Nambu 111 
Nambu 111, Map 32 
Namia 113, Map 33 
Namonuito 108, Map 28 
Namsangia see Nocte 
Namumi 111, Map 32 
Namuya see Kesawai 
Namuyi 170, 175, Maps 45-46 
Nanai see Hezhe 
Nanai [Tungus subgroup] 187, 238 
Nanay (Gold) see Hezhe 
Nancere-Gabri Map 90 
Nancowry [Nicobarese dialect] 178, 217 
Nande 310-11, 323, Maps 79-80 
Nandeva see Chiripá-Nyandeva
Nandi 323, Map 86 
Nandu-Tari Map 96 
Nandu-Tari 323, Map 96 
Nane 109, Map 30 
Nangam 323, Map 104 
Nangchen 164 
Nanggu 109, Map 29 
Nankanse 323, Map 104 
Nankarti 101, 117, Map 35 
Nankina 110, 112, Maps 32-33 
Nanomam 77 
Nanticoke-Conoy 7, 19, Maps 1-2 
Nanuni 323, Maps 102-104 
Nao Gelao, Nao Klao see Baonao 
Nao (Nayi) [Dizoid] 288, 289, Map 72 
Naolan Map 3 
Napu 120, Map 39 
Nar [Tibetan] 164, Map 51 
Nara [Nilotic] 283, 290, Map 72
Nara (Nara bana) 285, Map 72 
Naraguta 323, Maps 96-97 
Narak 112, Map 33 
Narangga 101, 117, Map 35 
Narango 108, Map 27 
Narau 114, Map 34 
Nari Nari 102, 118, Map 35 
Naro (Nharo, Nharon) [Khoisan] 323, 
Map 74 
Narom 120, Map 41 
Narragansett [Massachusett dialect] 18 
Narrow Bantu see Bantu 
Narrow Grass see Grassfields 


Naruma see Kiwo-Naruma-Vunjo 
Naruo see Naluo 
Nasarian 108, Map 27 
Nasioi 99, 109, Map 29 
Naskapi 7, 13, 19, Maps 7-9 
Eastern 7, 13-14, 16, 19, 25, Maps 7-9 
Western 7, 14, 19, 25, Maps 7-8 
Naso [Meso-America] see Teribe 
Nasoid [Tibeto-Burman] 172-3, 175 
Nass-Gitksan 8 
Nisga'a (Nisgha, Nisqa'a) 8, 19, 
Maps 6-8 
Nasu (Eastern Yi) 172, 175, Map 46 
Central (Black Yi) 172 
Nasu [Nisu dialect] 172 
Nasvang 108, Map 27 
Nata 323, Map 86 
Natchez 7, 9, 19, Map 2
Natioro 323, Maps 104-105 
Natu 73, Map 18 
Naudem 323, Maps 102-104 
Naueti 119, Map 38 
Nauk-o see Khyamnyungan 
Naukan Yupik see East Cape Yupik 
Nauna 109, Map 30 
Nauruan 108, Map 28 
Navajo (Navaho) 8, 19, 27, Maps 3, 5, 8-9 
Navava 108, Map 27 
Navut 108, Map 27 
Nawa (Nahua, Nahuatl) 45, 49-50, 53-4, 
62, Maps 10-11 
Nawal (Mejicanero) 53 
Northern 49-50, Maps 10-11 
Nawathinehena (Southern Arapaho) 10, 
19, Maps 2, 5 
Nawiki 
Eastern 66 
Western 65-6 
Nawo 101, 117, Map 35 
Nawuri 323, Maps 102-103 
Naxi 169-70, 175, Map 46 
Nayi see Nao [Dizoid] 
N'aym 279, Map 70 
Nazan (Nazhan, Naza) 173, 175, Map 46 
Ncane 323, Map 95
Nchema see Gurma 
Nda'nda' 323, Map 93 
Ndai 323, Map 91 
Ndaka 323, Map 79 
Ndali 323, Map 76 
Ndam 323, Map 90 
Ndamba 323, Map 76 
Ndao [Bima-Sumba] 118, Map 38 
Ndasa 323, Maps 83, 88 
Ndau [Bantu] 323, Maps 74-75 
Ndau (Pendau) [Central Sulawesi] 120, 
Map 39 
Nde [Ekoid subgroup] Maps 93, 95 
Nde 323, Maps 93, 95 
Ndebele 312, 323, 329, 332-3, Maps 74, 
77 
Ndembu 323, Map 84 
see also Lunda-Ndembu 
Ndemli 306, Maps 92-93 
Ndemli 323, Maps 92-93 
Ndendeule 323, Maps 75-76 
Ndengense see Ndengese 
Ndengereko 323, Map 76 
Ndengese (Dengese, Ndengense) 323, 
330, Maps 80-81 
Ndera [Bantu: Pokomo] 323, Map 85 
Ndera [Gur-Adamawa] 323, Maps 91, 98 
Ndi 323, Map 89 
Ndia 323, Map 86 
Ndibu 323, Map 84 
Ndienga 323, Map 83 
Ndingi 323, Map 83 
Ndjuka see Aukaan 
Ndo 323, Map 79 
Ndobo 323, Maps 82, 87 
Ndoe 323, Maps 93, 95 
Ndokpa 323, Maps 82, 89 
Ndolo 323, Maps 82, 87 
Ndom 114, Map 34 
Ndombe 323, Map 84 
Ndonde (Mawanda) 324, Maps 75-76 
Ndonga 324, Map 84 
Ndongo-Binza 324, Maps 82, 87 
Ndongona 324, Map 84 
Ndoro 324, Map 95 
Ndra'ngith 101, 117, Map 35 
Ndri 324, Maps 82, 89 
Ndu 113, Map 33
Nduga 114, Map 34 
Nduindui 108, Map 27 
Nduke 109, Map 29 
Ndura 324, Map 85 
Ndut 324, Map 108 
Nduuvum 324, Map 91 
Nedebang 119, Map 38 
Nee 324, Map 93 
Negd 281 
Negev Bedouin Map 70 
Negidal 234, 238 
Negrito, East Cayagan (Dupaninan Agta) 
119 


Négue (Louisiana Creole) 26 
Nehan-North Bougainville 108, Map 29 
Nehan 108, Map 29
Nek [Trans-New Guinea: Erap] 110,
Map 32
Nekgini 112, Map 33
Neko [Trans-New Guinea: Gusapmot]
112, Map 33
Neku [Austronesian] 108, Map 27
Nembao 109, Map 29
Nembe 324, Map 101
Nemi 108, Map 27
Nenaya (Nineia) [Austronesian: Vitiaz] 
111, 113, Maps 32-33 
Nenets (Yurak Samoyed) 234, 238, 
Maps 55-56, 58 
Nengone 108, Map 27 
Neo-Aramaic see Assyrian 
Neo-Melanesian see Tok Pisin 
Nepal Sign see Sanketic 
Nepala bhasa see Newari 
Nepali (Gorkhali) 163-4, 212-13, 217, 
Maps 51, 54 
abroad 181, 214-15 
Nera see Nara
Neré 108, Map 27 
Nete 112, Map 33 
Netherlandic/German 259 
Netherlandic (Dutch, Nederlands) 
259, 262, Maps 64-65, 69 
abroad 26-7, 234 
in Australia Map 36 
Berbice Creole Map 13 
Flemish 262 
Neusiok Maps 1-2 
Neutral Map 1 
Neve'ei 108, Map 27 
Névome 54 
Nevwervwer 108, Map 27 
New Britain 
East 99, 109, Map 30 
South-West 109, Map 30 
see also Yele-West New Britain 
New Caledonia and Loyalty Is 108 
New Caledonian Pidgin French 106, 
Map 25 
New England Algonquian, Southern 7, 10 
New Georgia-Santa Ysabel 109, Map 29 
New Georgia 109, Map 29 
New Guinea 309 
North 109, 113 
North-West 114-15, 118, Maps 34, 37 
West 98 
New Guinea Pidgin see Tok Pisin 
New Ireland 109, Map 30 
South New Ireland/NW Solomons 109, 
Map 30 
New River Shasta 19, Map 4 
New South Wales, Central 102, Map 35 
New Xiang see Chinese (Sinitic), Central 
Newahang see Mewahang 
Newari (Nepala bhasa) 163, 165, 214, 
Map 51 
Neyo 324, Map 105 
Nez Perce 8, 19, Maps 4-6, 8-9 
Ng'men see Kcho 
Ng'wande 324, Maps 91, 95 
Ngaanyatjarra 100, 116, Map 35 
Ngaatatjarra 100, 116, Map 35 
Ngabu 324, Maps 79, 89 
Ngada 118, Map 38 
Ngadjuri 101, 117, Map 35 
Ngadyumaya 101, 117, Map 35 
Ngagu 102, 118, Map 35 
mNgahris see Ngari 
Ngaimbom see Lilau 
Ngaing 112, Map 33 
Ngaja 324, Map 89 
Ngaju [Barito] see Kapuas 
Ngaju Dayak see Kapuas, Ngaju Dayak 
Ngala [Bantu] 324, Map 84 
Ngala (Kara, Sogap) [Sepik] 113, Map 33 
Ngala (Matu) [Kuki-Chin] 168 
Ngalakan 116, Map 35 
Ngali-Nduga 114 
North Ngalik 114, Map 34 
South Ngalik 114, Map 34 
Ngaliwuru 116, Map 35 
Ngaliya 101, 117, Map 35 
Ngalum 112, 114, Maps 33-34 
Ngam (Ngama) 324, Maps 89-90 
Ngamambo 324, Maps 93, 95 
Ngambaa 102, 118, Map 35 
Ngambay 324, 331, Maps 90-91 
Ngamini 101, 117, Map 35 
Ngamo 324, Maps 98, 100 
Nganasan (Tavgi Samoyed) 234, 238, 
Maps 55-56 
Ngandi [Australian] 116, Map 35
Ngando [Bantu: Mongo-Nkundo C70] 
324, Maps 79, 80 
Ngando [Bantu: Ngando C10] Maps 82, 
87 
Ngando Maps 82, 87 
Ngangwela 324, Maps 74, 84 
Nganhcara see Kugu Nganhcara 
Nganjera 324, Map 84 
Nganyayawana 102, 118, Map 35 
Ngao 324, Maps 89-90 
Ngapu 324, Map 89 
Ngarduk 116, Map 35 
Ngare 324, Maps 81, 83, 87 
Ngari (mNgahris) [Tibetan subgroup]
164, 175 
Ngarigu 102, 118, Map 35 
Ngarinyeri 102, 118, Map 35 
Ngarinyman 100, 116, Map 35 
Ngarla 100, 116, Map 35 
Ngarlawangka 100, 116, Map 35 
Ngarluma 100, 116, Map 35 
Ngarndji 116, Map 35 
Ngarti 100, 116, Map 35 
Ngatana 324, Map 85 
Ngawn 168 
Ngawun 101, 117, Map 35 
Ngayarta 100, 116, Map 35 
Ngayawung 102, 118, Map 35 
Ngaygungu 101, 117, Map 35


Ngazija 324, Map 75 
Ngbaka 311, Map 79
Ngbaka-Baka 324, Map 87 
Ngbaka 324, Map 87 
‘Baka 330 
Ngbaka-Ma'bo 324, Maps 82, 87, 89 
Ngbaka-Manza 324, Maps 82, 87 
Ngbaka-Minangende 324, 330, Maps 82, 
87 
Ngbandi Maps 79, 82, 87, 89
Ngbandi 311, 324, Maps 79, 87, 89 
Ngbundu 324, Map 82 
Ngeh see Klor 
Ngele 324, Map 80 
Ngemba Maps 93, 95 
Ngende 324, Map 81 
Ngenkikurrunggur 116, Map 35 
Ngenkityemeri 116, Map 35 
Ngero-Vitiaz 109, 111, 113, Maps 30, 
32-33 
Ngero 109, 111, 113, Maps 30, 32-33 
Nggwahyi see Ngwaphyi 
Nghwele (Nhwele) 323, Maps 76, 85 
Ngie 324, Maps 93, 95 
Ngindo 324, Maps 75-76 
Ngiri see Loi (Ngiri) 
Ngishe 324, Map 95 
Ngiti (Dru, Druna) 324, Map 79 
Ngizim 324, Map 100 
Ngkoth 101, 117, Map 35 
Ngnai 183-4, Map 45 
Ngobere see Mobe 
Ngochang see Achang 
Ngola 324, Map 84 
Ngom 324 
North 324, Map 87 
South 327, Maps 83, 88 
Ngoma 324, Map 77 
Ngomba [Bantoid: Narrow Grass] 324, 
Map 93 
Ngombale 324, Map 93 
Ngombe [Bantu: N.W.] Maps 79, 82-83, 
87 
Ngombe 324, Maps 82, 87 
as lingua franca 330 
Ngombe-Motembo 324, Maps 79, 82, 
87 
Ngondi 324, Map 87 
Ngong 324, Map 91 
Ngongo 324, Maps 80-81 
Ngoni 302, 324, Maps 75-76 
Ngoni-Senga 324, 329, Maps 75-77 
Ngoro [Bantoid: Mambiloid] 324, 
Maps 92-93 
Ngoro [Bantu: Lundu-Balong] 324, 
Map 93 
Ngqika 324, Map 74
Ngubi 324, Map 88 
Nguburindi 116, Map 35 
Ngulu 324, Maps 76, 85 
Ngulupulu 101, 117, Map 35 
Ngumbo 324, Map 77 
Ngumbur see Ngurmbur 
Ngumpin-Yapa 100, 116, Map 35 
Ngumpin 100, 116, Map 35 
Ngunawal 102, 118, Map 35 
Ngunbudj 116, Map 35 
Ngundi 330, Maps 83, 87
Ngungwel 324, Maps 81, 83 
Ngungwoni 324, Map 83 
Nguni 302, 329, Maps 74-75, 77 
Nguon 176, 178, Map 47 
Ngurawarla 101, 117, Map 35 
Nguri 101, 117, Map 35 
Ngurimi 324, Map 86 
Ngurlu 101, 117, Map 35 
Ngurmbur 116, Map 35 
Ngusa Map 73
Ngwa 324, Map 93 
Ngwaketse 324, Map 74 
Ngwane 324, Map 91
Ngwaphyi (Nggwahyi) 324, Map 94 
Ngwato 324, Map 74 
Ngwe 324, Map 93
Ngweme 324, Map 87 
Ngweshe [Bantu: Shi-Havu] 324, Map 80 
Ngweshe (Gvoko) [Chadic] 324, Map 94 
Ngwi [Bantu] 324, Map 81 
Ngwi-Loloish-Yi [Tibeto-Burman]
Map 46 
Ngwi 171-2, 175, Maps 45-49, 51 
Ngwo 324, Map 95
Ngwoi 324, Maps 100-101 
Ngyeboong 324, Map 93 
Nhanda 100, 116, Map 35 
Nhang see Yay 
Nhanyatjarra 100, 116, Map 35 
Nharo, Nharon see Naro 
Nho 324, Maps 92-93 
Nhuwala 100, 116, Map 35 
Nhwele see Nghwele 
Ni see Sani 
Nias 121, Map 43 
Nicaraos 52 
Nicobarese 175-6, 178, 213, 217, Map 52 
Great Nicobarese 217, Map 52 
Nicola 8, 19, Maps 6-7 
Niellim 324, Maps 89-90 
Niende 324, Map 102 
Niesu [Nisu dialect] 172 
Niger-Congo 304, 306-7, 309, 330, 
Maps 52, 70-108 
Nigerian, Central 305-6, 331, 
Maps 95-97 
see also Atlantic, Benue-Congo; 
Gur-Adamawa; Kru; Kwa; Mande 
Niger-Kordofanian Maps 70-72 
Nigerian Pidgin English 331
Nigi 324, Map 93 
Nihali 211, Maps 52, 54 
Nii 110, 112, Maps 32-33 
Niinati see Kati, Northern 
Nila-Serua 119, Map 38 
Nile [E. Arabic subgroup] Map 71 
Nilo-Saharan 46, 282-5, 304, 306-7, 330, 
Maps 76, 79-80, 82, 89-91, 93 
Nilotic 283, 289, 307, Maps 71-72, 76, 
79 
Central (Eastern) 284, 289 
East 283, 307, Maps 79, 85-86 
East-Central Map 72 
Northern (Western) 284, 289, 
Map 72 
South 283, 307, 330, Maps 72, 86 
West 283, 307, 330, Maps 72, 79, 86 
Saharan Maps 71-73, 90, 94, 98-100 
see also Komuz; Maba Fur (Maban); 
Songhay; Sudanic 
Niluo [Ngwi] see Aluo
Nimadi Map 52 
Nimanbur 116, Map 35 
Nimbari Map 91 
Nimbari 324, Map 91 
Nimboran 99, 108, 115, Map 34 
Nimboran 115, Map 34 
Nimi [Trans-New Guinea: Finisterre] 110, 
Map 32 
Nimo [Kwomtari-Leftmay] 113, Map 33 
Nimoa [Austronesian: Papuan Tip] 110, 
Map 31 
Ninam see Yanam-Ninam 
Ninde [Central Vanuatu] 108, Map 27 
Nindem [Central Nigerian] 324, Maps 
96-97 
Nineia see Nenaya 
Ninggera 112, Map 33 
Ninggirum 112, 114, Maps 33-34 
Ningil 113, Map 33 
Ninzam (Ninzo) 324, Map 96 
Ninzo see Ninzam 
Nioleien 108, Map 27 
Nipissing see Algonquian, Southern 
Algonquin 
Nipsan 114, Map 34 
Nisa 115, Map 34 
Nisenan 8, 19, Map 4 
see also Maiduan 
Nisgha see Nass-Gitksan 
Nishi (Nishang, Nyishang) 169, 175, 
Map 51 
Nishnaabemwin 20 
see also Ojibwe, Central Southern; 
Ojibwe, Eastern; Ottawa 
Nisqa'a see Nass-Gitksan
Nissi 213 
Nisu (Southern Yi, Lolo) 172, 175, 
Maps 46-47 
Nisvai 108, Map 27 
Nithi 324, Maps 72, 85-86 
Nitinaht (Ditidaht) 8, 19, Maps 6, 8-9 
as Nuuchahnulth 19 
Nitu (Daai) 168 
Niuafo'ou 107, Map 26 
Niue 107, Map 26 
Nivaclé see Chulupi 
Nivkh (Gilyak) 231, 234, 239, Map 55 
Njabi Maps 83, 88 
Njabi see Nzebi 
Njanyi 324, Map 91 
Njem 324, Map 87 
Njikun 324, Map 95 
Njinga 324, Map 84 
Njinju 324, Map 83 
Njoa-Poantu 324, Map 93 
Njoyame 324, Map 91 
Njwani see Nzwani 
Nkambe Map 95 
Nkamu 324, Map 84 
Nkangala 324, Map 84 
Nkem 324, Map 95 
Nkim 324, Map 95 
Nkoi 324, Maps 78, 80 
Nkombe 324, Map 80 
Nkomi 324, Map 88 
Nkonya 324, Maps 102-103 
Nkore (Nyankore) 324, Map 80 
Nkoya Map 77 
Nkoya 324, Map 77 
Nkuchu 324, Maps 80-81 
Nkum 324, Map 95 
Nkundo 324, Map 81 
Mongo dialects 330 
Nkwen 324, Maps 93, 95 
Nlaa 324, Map 93 
Nle 324, Map 93 
Nnakenyake 324, Maps 91, 98 
Nnam 324, Map 95 
Noanama (Waunana) 63, Maps 12-13 
Noatia (Tipra, Tippera) 166, 215 
Nobiin 283-4, Map 71 
Nocaman see Nokaman 
Nocte 167, 175, 213, Maps 49, 51 
Nogai (Noghai) 234, 238, Map 59 
Astrakhan 234, 238 
see also Tatar, Alabugat; Tatar, Yurt 
Noghai see Nogai 
NOI [Nupoid-Oko-Idomoid subgroup] 
306, Maps 93, 95, 97, 101 
Nok-aw see Khyamnyungan 
Nokaman (Nocaman) 70, Maps 14, 16, 20 
Nokopo 110, 112, Maps 32-33 
Nokuku 108, Map 27 
Nolgin 116, Map 35 
Nomaante (Numaand) 324, Maps 92-93 


Nomad 110, 112, Maps 32-33 
Nomane 110, Map 32 
Nomlaki [Wintuan] 8, 25, Maps 4, 8-9 
Nomu 110, 112, Maps 32-33 
Non (Noon) 324, Map 108 
Nonda 324, Map 80 
Nongatl see Eel River Athabaskan 
Noni 324, Map 95 
Nonni [Daur dialect] 192 
Nonuya 69, Maps 12-14 
Nooksack 8, 19, Map 6 
Noon see Non 
Nootka (Nuuchahnulth) 8, 19, Maps 6, 
8-9 
Nootre see Bulba 
Nop [Koho dialect] 177 
Nor-Pondo [Lower Sepik subgroup] 108, 
111, 113 
Nor see Murik 
Nora 184 
Norfolk 107, Map 25 
Norman French 263 
Normano-Picard (French) Map 69 
Picard 262 
Norrland Swedish Map 64 
North Mainland-D’Entrecasteaux 109-10, 
Map 31 
North New Guinea cluster 111, Maps 30, 
32-33 
North-East India see Sal 
Northern Central-American English 26, 
Maps 11, 13 
Northern Straits Salish 8, 13, 20, Maps 6, 
8-9 
Norwegian 233, 263-4, Map 64 
Bokmal 259, 263 
Nynorsk 259, 263 
Nosu (Northern Yi) 170, 172-3, 175, 
Maps 45-46 
Central Nosu Map 46 
Northern Nosu (Yinuo) Map 46 
Southern Nosu Map 46 
Notsi 109, Map 30 
Nottoway 9, 20, Maps 1-2 
Notu (Ewage) 105, 110-11, Maps 31-32 
Nowola 324, Map 105 
Noy 324, Maps 89-90 
Nozaman 324, Maps 87-88 
Nruanghmei see Kabui 
Nsari 324, Map 95 
Nsele 324, Map 95 
Nsenga 324, Maps 75, 77 
Nsinseke 324, Map 83 
Nsongo 324, Maps 81-82 
Nsose 324, Map 93 
Nsundi 324, Map 84 
Nta 324, Map 95 
Ntandu 324, Maps 81, 83-84 
Ntenyi (Meluri) 168, 212, Map 51 
Pochuri (Pochury, Anyo) 167-8, 
212-13 
Ntomba 324, Maps 81, 83 
North 324, Map 81 
South 327, Map 81 
Ntrubo see Delo 
Ntsiaki 324, Map 83 
Ntsotso 324, Map 84 
Ntumu 324, Maps 88, 92 
Nu see Anung 
Nuaheun Map 48 
Nuaulu 118, Map 37 
Nubian 283, Maps 71-72 
Hill Nubian 283, Maps 71-72 
Old Nubian 282, 283 
Nubri 164, Map 51 
Nuclear Papuan Tip 109-10 
Nuclear Polynesian 107 
Nuclear Rai 112, Maps 32-33 
Nudoo (Nudoong) 324, Map 93 
Nuer see Naath 
Nujum 324, Map 93 
Nuk 110, Map 32 
Nukak Maku see Waviare 
Nukuma 113, Map 33 
Nukumanu 107, 109, Maps 26, 29 
Nukunu 101, 117, Map 35 
Nukuoro 107-8, Maps 26, 28 
Nukuria 107, Map 26 
Numaand see Nomaante 
Numana-Gwantu 324, Map 96 
Numanggang 110, Map 32 
Numao 183-4 
Numbami (Siboma) 111, Map 32 
Nume 108, Map 27 
Numéé 108, Map 27 
Numic 8, Maps 2-4, 5-6, 8-9 
Numugen-Mabuso 112, Map 33 
Numugen 112, Map 33 
Nun Maps 93, 95 
Nung [Tai-Kadai] see Zhuang, Southern
Nung [Tibeto-Burman] see Nungish-
Rawang
Nung Ven (Ainh, Nungven) 182, 184, 
Map 47 
Nungali 116, Map 35 
Nunggubuyu 116, Map 35 
Nungish-Rawang [Tibeto-Burman] 163, 
168-9, 175, Maps 45-46, 49 
Nungu [Central Nigerian] 325, Map 96 
Nungunu [Bantu] 325, Maps 92-93 
Nungven see Nung Ven 
Nuni 325, Maps 104-105 
Nunian 283 
Nuno 325, Map 97 
Nunu [Bantu] 325, Maps 81, 83 
Nunu [Bunu dialect] 183 
Nunungal 102, 118, Map 35 


Nunusaku 118, Map 37 
Nupoid 305, 306, Maps 96-97, 101-102 
Nupe, Central 315, Maps 97, 101-102 
Nupe Tako 325, Map 101 
Nupe-Gbagyi 312 
Nuristani 216, Map 53 
Nusalaut 118, Map 37 
Nusu 169, 173, 175, Map 46 
Nutata 325, Map 92 
Nuuchahnulth see Nootka 
Nuxalk see Bella Coola 
Nuyangbe 325, Map 92 
Nuzi see Nungish 
Nwa (Nwan, Wan) 325, Map 105 
Nxa7amxcin see Columbian 
Nyabo 325, Map 105 
Nyabwa 325, Map 105 
Nyagchu 164 
Nyaheun 177-8 
Nyahkur (Chao Bon) 177-8, Maps 47, 
48 
Nyakyusa 312, 325, Maps 76-77 
Nyala [Bantu] 325, Map 86 
East 316, Map 86 
West 328, Map 86 
Nyala [East Sudanic] Maps 71-72 
Nyala-Lagowa [Daju dialect] 284 
Nyalgulgule [Daju dialect] 284 
Nyali 325, Map 79 
Nyam Map 54 
Nyamal 100, 116, Map 35 
Nyambo 325, Map 80 
Nyamkat (mNyamskad) 164 
Nyamnyam see Suga 
Nyamwezi 325, Maps 76, 78, 80, 86 
Nyandeva see Chiripa-Nyandeva 
Nyaneka (Nyanyeka) 325, Map 84 
Nyanga [Bantu: Kongo] 325, Map 83 
Nyanga [Bantu: Rwanda-Rundi] 325, 
Maps 79-80 
Nyanganyatjarra 100, 116, Map 35 
Nyangatjarra 100, 116, Map 35 
Nyangatom 284-5, 289, Map 72 
Nyangbara [Bari dialect] 284 
Nyangbo (Yangbo) 325, Maps 102-103 
Nyangga 116, Map 35 
Nyangumarta 100, Map 35 
Nyanja 325, 329, Maps 75, 77 
see also Chewa-Nyanja 
Nyanja-Chewa 
Malawian see Manganja 
Tanzanian see Nyasa 
Nyankore see Nkore 
Nyankore-Kiga 312 
Nyanyeka see Nyaneka 
Nyarafolo (Nyarafoloro) 325, 
Maps 104-105 
Nyarwanda see Rwanda 
Nyasa (Tanzanian Nyanja) 325, 329, 
Maps 75-76 
Nyaturu see Rimi 
Nyawaygi 101, 117, Map 35 
Nyengo 325, Maps 74, 77 
Nyenkha 165, 215, Map 51 
Nyep 325, Map 93 
Nyeri 325, Map 86 
Nyeshang see Manang 
Nyigina 116, Map 35 
Nyika 302 
Nyika-Safwa Maps 75-78 
Nyiha 325, Maps 75-78 
Wanda Maps 75-77 
Nyima 283, Maps 71-72 
Nyima (Nyimang) 284, Maps 71-72 
Nyindu 325, Map 80 
Nyingkhri 164 
Nyiramba-Rangi Maps 76, 86 
Nyiramba 325, Map 86 
Nyishang see Manang; Nishi 
Nyisu 172, 175, Map 46 
Nyiyabali 116, Map 35 
Nynorsk see Norwegian 
Nyo [Tai-Kadai] see Yo 
Nyo'o [Bantu: Mbam] 325, Map 93 
Nyole (Nyuli) 301, 325, Map 86 
Nyore 325, Map 86 
Nyoro-Ganda Maps 79-80 
Nyoro (Bunyoro) 325, Maps 79, 86 
Nyuan see Yuan 
Nyuli see Nyole 
Nyulnyulan 100, 116, Map 35 
Nyulnyul 116, Map 35 
Nyungar 101, 117 
Nyungwe 325, Maps 74-75, 77 
Nyuqa (Jauja, Shausha) 70 
Nzadi 325, Map 81 
Nzakara 325, Map 79 
Nzanda 325, Map 83 
Nzanyi 325, Map 94 
Nzebi (Njabi) 325, Maps 83, 88 
Nzema 325, Maps 103, 105 
Nzime 325, Map 87 
Nzwani (Njwani) 325, Map 75 


Oaxaca Mije 51 
Oaxaca Soke 51 
Ob-Ugric 231, 238, 260 
Obang 325, Map 93 
Obanliku 325, Map 95 
Obispeño [Chumashan] 20 
Oblo 325, Map 91 
Obo Manobo 119, Map 40 
Obolo 325, Map 101 
Obronuagum 325, Map 101 
Obulom 325, Map 101 
Ocaina see Okaina 
Occitan 259, 261 
Oceanic 98, 100, 111 
Ocuilteco see Okwilteko 
Odawa (Ottawa) see Nishnaabemwin 
Odual 325, Map 101 
Odut 325, Maps 93, 101 
Ofayé (Opayé) 73, Maps 18-19, 22 
Ofo 9, 20, Maps 1-2, 5 
Ogaden, Ogaadeen see af-Ogaadeen 
Ogbah 325, Map 101 
Ogbia 325, Map 101 
Ogbogolo 325, Map 101 
Oghuz Maps 57-61, 66, 70 
Ogoi see Elem-Ogoi 
Ogoni 306, Map 101 
Ogori-Magongo (Oko) 325, Map 101 
Oi (Sok, Sapuan, Cheng) 177-8, 
Map 48 
Oiampi 74 
Oirad [Mongolian] see Oirats; see also 
Altaic, Altai; Kalmyk 
Oirata [Austronesian] 109, 118-19, 
Map 38 
Oirats (Oirad) 188-9, 191-2, 240, 
Maps 45, 58 
dialects 188, 238, Map 45 
Oirat see Kalmyk 
‘Sart Kalmuck’ 240 
Oirot see Altaic, Altai 
Ojibwe (Ojibwa) 7, 19, 26, Maps 1, 5, 
7-9 
Central Southern 
(Anishinaabemowin) 7, 12, 20, 
22, Maps 8-9 
Eastern 7, 12, 14, 19-21, 23, 30 
see also Algonquin, Northern; 
Algonquin, Old; Ottawa; 
Saulteaux; Severn Ojibwe 
Ojitlan Chinanteko 53 
Ok 99, 110, 112, 114, Maps 32-33 
Lowland 112, 114, Maps 33-34 
Mountain 112, 114, Maps 33-34 
Okaina (Ocaina) 69, Maps 12, 14-15 
Okak 325, Map 87 
Okanagan 8, 20, Maps 5-9 
Okawnuchu 20 
Okchai-Chacato Map 2 
Okelousa Map 2 
Oklahoma Delaware see Unami 
Oko 305, 306, Map 101 
Oko see Ogori-Magongo 
Okobo 325, Maps 93, 101 
Okpahmeri 325, Map 101 
Okpe 325, Map 101 
Okpe-Idesa-Oloma 325, Map 101 
Okrika (Kirike) 325, Map 101 
Okro see Koro 
Oksapmin 99, 99, Map 33 
Okuma see Kahe-Okuma 
Okwanuchu 8, 21, Map 4 
Okwilteko (Ocuilteco) 45, 53, 
Maps 10-11 
Olam [Surma] see Murle; see also 
Zilmamu-Olam 
Olcha see Ulcha 
Old Algonquin 7, 20, Maps 1, 7 
Old Arzeu Map 73 
Old Bargu (Barga) 191, Map 45 
Old Nubian 282, 283 
Old Persian 216 
Old Plateau 306 
Old South Arabian 280 
Old Xiang 160-1 
Olee 325, Map 83 
Olem see Angoram 
Olikha Monpa 165, Map 51 
Olkola 101, 117, Map 35 
Ollari 217, Maps 51-52 
Olo (Wapi) [Papuan: Wapei] 113, 
Map 33 
Olobiri 325, Map 101 
Olonets(ian) see Livvi 
Olulumo see Ikom-Olulumo 
Oluteko (Oluta Mijean, Oluta Popoluca) 
51, Maps 10-11 
Omagua-Campeva see Omawa-Kampeva 
Omaha-Ponca 9, 20, Maps 5, 8-9 
Omambala Igbo 325, Map 101 
Omani 280, Map 70 
Omani dialects 280, Map 70 
Omati (Mini) 111, Map 32 
Omawa-Kampeva (Omagua-Campeba) 
62, 74, Maps 13, 15, 17 
Ombale see Chaurasia 
Ombo 325, Map 80 
Omdurman 281 
Omie (Aomie) 110-11, Maps 31-32 
Omo-Tana [Cushitic] 285-7 
Omotic 285, 287-8, Maps 72, 86 
North 288 
Ometo 288, 289, Map 72 
North Ometo 289 
South Ometo 289 
South 288, Map 86 
Omurano 69, Map 14 
Omvang 325, Map 87 
Omyene 325, Map 88 
Ona [Chon] 71, Map 22 
Onabasulu 110, Map 32 
One 113, Map 33 
Oneida 9, 20, Maps 1, 8-9 
Ong (Yir) 177-8, Maps 47-48 
Ong-be see Be 
Ongamo-Maa 284 
Ongamo 325, Map 86 
Onge [Andamanese] 217 
Ongniud, Southern [Ju Ud subdialect] 
Onicha 325, Map 101 
Onin 115, 118, Maps 34, 37 
Onjob 110, Map 31 
Ono [Huon] 110, 112, Maps 32-33 
Onondaga 9, 20, Maps 1, 8-9 
O'odham (Upper Piman) 9, 20, 24, 26, 
Maps 3-4, 9 
Oold 188, Map 45 
Oowekyala see Heiltsuk-Oowekyala 
Opao 111, Map 32 
Opata (Ópata) 54, Map 3 
Opayé see Ofayé 
Opelousa Map 2 
Opon-Karare 75, Map 12 
Ora-Emai-Iuleha 325, Map 101 
Orang Asli 183, 195 
Orang Hulu 121, Map 44 
Orang Kanaq 121, Map 44 
Orang Laut 121, 182-3, Map 43 
Orang Seletar 121, Map 44 
Oraon see Kurux 
Ordos (Urdus) 191, Map 45 
Oregon Athabaskan 8 
Orejón see Koihoma; Koto 
Oriya 217, Map 51 
Oriya 212-14, Maps 51-52 
Orlo (Woro) [Kresh dialect] 285 
Orma 325, Maps 72, 85 
Ormu 115, Map 34 
Ormuri (Bargista) 211, 216, Map 53 
Oro 325, Maps 93, 101 
Oroch 233-4, 238, Map 55 
Orochen [Oroqen dialect] 187 
Oroha 109, Map 29 
Orok (Ulta) 234, 238, Map 55 
Orokaiva 110-11, Maps 31-32 
Oroko 
East 316, Maps 92-93 
West 328, Maps 92-93 
Orokolo 106, 111, Map 32 
Oromoid 286-7, Map 72 
Oromo (Oromoo) 285-7, 325, 
Maps 72, 85-86 
Eastern Map 72 
Oromo-Orma see Waata 
Oroqen 187, Map 45 
Oroshori 236-7, Map 57 
Orowari (Pakaasnovos) 65, Maps 17, 
20-21 
Orowe 108, Map 27 
Orra 284 
Oru Yille 325, Map 104 
Orya-Mawes-Tor-Kwerba 99, 115, Map 34 
Osage 9, 20, Maps 2-3, 5, 8-9 
Oshun 325, Map 102 
Osimili Map 101 
Osimili Igbo 325, Map 101 
Ososo 325, Map 101 
Ossetian 234-7, Map 59 
Ostyak see Khanty 
Ostyak Samoyed see Selkup 
Osum 111, Map 33 
Otank 325, Map 95 
Oti 73, Map 22 
Oti-Volta Maps 102-104 
East Maps 102-104 
Northeast Map 102 
Southeast Map 102 
Oti-Volta West Maps 103-105 
Northwest Maps 103-104 
Southeast Maps 103-104 
Oto-Mang(u)ean 46, 52-3, Maps 10-11 
Oto-Pamean 46, 53, Maps 3, 8-11 
Otoe-Missouria see Chiwere 
Otomakoan 65, Map 12 
Otomako 65, Map 12 
Otomi 45, 49-51, 53, Maps 10-11 
Ottawa (Odawa) 20, Maps 1, 7 
Ottilien 113, Map 33 
Otuké (Otuqué) 72, Maps 18, 20 
Otuqué see Otuké 
Ouibala Map 101 
Oujiang see Chinese (Sinitic), Central, Wu 
Ouma 110, Map 31 
Outer Islands, Eastern Map 29 
Ovambo-Kwanyama Maps 74, 84 
Ovambo 325, Maps 74, 84 
Owena (Waisara) 110, Map 32 
Owens Valley Paiute see Mono 
Owere 325, Map 101 
Owiniga 113, Map 33 
Oxoryok (Koriok) [Lotuko dialect] 284 
Oyda (Oyde) 288, 289, Map 72 
Oyo 325, Map 102 
Oyuba 325, Map 88 


Pa 110, 112, Maps 32-33 
Pa Then see Baheng 
Pa'a 325, Map 97 
Pa'no 325, Map 91 
Pa'umotu 107, Map 26 
Pa-O 174-5, Map 49 
Pááfang 108, Map 28 
Paakantyi 102, 118, Map 35 
Southern 102, 118, Map 35 
Paama 108, Map 27 
Paangkhua (Pankho) 168 
Pabir 325, Map 98 
Pacahuara see Pakawara 
Pacific Pidgin English 104, 106-7 
Pacific Yupik (Alutiiq) 7, 20-1, 
Maps 7-8 
Pacoh 177-8, Map 47 
Padam 169 
Padari Map 54 
Padaung see Kayan [Karen] 
Padogho 325, Maps 104-105 
Paes-Barbakóan 63, Maps 12-14 
Paes (Paez) 63, Maps 12-14 
Pagei see Pagi 
Pagi (Pagei) 112, 114, Maps 33-34 
Pagu 118, Map 37 
Paha [Buyang variety] 181-2 
Pahari 214 
Central Maps 51, 54 
Eastern Maps 51, 54 
Western Map 54 
Pahi 113, Map 33 
Pahoturi 99 
Pahoturi River 111 
Pai [Bantu] 325, Map 74 
Pai [Cochimi-Yuman] 8, Maps 8-9 
Paipai (Pai'pai) 8, 21, Maps 3-4, 8-9 
see also Ipai; Tipai 
Pai [Papuan: Sepik] 113, Map 33 
Paici 108, Map 27 
Pailibo 169 
Paipai see Pai [Cochimi-Yuman], Paipai 
Paitanic 121 
Paite 168, 213 
Paiute 
Northern (Paviotso) 8, 20, Maps 4-5, 
8-9 
see also Mono; Ute-Chemehuevi 
Paiwa 110, Map 31 
Paiwan [Austronesian] 182, 184, Map 50 
Pajade (Badyara) 325, Maps 107-108 
Pajalate see Coahuilteco 
Pak Tay (Southern Thai) 121, 181, 183-4, 
Map 48 
Pak-Tong 109, Map 30 
Paka 113, Map 33 
Pakaasnovos see Orowari 
Pakan see Pakanic, Pakan 
Pakana Map 2 
Pakanh 101, 117, Map 35 
Pakanic [Mon-Khmer subgroup] 175, 
Maps 45-47 
Pakan (Ben'gan) 176, 178, Maps 45-46 
Pakatan 177 
Pakawara (Pacahuara) 71, Maps 14, 
16-17, 20-21 
Paku 174-5, Map 49 
Palaihnihan 7-8, 10, Maps 4, 8-9 
Palaka see Kpalaga 
Palamul 115, 118, Maps 34, 37 
Palantla Chinanteko 53 
Palauan 107-8, Map 28 
Palaung-Wa [Mon-Khmer subgroup] 
Maps 46-47 
Palaungic (De'ang, Benglong) 175-6, 
178, Map 46 
De'ang 176, 178 
Palaung (Ta-ang) 176, 178, Map 49 
Palawan 119, Map 40 
Palaweño 119, Map 40 
Palaychi see Mopwa 
Pale 176, 178, Map 46 
Palei 113, Map 33 
Paleo-Siberian (Paleo-Asiatic) 9, 239 
Palewyami Yokuts 8, 21, 25, Map 4 
Pali 171 
Paliet [Jieng dialect] 284 
Palikur 62, 66, Maps 12-13 
Palioping [Jieng dialect] 284 
Paliu (Palyu, Lai) 176, 178, Maps 45, 46 
Pallanganmiddang 102, 118, Map 35 
Palmela 76, Maps 16, 20 
Palmyra (Tadmur Qoltu) Map 70 
Palor 325, Map 108 
Palpa Map 51 
Palta 69 
Palu'e 119, Map 38 
Paluan 121, Map 41 
Palyu see Paliu 
Pam 325, Map 91 
Pama-Nyungan 100-2, 108, 111, 115-16, 
Maps 32, 35 
Paman 101, Map 35 
Middle 101, 117, Map 35 
Norman 101 
Northern 101, 117, Map 35 
Rarmul 101, 117, Map 35 
South-Western 101, 117, Map 35 
Southern 101, Map 35 
Pambia 325, Map 79 
Pame 53, Maps 3, 10-11 
Northern Maps 8-9, 11 
Pamigua see Pamiwa 
Pamiwa (Hehenawa) [Tukánoan] see 
Kubewa 
Pamiwa (Pamigua) [Tiniwan] 65, 
Maps 12-13 
Pamona 120, Map 39 
Pamplona Atta see Atta 
Pamwe see Fang 
Pana [Grusi] 325, Map 104 
Pana [Mbum-Day] 325, Map 90 
Pana [S. Ngwi] see Bana 
Panamanian Creole English 27 
Panamint (Tümpisa Shoshone) 8, 21, 
Maps 4-5, 8-9 
Panare 76, Maps 12-13 
Panchthare Limbu 166 
Pande 325, Map 87 
Pandikeri [Madi dialect] 284 
Pangasinan 103, 119, Map 40 
Pangduwali 166 
Panggi 169 
Pangkhu 215 
Pangkumu 108, Map 27 
Pangwa 325, Map 76 
Pangwali Map 54 
Panim 112, Map 33 


Panjabi see Punjabi 
Pankararú 73, Maps 18-19 
Pankho see Paangkhua 
Pano-Takanan 70-1, Maps 14-18, 20-21 
Panoan 70-1, Maps 14-18, 20-21 
Pano 70, Maps 14, 16, 20 
Panobo 70 
Pansaleo (Panzaleo) 63, Maps 12, 14 
Pantar 119, Map 38 
Panxian 172 
Panyam 325, Map 98 
Panyjima 100, 116, Map 35 
Panzaleo see Pansaleo 
Papago see Piman, Upper 
Papantla (Lowland) Totonako 51 
Papapana 108, Map 29 
Papar 121, Map 41 
Papasena 115, Map 34 
Papel 311, Map 108 
Papel 325, Map 108 
Papi 113, Map 33 
Papi (Paupe) 113, Map 33 
Papia Kristam see Patua 
Papia Kristang see Kristang 
Papiamentu [Spanish-based creole] 26, 
27, 262, Map 13 
Papitalai 109, Map 30 
Papora 182, 184 
Papuan 97, 98-9, 99, 109-15, Maps 24, 
29, 31-34 
Central and Southeast 99 
South-East 99, 110-11, Maps 31-32 
South-Central 99, 111, Maps 32, 34 
West 99, 114-15, 118, Maps 34, 37 
see also Trans-New Guinea 
Papuan Pidgin English 102, 107 
Papuan Tip [Austronesian] 109-11, 
Map 31 
Nuclear 109-10, Map 31 
Peripheral 110-11, Map 31 
Central Papuan 110-11, Map 31 
Papuma 115, Map 34 
Para see Khyamnyungan 
Para-Malay see Malayo-Polynesian, Malay 
Parachi 216, Map 53 
Paraguayan Guarani (Avañe'e) 74 
Parakana-Asurini see Akuawa 
Parali 165-6, Map 51 
Paranan [N. Philippine] 119, Map 40 
Parannawa 71, Maps 14, 16, 20 
Paraok see Waic, Wa 
Parau Tinito [pidgin Tahitian] 102, 106, 
Map 25 
Parauhano (Paraujano, Anyú) 66, 
Maps 12-13 
Paraujano see Parauhano 
Paravilyana 76 
Paravilyana (Paravilhana) 76, Map 12 
Parawen 112, Map 33 
Pare 304, Maps 85-86 
Pare (Asu) 325, Maps 85-86 
Parecis see Paresi 
Parekwa 325, Map 94 
Parengi (Gorum) 217, Maps 51-52 
Paresi (Parecis, Haliti) 62, 66, 
Maps 18-20 
Parintintin (Tenharim) 74, Maps 16-18, 
20 
Parji 213, 217, Maps 51-52 
Parsi 259, 263 
Parthian 216 
Parya [Indo-Aryan] 236-7 
Pasar Malay see Bazaar Malay 
Pascagoula Map 2 
Pasco Kechua (Kichua, Quechua) 70 
Pasé 66, Maps 13, 16 
Pashai (Pashayi) 211, Map 53 
Pashto (Pushtu) 211-12, 216, 234, 236, 
263, Map 53 
Pasi 113, 169, Map 33 
Pasing (Bit, Khao) 176, 178, Map 47 
Pasismanua 109, Map 30 
Passamaquoddy see Maliseet- 
Passamaquoddy 
Pasto 63, Map 12 
Pata 325, Maps 78, 80 
Patamona 76 
Patani [Austronesian] 118, Map 37 
Patani [Tibeto-Burman] Map 54 
Patapar Map 39 
Patashó-Hanhanhain (Pataxó, Ha Ha 
Hae) Maps 18-19 
Pate 325, Map 85 
Pathen see Baheng 
Patimaya 100, 116, Map 35 
Patnam 164 
Patpatar 109, Map 30 
Pattani (Manchati) 165 
Patua (Papia Kristam) [creole] 162, 193, 
Map 45 
Patwin 8, 21, Maps 4, 8-9 
Pauixiana see Pawishiana 
Paulohi 118, Map 37 
Paumari 67, Maps 16-17 
Pauna-Paikone 67, Map 20 
Pauserna 74, Maps 20-21 
Pauwasi 99, 99, 112, 114, Maps 33-34 
Pauwi 115, Map 34 
Paviotso see Paiute, Northern 
Pawaian 99, 99, 111, Map 32 
Pawi 168, 213 
Pawishiana (Pauixiana) 76, Maps 12, 18 
Pawnee 9, 21, Maps 5, 8-9 
Pay 112, Map 33 
Paya [Naga] see Khyamnyungan 
Paya (Pech) [Chibchan] 61, 64, 
Maps 12-13 


Payagua see Payawa 
Payawa (Payagua) 72, Maps 18, 22 
Paynamar 111, Map 33 
Payungu 100, 116, Map 35 
Pazeh 182, 184, Map 50 
Pearic 175, 177-8, Map 48 
Pear (Pérr) 177-8, Map 48 
Pech see Paya 
Pecos Maps 3, 5 
Pedee Map 2 
Pedi [Bantu] 312, 325, 332-3, Map 74 
Peere (Muure) 323, 325, Maps 91, 98 
Peigan see Blackfoot 
Pelasla 325, Map 94 
Pemba 325, Maps 76, 85 
Pemong (Pemón) 76, Maps 12-13 
Penchal 109, Map 30 
Pend d'Oreille see Spokane-Kalispel-Flathead 
Pendau see Ndau [Sulawesi] 
Pende Map 81 
Pende 325, Map 81 
Pengo 217, Maps 51-52 
Peninsular Arabic 279 
dialects Map 72 
Pennsylvania Dutch 27 
Penobscot see Abenaki 
Pensacola Map 2 
Pentlatch 8, 21, Map 6 
Penutian 9, 46 
Pequot see Mohegan-Pequot 
Peramangk 102, 118, Map 35 
Peranakan Indonesian see Chinese 
Indonesian 
Peremka see Morehead-Upper Maro, 
Morehead, Lower 
Peri 325, Maps 79-80 
Pericu 9, 15, 21, Map 3 
Permic 238, Maps 55, 58 
Pero 325, Map 98 
Persian (Farsi) 211, 216, 234, 236-7, 259, 
280, Maps 53, 70 
Old Persian 216 
Western 262-3 
Petapar (Taje) 120 
Petarmul see Malua Bay 
Petats 108, Map 29 
Peve 325, Map 91 
Phake (Phakial) 180, 184, 217, Map 51 
Phalo 176, 178, Map 47 
Phalura Map 53 
Phana 174-5, Map 47 
Phang see Plang 
hPhanpo 164 
Phay see Mal 
Phayeng see Andro 
Phedappe Limbu 166 
Phi Tong Luang see Mlabri 
Philippine [Austronesian] 27, 120, 162, 
Map 50 
Central Map 40 
Meso- Map 40 
North Map 40 
South Map 40 
Phlong see Pho 
Pho (Sho, Pwo, Phlong) 174-5, 
Maps 47-49 
Phom (Chingmengnu, Assiringia, 
Tamlu) 167, 213, Map 51 
Phong [1: Khmuic] see Xinhmul 
Phong (Kha Pong) [2: Vietic] 177, 
Map 47 
Phong (Pong) [3: Vietic] 177-8, Map 47 
Phongku 174 
Photungjap see Nocte 
Phu Tai 180, 184 
Phuan [Tai] 180, 184, Maps 47-48 
Phuang [Katuic] see Phuong 
Phula Map 47 
Phuman (U, Hu) 176, 178, Map 46 
Phun see Hpun 
Phunoi (Sinsali, Singsili, Cong) 174-5, 
Map 47 
Phuong (Phuang) 177-8, Map 47 
Phutai Maps 47-48 
Phuthi 325, Map 74 
Phyaphreng 164 
Piamatsina 108, Map 27 
Piame (Biami) 113, Map 33 
Piapoco see Piapoko 
Piapoko (Piapoco, Tsase) 65-6, 
Maps 12-13 
Piaroa-Mako 77, Maps 12-13 
Piawi 99, 113, Map 33 
Waibuk (Piawi) 113, Map 33 
Waibuk (Wiyaw) 113, Map 33 
Picard see Normano-Picard 
Picuris 9, 21, Maps 5, 8-9 
Pidgin Dobu (Gosiagu Talk) 102, 105, 
Map 25 
Pidgin English 302, 309, 311, 325 
Pidgin Hindi [Fiji] 106, Map 25 
Pidgin Motu 105 
pidgins 103-7 
Piedmontese (Italian) 261, Map 67 
Piegan see Blackfoot 
Piin 325, Map 81 
Pije 108, Map 27 
pijins see pidgins 
Pila 112, Map 33 
Pilaga 72, Maps 20-23 
Pileni 107, 109, Maps 26, 29 
Pilipino see Filipino 
Pilu [Katu dialect] 177 
Piman, Lower (Pima Bajo) 54, Maps 3, 9 
Pima-Papago 54 
Piman, Upper see O'odham 


Pimbwe 325, Maps 76, 78 
Pimenteira 76, Map 18 
Pinai (Pinaye) 113, Map 33 
Pinaye see Pinai 
Pinchi see Taushiro 
Pinghua Map 45 
Pingilapese 108, Map 28 
Pinikura 100, 116, Map 35 
Pinji 325, Maps 83, 88 
Pinome see Pinotl 
Pinotl (Pinome) 53-4, Map 10 
Pintini 101, 117, Map 35 
Pintupi 100, 116, Map 35 
Pinyin [Grassfields] 325, Maps 93, 95 
Piohé (Secoya) 68, Maps 12-15 
Pipelman 101, 117, Map 35 
Pipikoro (Uma) 120, Map 39 
Pipil 45, 50, 54, 61, Maps 10-11 
Pirahan (Piraha) 71, Maps 16-17 
Piratapuyo see Wanana-Pira 
Pirlatapa 101, 117, Map 35 
Piro [Kiowa-Tanoan] 9, 21, Maps 3, 5 
Piro (Manitereni, Maxineri) [Maipurean] 
67, Maps 14-17, 20-21 
Pirriya 101 
Piru Bay 118, Map 37 
Pisa 114, Map 34 
Pisaflores Tepewa 51 
Piscataway see Conoy 
pisins see pidgins 
Pisu 174 
Pitcairn 107, Map 25 
Pite Saami 263, Map 64 
Piti [Central Nigerian] see Piti-Atsam, Piti 
Piti [Tibetan] see Spiti 
Piti Ulunna Salo (Pitu Ulunna Salo) 120, 
Map 39 
Piti-Atsam Maps 96-97 
Piti 325, Maps 96-97 
Pitjanjatjara (Pitjantjatjara) 100, 107-8, 
116, Map 35 
as lingua franca 116 
Pitta-Pitta 101, 117, Map 35 
Pitu Ulunna Salo see Piti Ulunna Salo 
Pitungu 100, 116, Map 35 
Piu 111, Map 32 
Piva 108, Map 29 
Piya (Wurkum) 325, Map 98 
Piyo see Biyue 
Plains Apache see Kiowa Apache 
Plains Cree 7, 21, Maps 6-9 
mixed language see Michif 
Plains [Igboid subgroup] Map 101 
Plains Igbo 325, Map 101 
Plains Miwok see Miwok, Eastern 
Plang (Phang) 176, 178, Maps 46-47, 49 
Plateau [Central Nigerian] Maps 95-98, 
100 
North Maps 96-97 
North-West Maps 96-97 
South-West Maps 95-98 
West Maps 95-96 
Plateau (Plateau Penutian) [America] 8, 
12, 16, 18, 21, Maps 4-6, 8-9 
Platoid 305-6 
Central see Kaje 
Plautdietsh Creole Map 11 
Playano Salinan Map 4 
Pnar (Synteng, Jaintia) 178, 217, Map 51 
Po see Bo 
Pochuri (Pochury) see Ntenyi (Meluri) 
Pochuteko 50, 54, Map 10 
Pocomchi see Poqom, Poqomchi 
Podopa (Polopa) 111, Map 32 
Podzo 325, Map 75 
Pogaya (Bogaia) 112, Map 33 
Pogoro 325, Map 76 
Poguli Map 54 
Poh yut see Sulung 
Pohena see Qalyawaya 
Poitevin (French) Map 69 
Pojoaque see Tewa 
Pojulu (Péjulu) 284, 325, Maps 72, 79 
Pok 325, Map 86 
Poka-Kori-Wipsi-Geeri 325, Maps 100, 
102 
Poke-Kele Maps 79-81 
Poke 325, Maps 79-80 
Poko-Vumba 325, Map 82 
Pokomam see Poqom, Poqomam 
Pokomchi see Poqom, Poqomchi 
Pokomo 306, 325, 330, Maps 72, 85 
Pokot 325, Maps 72, 86 
Pol see Pori 
Polar Eskimo (North Greenlandic) 15, 
263 
Polci 325, Maps 97-98 
Police Motu (Hiri Motu) 102, 105, 
Map 25 
Polish 231-4, 236, 259-61, Maps 58, 
61-62, 64-65 
abroad 26, 234-6, 261 
in Australia Map 36 
Polonombauk 108, Map 27 
Polopa see Podopa 
Polynesian 107-9, 309, Maps 26, 28-29 
Eastern 107 
Pom 115, Map 34 
Pómaro [Nawa] 53 
Pomo [Bantu] 325, Map 87 
Pomoan 7-8, 21, 46, 51, Maps 4, 8-9 
Central Pomo 12, Maps 4, 9 
Eastern Pomo 8, 14, Maps 4, 9 
Northeastern Pomo 8, 19, Map 4 
Northern Pomo 20, Maps 4, 9 
Southeastern Pomo 8, 23, Map 4 
Southern Pomo 23, Maps 4, 9 


Southwestern Pomo see Kashaya 
Western Pomo 8, Map 4 
Ponam 109, Map 30 
Ponapeic 108, Map 28 
Ponapean 108, Map 28 
Ponca see Omaha-Ponca 
Pondo 113, Map 33 
Pondo see Angoram 
Pondoma 111, Map 33 
Pong see Phong; see also Maleng 
Pongo [Bantoid: Duala] 325, Map 92 
Pongu (Ri) [Basa-Kamuku] 325, 
Map 101 
Pongyong 166 
Ponosakan 120, Map 39 
Pontic (Greek) 260, Map 66 
Ponto-Caspian (Kipchak-Cuman) 238 
Ponyo see Khyamnyungan 
Popoi (Mapopoi) 322, Map 79 
Popoloca see Popolokan; Popoluca; see also 
Pupuluca de Conquaco 
Popolokan Sapotekan 53 
Popoloka (Popoluka) 50-1, 53, 
Maps 10-11 
Popoluca de Oluta 51 
Popoluca de Sayula 51 
Popoluca de Texistepec 51 
Popoluka see Popolokan; Popoluca 
Popti (Jacalteco) 52 
Poqom 52 
Poqomam (Pokomam) 52, Maps 
10-11 
Poqomchi (Pokomchi, Pocomchi) 52, 
Maps 10-11 
Porapora (Grass) 113, Map 33 
Pori (Pol) 325, Map 87 
Porohanon 119, Map 40 
Porome 111, Map 32 
Port Sandwich 108, Map 27 
Port Vato 108, Map 27 
Portuguese 259, 262, Map 68 
abroad 26, 234, 261 
in Africa 303, 311 
in Australia Map 36 
in South America 27, Maps 13, 17, 
19, 21, 23 
creoles 103, 162, 193, 325, 330, 
Maps 25, 52, 68, 88, 108 
Indo-Portuguese 214, Map 52 
see also Crioulo; Kristam; Kristang 
Portvato see Port Vato 
Potawatomi 7, 21, Maps 1, 8-9 
Potiguara 74, Map 19 
Poto [Bantu] 325, Map 82 
Poté (Chilanga) [Lenkan] 54 
Potu-Tano 305, Map 105 
Potu Map 105 
Pove 325, Maps 83, 88 
Poyanawa (Puinahua) 71, Maps 14-16, 20 
Prakrit 216 
Prang [Mnong dialect] 177 
Prasuni 211, 216, Map 53 
Pray see Mal 
Pre-‘Anazi Map 70 
Preh [Mnong dialect] 177 
Provençal (French) 261, Maps 65, 67, 69 
Pru [Koho dialect] 177 
Psikye 325, Map 94 
Pu [Nicobarese dialect] 178 
Puari 112, Map 33 
Pubiao 181-2, 184, Map 45 
see also Pupeo 
Pudtol Atta see Atta 
Puelche 69, Map 22 
Puget Sound Salish see Lushootseed 
Pugliese (Italian) 261, Map 67 
Puguli (Pwo) 325, Maps 104-105 
sPuhreng 164, Map 45 
Puinave (Guaipuinavi, Waipuinavi) 68, 
Maps 12-13 
Puinávean 67-8, Maps 12-13, 16-17 
Puiron 168 
Pukapuka 107, Map 26 
Pukina 70, Map 20 
Pula 172, 174-5, Map 46 
Pulaar see Pular 
Pulabu 112, Map 33 
Pulana 325, Map 74 
Pular (Pulaar) 311, 325, Maps 107-108 
Pulie 109, Map 30 
Pulinya 100, 116, Map 35 
Puluwat 108, Map 28 
Puma [Himalayan] 165-6, 214, Map 51 
Pumi [N.E. Tibeto-Burman] 169-70, 175, 
Maps 45-46 
Punan Aput 120, Map 41 
Punan Bah-Biau 120, Map 41 
Punan Merah 120, Map 41 
Punan Merap 120, Map 41 
Punan-Nibong 120, Map 41 
Punan Tubu 120, Map 41 
Puncchi (Punchi) Map 54 
Pungu [Bantu] 325, Map 84 
Pungu-Pungu [Australian] 116, Map 35 
Punjabi 211, 259, Maps 53-54 
Punjabi (Panjabi) 211-14, 214, 217, 
Maps 53-54 
abroad 26, 181, 263 
Punthamara 101, 117, Map 35 
Punu [Bantu] 325, Maps 83, 88 
Punu [Karen] see Manu 
Puoc [Khmuic] see Xinhmul 
Pupeo (Pubiao, Kabeo, Laqua) 182, 184, 
Map 47 
Pupuluca de Conquaco 49 
Puragi 114, Map 34 
Purari (Koriki, Namau) 111, Map 32 
Purepecha see Tarasko 
Purian 73, Maps 18, 22 
Puri 73, Maps 18, 22 
Purik 164, Map 54 
Purisimeño [Chumashan] 21 
Puroik see Sulung 
Puruborá 75, Maps 16, 20 
Purukotó 76, Map 12 
Purum 168 
Puruwa 69, Map 14 
Pushtu see Pashto 
Putai 325, Maps 94, 98-99 
Putitjarra 100, 116, Map 35 
Putoh 120, Map 41 
Puxian see Chinese (Sinitic), Southern, 
Hokkien (Min) 
Puyuma 182, 184, Map 50 
Pwaamei 108, Map 27 
Pwapwa 108, Map 27 
Pwo see Pho; Puguli 
Pyapun 325, Map 96 
Pyu [Kwomtari-Leftmay] 112, 115, 
Maps 33-34 
Pyu [Sak] 167, 171 


Qabena see Timbaro 
Qafar see Afar 
Qahar [Ulangab subdialect] 191, Map 45 
Qahtan Map 70 
Qalyawaya (Callahuaya, Pohena) 70 
Qanjobalan 52, Maps 10-11 
Qanjobal (Kanjobal) 45, 52, 
Maps 10-11 
Qashqai 211, 216, Map 53 
Qasim Map 70 
Qatar Map 70 
Qauqaut 182 
Qaw 181-2, 184, Map 45 
Qawaskar see Kaweskar 
Qawiaraq see Inupiaq 
Qeqchi (Kekchi) 50, 52, Maps 10-11 
Qi [Li subvariety] 179 
Qiangic 175 
Qiang 169-70, 175, Map 45 
Qinlian [Yue dialect] 160 
Qiongwen [Min dialect] Map 45 
see also Chinese (Sinitic), Southern, 
Hokkien (Min) 
Qiqihar [Daur subdialect] 192 
Qottu 286 
Qoxiong 183-4 
Qriz see Kryz 
Quapaw 9, Maps 2, 5 
Quchani see Khorasani Turkish 
Quechan (Yuma) 8, 21, Maps 3-4, 8-9 
Quechua see Kechumaran, Kechua 
Quechumara see Kechumaran 
Queensland Canefields English (Kanaka 
English) 106-7, Map 25 
Quiché see Kiché 
Quichua see Kechumaran, Kechua, Kichua 
Quileute 8, 21, Map 6 
Quimbaya see Kimbaya 
Quinault 8, 21, Map 6 
Quinigua Maps 2-3 
Quinipissa Map 2 
Quiotepec Chinanteko 53 
Quiripi-Unquachog 7, 21, Map 1 
Qwara [Kemant variety] 286 
Qyo:l see Mam 


Rabai 325, Map 85 
Rabat [Arabic dialect] Map 73 
Rabha 166, 213, Map 51 
Raepa-Tati see Tate 
Raga 108, Map 27 
Ragane 325, Map 80 
Ragoli see Lugoli 
Ragreig [Burun dialect] 284 
Rai see Kiranti 
Rai Coast 99, 111-12, Maps 32-33 
Nuclear Rai 112, Maps 32-33 
Raja Empat (Raja Ampat) 115, 118, 
Maps 34, 37 
Rajah Kabungsuan Manobo 119, Map 40 
Rajasthani 213, Map 54 
Rajasthani 211-14, Map 54 
Rajbangshi (Rajbongshi, Rajbansi) 
214-15, Map 51 
Raji see Raute 
Rajong 118, Map 38 
Rakahanga see Manihiki-Rakahanga 
Rakaya 101, 117, Map 35 
Rakhine see Arakanese 
Ram 113, Map 33 
Rama 61, Maps 12-13 
Ramarama 75, Maps 20-21 
Ramarama-Urumi 75, Maps 16, 20 
Ramaytush see Costanoan 
Rambani Map 54 
Rampi 120, Map 39 
Ramu 99, 113, Map 33 
see also Sepik-Ramu 
Rana Tharu (Tharu, Dagaura) 214, 
Maps 51, 54 
Rangi 326, Maps 76, 86 
Rangkhol see Langrong 
Ranglo 165, Map 54 
Rangpa 165, Maps 51, 54 
Rangpa 165, Map 54 
Rangpan [Baric] see Tangsa 
Rangwa 101, 117, Map 35 
Rao [Sepik-Ramu] 113, Map 33 
Rao (Ten, Yanghuang) [Kam-Sui] 179, 
184, Map 45 
Raoruo Map 46 
Rapa 107, Map 26 


Rapting 112, Map 33 
Rarmul Paman Map 35 
Rashad 304 
Rashad Map 72 
Rashay(i)da 279, 281 
Ratahan 120, Map 39 
Raute (Raji) 165, 214, Map 51 
Rauto 109, Map 30 
Rawa [Finisterre] 110, 112, Maps 32-33 
Rawang 167, 169, 175, Map 49 
Rawo [Skou] 112, Map 33 
Raya 286, Map 72 
Red Gelao see A-uo 
Red Karen see Kayah 
Red Tai see Tai-Kadai, Tai Daeng 
Reefs-Santa Cruz 107, 109, Map 29 
Reindeer Ewenk see Evenki (Ewenke) 
Rejang [Malay] 121, Map 43 
Rejang Baram 103, 120, Map 41 
Beram-Tinjar 120, Map 41 
Lower Rejang 120, Map 41 
Rejang-Bintulu 120, Map 41 
Rejang-Sajau 120, Map 41 
Rejang Kayan 120, Map 41 
Rema see Remo [Munda] 
Rembarrnga 116, Map 35 
Rembong 118, Map 38 
Remo [Pánoan] 71, Maps 14, 16, 18, 20 
Remo (Rema, Bonda) [Munda] 217, 
Maps 51-52 
Rempi 112, Map 33 
Rendille 287, 326, Maps 72, 85-86 
Rengao 177-8, Map 48 
Rengkhang 168 
Rengma 167-8, 213 
North 167 
West 168, Map 51 
Rengpungmo 164, Map 51 
Rennell-Bellona 107, Map 26 
Rennellese 109, Map 29 
Rera 326, Map 80 
Rerau 112, Map 33 
Reshe Maps 100-101 
Reshe 326, Maps 100-101 
Resigaro 66, Maps 12, 14-15 
Reunion Creole French 216, Map 52 
Reyesano 71, Maps 17, 20-21 
Rhade (Ede) 177, 182, 184, Map 48 
Rhaeto-Romance 259, Maps 65, 67, 69 
Rhaeto-Romance Maps 65, 67, 69 
Ri see Pongu 
Riang (Reang) [Tripuri dialect] 166 
Rianglang (Riang, Black Karen, Yanglam, 
Yang Sek) 176, 178, Map 46 
Riantana 114, Map 34 
Ribe see Rihe 
Ribina 326, Maps 96-97 
Ribun 120, Map 41 
Righ Maps 71, 73 
Rihe (Ribe) 326, Map 85 
Rikbaktsá 62, 72, Map 19 
Rikpa 326, Map 93 
Rimi (Nyaturu) 326, Maps 76, 86 
Rinconada Bikol 119, Map 40 
Ring 306, Map 95 
Central Map 95 
East Map 95 
West Map 95 
Ringu-Ringu 101, 117, Map 35 
Rio Grande Keresan 9, 21, Maps 3, 5, 8-9 
Rio Grande Tewa Map 5 
Rio Negro 66, Maps 13, 16-17 
Ripey 326, Map 93 
Ririo 108, Map 29 
Ritharrngu 100, 116, Map 35 
Ritwan 7, Maps 4, 8-9 
Riung 118, Map 38 
River Yuman 8 
Riyadh Map 70 
Rizaygat Maps 71-72 
Roba 326, Map 98 
Rocky Peak (Yinibu) 112, Map 33 
Rocoroibo 54 
Rodong see Chamling 
Roglai 182 
Northern 182, 184, Map 48 
Southern 182, 184, Map 48 
Roinji-Nenaya 113, Map 33 
Roinji 113, Map 33 
Rolom [Mnong dialect] 177 
Rolong 326, Map 74 
Roma [Eurasia] see Romani 
Roma [Timor] 119, Map 38 
Romance Maps 61-62, 65-69 
in Asia 237, Map 52 
Romani (Roma, Romany) 27, 232-7, 
259-60, 262 
Anglo-Romani 263 
Balkan 260-1 
Caló, Calao (Iberian Romani) 262 
Sinte 261 
Tavringer 263 
Vlach 260-1 
see also Domari 
Romanian (Daco-Rumanian) 233-5, 237, 
259-60, Maps 58, 61-62 
Moldavan 260 
Wallachian 260 
see also Istro-Rumanian; Macedo- 
Rumanian; Megleno-Rumanian; 
Moldovan 
Romansch 259, 261 
Romany see Romani 
Romblon 119, Map 40 
Rombo 326, Map 86 
Romkun 113, Map 33 
Ron [Cenderawasih Bay] 115, Map 34 


Ron [Chadic subgroup] Map 96 
Ronde 326, Map 106 
Ronga [Bantu] 326, Maps 74-75 
Rongba 164, 175 
Rongba-hBrogpa (Rongpa-Brogpa) 164, 
175, Map 45 
Ronge see Temein 
Rongga [Bima-Sumba] 118, Map 38 
Rongpa-Brogpa see Rongba-hBrogpa 
Roria 108, Map 27 
Roro 111, Map 32 
Roshani (Rushani) 234, 236-7, Map 57 
Roti 119, Map 38 
Rotokas 99, 109, Map 29 
Rotuman 107 
Rotuman 107, Map 26 
Rouku see Morehead-Upper Maro, 
Morehead, Upper 
Round Valley Yuki 25 
Roviana 102, 106, 109, Map 29 
Rub 307 
see also Kuliak 
Rubatab Maps 71-72 
Ruc (Cha Cui, Tac Cui, Cuoi, Tho) 
176-8, Map 47 
Rue 326, Maps 74-75, 77 
Rufa‘a 281, Maps 71-72 
Rufiji [Bantu subgroup] Maps 75-76 
Rufiji (Ruihi) 326, Map 76 
Ruga 166 
Ruguru see Luguru 
Ruihi see Rufiji 
Rukai 182, 184, Map 50 
Rukuba-Kanufi Maps 96-97 
Rukuba (Che) 326, Maps 96-97 
Ruli 326, Map 86 
Rumai 176, 178, Map 46 
Rumanian see Romanian 
Rumaya (Mala) 326, Map 97 
Rumdali (Rumdali Rai) see Bahing 
Rumsen (Rumsien) see Costanoan 
Rumu (Kairi) 111, Map 32 
Rundi (Kirundi) 310, 310, 311, 326, 
Map 80 
Runga-Kibet (Aiki-Kibet) [Maban] 283 
Runga 326, Map 89 
Rungu [Bantu: Omyene] 326, Map 88 
Rungu [Bantu: Tongwe] 326, Maps 77-78 
Rungus 121, Map 41 
Rungwa 326, Maps 76, 78 
Rupini 166 
Ruruma 326, Map 97 
Rushani see Roshani 
Russian 231, 236, 259, Maps 45, 50, 
55-59, 61-62, 64 
beyond Russia 27, 188, 192-3, 231-7, 
260-1 
Ruthenian 231-3, 236 
Ruthog 164, Map 45 
Ruto-Vale Map 90 
Ruto 326, Maps 89-90 
Rutul 234, 239, Map 59 
Ruund 326, Map 81 
see also Lunda-Ruund 
Ruvu 306 
Coast Maps 76, 85 
Interior Maps 76, 85 
Ruvuma 329, Map 75 
Ruwala see Rwala 
Rvwang see Rawang 
Rwala (Ruwala) 279, Map 70 
Rwanda-Rundi Maps 78-80, 86 
Rwanda (Nyarwanda) 312, 326, 
Maps 79-80 
see also Rundi 
Rwo 326, Map 86 
Ryukyuan 184-5, Map 50 


Sa [N. Vanuatu] 108, Map 27 
Sa [Narrow Grass] 326, Map 93 
Sa'apan see Sanapaná-Enenlhit 
Sa'ban 120, Map 41 
Sa'dan 120, Map 39 
Saam [Himalayan] 166 
Saami (Lappish, Saame) 234, 238, 
Maps 56, 58, 64 
Eastern Saami, Kola (Kola Saame, Kola 
Lappish) 238, Map 58 
Inare Saami (Inari Saami) 263, Map 64 
Lule Saami 263, Map 64 
Northern Saami 263, Map 64 
Pite Saami 263, Map 64 
Skolt Saami (Skolt Saame, Skolt 
Lappish) 263, Map 58 
Southern Saami 263, Map 64 
Ume Saami (Umi Saami) 263, Map 64 
Saanich [Northern Straits Salish] 20 
Sa'apan see Sanapaná-Enenlhit 
Saaroa 182, 184, Map 50 
Saba 326, Map 89 
Sabaean 280 
Sabaki 306, 330, Maps 76, 85 
Sa'ban 120, Map 41 
Sabané 74, Maps 16-18, 20-21 
Sabanga 326, Map 89 
Sabela (Waorani) 69, Map 15 
Saberi 114, Map 34 
Sabian 278 
Sac 176-8 
Sacapulteco see Sakapulteko 
Sach Map 47 
Saclan see Miwok, Eastern, Bay Miwok 
Sa‘da [Yemeni] Map 70 
Sa'dan 120, Map 39 
Sad(h)ani (Sadri) 214, Map 51 
Sadochi see Sodochi 
Sadri see Sadani 


Saek 179, 184, Map 47 
Saep 112, Map 33 
Safalaba 326, Map 103 
Safen 326, Map 108 
Safeyoka (Ampale) 111, Map 32 
Safwa 326, Maps 76-77 
Sagala (Saghala) 326, Maps 76, 85 
Sagi see Kamasau 
Sahaptin 8, 21, Maps 4-6, 8-9 
Sahelian-Saharan 283, 307 
Saharan 283, 306-7, 330, Maps 71-73, 
90, 94, 98-100 
Sahelian 307 
Saho [Cushitic] 285-6, Map 72 
Saho-Afar see Afar-Saho 
Sahu [Papuan] 118, Map 37 
Sai [Gumuz dialect] 285 
Sain see Tamang 
Saingbaung Chin see Kcho 
St Lawrence Island Yupik see Siberian 
Yupik, Central 
St Matthias 109, Map 30 
Saipan Carolinian 108, Map 28 
Saisiyat 182, 184, Map 50 
Sajau Basap 120, Map 41 
Sak (Luish) 163, 166-7, 175, Maps 49, 51 
Sak (Atsa', Thet, Arakanese That) 167, 
175 
Saka [Iranian] 216 
Sakaci 326, Map 75 
Sakai 178 
Sakam 110, 112, Maps 32-33 
Sakao 108, Map 27 
Sakapulteko (Sacapulteco, Zacapulteco) 
52, Maps 10-11 
Sakarda 326, Map 99 
Sakata 326, Maps 81, 83 
Sake 326, Maps 87-88 
Saker see Mugil 
Sakha see Yakut 
Saki (Maya) 112, Map 33 
Sakuyu 326, Map 85 
Sal (Baric) 163, 166, 175, Maps 49, 51 
Sala [Bantu] 326, Maps 74, 77 
Salamas [Aramaic subdialect] 278 
Salamat Maps 71-72 
Salampasu-Ndembu Maps 77-78, 81, 84 
Salar 186-7, 193, Map 45 
Saleman 118, Map 37 
Saliba see Saliva 
Salima [Arabic] Map 72 
Salinan 7-8, 21-2, Map 4 
Salina 51 
Salinan-Seri Maps 8-9 
Salishan 7-10, 46, Maps 4-5, 7-9 
Salish see Spokane-Kalispel-Flathead; 
see also Northern Straits Salish 
Sálivan 77, Maps 12-13 
Saliva (Saliba) 77, Maps 12-13 
Salka 326, Maps 100-101 
Sallam-Pasu-Ndembu see Salam-Pasu- 
Ndembu 
Saluan 120, Map 39 
Saluma 75, Map 12 
Sam [Tibeto-Burman] 214 
Sama [Austronesian subgroup] 118, 
Maps 37-40 
Central 119, Map 40 
Northern 119, Map 40 
Sama Bajau (Bajau) 109, 118, 120, 
Maps 37-38, 41 
Sama Bajau (Sama Bajo) 119, 
Maps 38-39 
Sama Bajo (Bajo, Indonesian Bajau) 
120, Map 39 
Sama Sibuguey 119, Map 40 
Southern 119, Maps 40-41 
Western 119, Map 40 
Sama [Bantu: Kimbundu] 326, Map 84 
Sama [Bantu subgroup] Maps 87-88 
Sama 326, Maps 87-88 
Samachique 54 
Samana [creole English] 26, 27 
Samap see Kaiep 
Samar-Leyte 103, 119, Map 40 
Samarokena 114, Map 34 
Samatao [N. Ngwi] 173, 175, Map 46 
Samataw [N. Ngwi] 172 
Samatu [Central Ngwi] 172-3, 175, 
Map 46 
Samba see Daga-Nyonga (Samba Leeko); 
Leko-Nimbari, (Samba) Leko 
Sambalic 119 
Sambal 119, Map 40 
see also Tina Sambal 
Sambo 326, Map 84 
Samburu 284, 326, Maps 72, 85-86 
Sambya [Himalayan] 166 
Samei (Sani) 172, 175, Map 33 
Samia 326, Map 86 
Samish [Northern Straits Salish] 20 
Samo Map 104 
North Samo 324, Map 104 
Samo 326, Map 104 
South Samo 327, Map 104 
Samodi see Samoyedic 
Samogo(-Sembla) Maps 104-105 
Samogo 326, Maps 105, 107 
Samoic-Outlier 107 
Samoan 107, Map 26 
in Australia Map 36 
Samosa 112, Map 33 
Samoyedic (Samodi) 238, Maps 55-56, 58 
Northern Maps 56, 58 
Southern Map 56 
Sampi see Elepi 
Samray 177-8, Map 48 


Samre 177-8, Map 48 
Samre of Siem Reap 177-8, Map 48 
Samukoan (Zamuco) 72, Maps 18-21 
Samwe 326, Map 105 
San [Bantu: Tende-Yans] 326, Map 81 
San Cristobal 109, Map 29 
San Felipe see Rio Grande Keresan 
San Ildefonso see Tewa 
San Juan see Tewa 
Sanaga Maps 92-93 
Sanamai see Mondé 
Sanan Maps 2-3 
San‘ani Map 70 
Sanapaná-Enenlhit (Sa'apan, Enlhet) 
Maps 19, 23 
Sanda 326, Map 106 
Sandawe 307-8, 326, Maps 76, 86 
Sanema (Sanuma) 77, Maps 12-13 
Sanga [Bantu] 326, Maps 77-78 
Sanga [Central Nigerian] 326, Map 97 
Sanggau 120, Map 41 
Sangil 120, Map 40 
Sangir-Minahasan 109, 120, Map 39 
Sangiric 120, Map 39 
Sangir 120, Maps 39-40 
Sangke 112, 115, Maps 33-34 
Sangkong (Buxia) 174-5, Map 46 
Sanglechi 211, 216, Map 53 
Sango [Ubangi] 301-2, 310, 326, 
Maps 82, 89 
as lingua franca 302, 311, 330 
Sangpang (Sangpahang) 165-6, 214, 
Map 51 
Sangtam (Thukumi) 167-8, 213, Map 51 
Sangu [Bantu: Bena-Kinga G60] 301, 
326, Map 76 
Sangu [Bantu: Shira-Punu B40] 301, 326, 
Maps 83, 88 
Sanhaja 282 
Sanhaja of Srair [Berber] Map 73 
Sanhaja [Arabic subgroup] Map 73 
Sani [N. Ngwi] see Samei 
Sani (Ni) [Central Ngwi] 172, 175, 
Map 46 
Sanie (Sa'ngwie) [N. Ngwi] 172, 175, 
Map 46 
Sanio 113, Map 33 
Sanio 113, Map 33 
Sanketic [Nepal Sign] 214 
Sanpoil-Nespelem see Okanagan 
Sanskrit 159, 177, 189, 191, 212-14, 217 
Sansu see Hlersu 
Santa see Dongxiang 
Santa Ana see Rio Grande Keresan 
Santa Clara see Tewa 
Santa Cruz 109, Map 29 
Santa Teresa Cora see Cora 
Santa Ysabel 109, Map 29 
Santali 212, 217, Map 51 
Santali (Santhali) 212-14, Map 51 
Santee Map 2 
Santee-Sisseton 14, Maps 1, 5 
Santhali see Santali 
Santiago del Estero Kechua (Kichua, 
Quechua) 70, Map 23 
Santiam (Central Kalapuyan) 16, Maps 4, 
9 
Santo Domingo see Rio Grande Keresan 
Santomense 302 
Santora [Nara dialect] 290 
Santrokofi (Sele) 326, Maps 102-103 
Sanuma see Sanema 
Sanza 326, Map 80 
Saontal [Santali?] 215 
Sapara 76, Map 12 
Sáparo-Yawan 69, Maps 12-15 
Sáparo 69 
Saparo-Konambo (Zaparo) 69, 
Maps 13-15 
Saparua 118, Map 37 
Sapé see Kaliana 
Sapiny 326, Maps 72, 86 
Saponi see Tutelo 
Saposa-Timputz 108, Map 29 
Saposa Map 29 
Sapotekan (Zapotecan) 46, Maps 8-9 
Sapoteko 46, 49-50, 53, Maps 10-11 
Northern Zapoteko 45, 53, 
Maps 10-11 
Sapuan see Oi 
Sar see Majingay 
Sara-Bagirmi 284, 307, 311, 330, 
Maps 79, 82, 89-91, 99 
Sara 284, 302, 330-1, Map 90 
Saraiki see Siraiki 
Saramaccan [creole English] 27, Map 13 
Sarangani Blaan 120, Map 40 
Sarangani Manobo 119, Map 40 
Sararé 74 
Sarave (Saraveca) 66 
Saraw 11 
Sarcee (Tsutina, Tsuut'ina) 8, 22, 
Maps 6-9 
Sardinian (Italian) 259, 261, Maps 67, 69 
Sardiyya Map 70 
Sarikoli 193 
Sarmi 115, Map 34 
Sarnami [Hindi creole] 27 
'Sart Kalmuck' 240 
Sartuul 188 
Saruga 112, Map 33 
Sarwa 326, Maps 89-90 
Saryg Yugur see Yugur, Western 
Sasak 118, Map 38 
Sasaru-Enwan-Igwe 326, Map 101 
Sasawa 115, Map 34 
Sasime see Biyom 
Sasago [Hadiyya dialect] 287 
Satawalese 108, Map 28 
Sateré-Mawé see Mawé 
Satlej Map 54 
Sau 110, Map 32 
Sauk [Trans-New Guinea: Erap] 110, 
Map 32 
Sauk-Fox-Kickapoo 7, 22, Maps 8-9 
Sauk-Fox (Meskwaki, Mesquakie) 7, 
22, Maps 1, 8-9 
Mascouten Map 1 
see also Kickapoo 
Saulteaux 7, 22, Maps 5, 7-9 
Saurashtri Map 52 
Sause [Papuan: Kaure] 114, Map 34 
Sausi (Uya) [Papuan: Madang] 112, 
Map 33 
Savannah Igbo 326, Map 101 
Savara (Sora) 213, 217, Maps 51-52 
Savi 211, Map 53 
Savosavo 99, 109, Map 29 
Savoyard (French) Map 69 
Savu 118, Map 38 
Sawai 118, Map 37 
Sawkna Maps 71, 73 
Sawntung 174 
Sawos (Gaikunti, Kwongai, Sepik Plains, 
Tshuosh) 113, Map 33 
Sawuy 114, Map 34 
Saxapahaw Map 2 
Saxon 
Low Map 65 
Upper Map 65 
Saya 326, Maps 96-98 
Sayulteko (Sayula Mijean, Sayula 
Popoluca) 51, Maps 10-11 
Sba‘a 279, Map 70 
Sbay’ Map 70 
Scanian (Skåne, East Danish) 263 
Schouten 113, Map 33 
Schwyzertüütsch see Alemannic 
Scottish Gaelic see Gaelic 
Sea Island Creole see Gullah 
Seba 326, Map 77 
Sebob Kenyah 120 
Sechelt 8, 22, Maps 6, 8-9 
Sechura-Katakéoan 69-70, Map 14 
Sechura 69, Map 14 
Secoya see Piohé 
Secwepemc see Shuswap 
Sedang 177-8, Map 48 
Sedere see Adele 
Sedoa 120, Map 39 
Seediq 182, 184, Map 50 
Sefwi 326, Maps 103, 105 
Segai 120, Map 41 
Segeju 326, 330, Maps 76, 85 
Seget 115, 118, Maps 34, 37 
Seimat 109, Map 30 
Seiyap see Siyi 
Sekani 8, 14, 16, 22-3, Maps 6-9 
Sekar 115, Map 34 
Seki 326, Map 88 
Sekmai see Sengmai 
Seko [Sulawesi] 120, Map 39 
Sekpela (Likpe) 301, 321, 326, 
Maps 102-103 
Selaru 119, Map 38 
Selaru 119, Map 38 
Sele [Bantu] 326, Map 84 
Sele [Kwa] see Santrokofi 
Selepet 110, 112, Maps 32-33 
Selish see Spokane-Kalispel-Flathead 
Selkup (Ostyak Samoyed) 234, 238, 
Maps 55-56 
Selpechen [Orogen dialect] 187 
Salte see Silt'i 
Selungai Murut 120, Map 41 
Selwasa 119, Map 38 
Sema (Simi) [Naga] 167, 212-13, Map 51 
Semai 121, 178, Map 44 
Semandang 120, Map 41 
Semang 178 
Low Country 121, Map 44 
Semaq Beri 121, 178, Map 44 
Sembakung 121, Map 41 
Sembla-Dzuun Map 104 
Sembla 326, Maps 104-105 
Seme 305, 332, Maps 104-105 
Seme 326, Maps 104-105 
Semelai 121, 178, Map 44 
Semimi 115, Map 34 
Seminole 22 
Semitic 46, 239, 277-8, 285, 308, 
Maps 59-60, 66-67, 70-73, 90, 
99-100 
see also Ethio-Semitic 
Semnani 211, 216, Map 53 
Sempan 114, Map 34 
Sena-Senga 302, 329, Maps 74-77 
Sena 312, 326, 329, Map 75 
Senagi (Anggor) 99, 113, 115, Maps 33-34 
Senagi see Anggor 
Senari 326, Maps 104-105, 107 
Sende 326, Map 84 
Sene [Trans-New Guinea] 110, Map 32 
Sene-Guinea Map 107 
East Map 107 
Seneca 9, 12, 22, Maps 8-9 
Sened Maps 71, 73 
Senegambian Map 107 
Seneguinea, East 332 
Senga 324, 329, Maps 75-77 
Sengam see Biliau 
Sengele 326, Maps 81, 83 
Senggi 115, Map 34 
Sengmai (Sekmai) 167 


Senhit [Bilin dialect] 286 
Seno Ful 326, Map 104 
Sensi 71 
Sentani 115 
Sentani 99, 115, Map 34 
Senthang (Sentang) 168 
Sentinelese [Andamanese] 217 
Senufo-Gur, Senufo 305, 311-12, 
Maps 103, 107 
Sepa 113, Map 33 
Sepa-Teluti 118, Map 37 
Sepen 113, Map 33 
Sepik-Ramu 111 
Sepik [subgroups] 99, 115, Maps 33-34 
Lower 99, 113, Map 33 
Middle 113, Map 33 
Sepik Plains see Sawos 
Sepik Hill 113, Map 33 
Upper 108, 111, 113, Map 33 
Sera (Ssia) 113, Map 33 
Seram [subgroups] 118, Map 37 
Central 118, Map 37 
East 115, 118, Maps 34, 37 
‘Laut’ 115, 118, Maps 34, 37 
Serbian 232-4, 236, 259, 261, Maps 61, 
65-67 
abroad 232, 260, 263 
in Australia Map 36 
Sere-Ngbaka-Mba Maps 79, 82, 87, 89 
Bviri 
Sere 303, 326, Map 79 
see also Zande 
Sere 303, 326, 330, Map 89 
Serer (Serer-sin) 312, 326, Map 108 
Seri 51, Maps 3, 8-9 
Serkin (Cerquin) 54, Maps 10-11 
Serrano-Kitanemuk 9 
Serrano 9, 22, Maps 3-4, 8-9 
Serudung 121, Map 41 
Serui-Laut 115, Map 34 
Sese [Bantu: Zinza] 326, Map 86 
Sese [Gumuz dialect] 285 
Seselwa 302 
Sesotho see Sotho 
Seta 113, Map 33 
Setaman 112, Map 33 
Seti 113, 115, 118, Maps 33-34, 37 
Setob Kenyah Map 41 
Setswana see Tswana 
Seuta (Shambaa) 304, Maps 76, 85-86 
Severn Ojibwe (Oji-Cree) 7, 22, 
Maps 7-9 
Sewa Bay 110, Map 31 
Seward Peninsula Inupiaq 7, 15, 
Maps 7-8 
Sewee Map 2 
Seychelles Creole French 215-16, Map 52 
Sezo 288-9, Map 72 
Sfax [Arabic dialect] Maps 71, 73 
Sgaw 174-5, Maps 47, 49 
Sha 326, Map 96 
Shabo see Mikeyir 
Shagawu 326, Map 96 
Shakriaba (Xakriaba) 73, Maps 18-19 
Shalkota 283 
Shall-Zwall (Shall-Swall) 326, 
Maps 96-98 
Shamba(l)a 304, 326, Maps 76, 85-86 
Shambiu 326, Map 84 
Shammari 279, Map 70 
Shammar 279, Map 70 
Shan see Dai (South-Western Tai) 
Shan [Tai-Kadai] 167, 179-80, 184, 
Maps 46-47, 49 
Shan Tayok see Dai, Dai Mao 
Shan Tonghsu see Pa-O 
Shanga see Tyenga 
Shangaan see Tsonga, Tsonga-Jonga 
Shange [Atlantic] 326, Map 106 
Shangge [Sal: Rangpan clan] see Tangsa 
Shanghai Chinese [dialect] 163 
Shani 326, Map 97 
Shaninawa (Xaninaua) 71 
Shanjo 326, Maps 74, 77 
Shankali [Luba] 329 
Shangilla see Berta 
Shansu see Hlersu 
Shantou (Swatow) [Mandarin dialect] 
160 
Sharaigol Mongol see Dongxiang 
Sharanahua see Sharanawa 
Sharanawa (Sharanahua) 71, 
Maps 14-17, 20-21 
Shararat Map 70 
Shark Bay 108, Map 27 
Sharwa 326, Map 94 
Shasha 326, Map 77 
Shastan 7-8, 22, 51, Map 4 
Shasta (proper) 8, 22, Map 4 
see also Konomihu; New River Shasta; 
Okwanuchu 
Shatt [Daju dialect] 284, 326, Map 72 
Shausha see Nyuga 
Shavante (Xavante) 73, Maps 18-19 
Shawnee 7, 22, Maps 1, 8-9 
Shayqiyya Maps 71-72 
She [Miao-Yao] 183-4, Map 45 
She [Omotic] 289, Map 72 
see also Gimira (Bench-She) 
Shebaye 67, Map 12 
Sheko (Dors(h)a, Bulla) 288, 289, Map 72 
Shelala-Mazer dialects Map 73 
Shemya see Sinyar 
Shengza 172 
Sherbro 326, Map 106 
Sherdukpen 165, Map 51 
Sherente (Xerente, Xeres) 73, Maps 18-19 


Sheri, Sheri see Jibbali 
Sherpa 164-5, 175, 213-14, Maps 45, 51 
Shetá (Xeta) 74 
Shi-Havu Map 80 
Shi 326, Map 80 
Shigatse see gZhiskartse 
Shikomoro 302 
Shikuyana (Sikiana) 75 
Shila 326, Maps 77-78 
Shilingol [Inner Mongolian dialect] 191 
Shilluk see Colo 
Shimaku see Urarina 
Shimaore [Comorian dialect] 216 
Shimong (Simong) 169 
Shimwali [Comorian dialect] 216 
Shina 211, 213, Maps 53-54 
Shinasha see Bworo 
Shinga 326, Map 77 
Shingazidja [Comorian dialect] 216 
Shingu [Grassfields] 326, Map 93 
Shini Map 73 
Shinkan (Xinka) 54, Maps 10-11 
Shinyar see Sinyar 
Shinzwani [Comorian dialect] 216 
Shipamon see Bamun 
Shipibo (Xipibo) 70-1, Maps 14-16, 20-21 
Shipibo-Conibo 70 
Shipinawa (Xipinawa) 71, Maps 14, 16, 
20 
Shira-Punu 311, Maps 83, 87-88 
Shira 326, Maps 83, 87-88 
Shirazi see Fundi 
Shiriana (Shriana, Xiriana) 66, Map 12 
Shita 285 
Shiwo 326, Maps 87-88 
Shixing 170, 175, Maps 45-46 
Shnini Map 71 
Sho [Karen] see Pho 
Shoa 286, Map 72 
Shobio 326, Map 80 
Shoccoree-Eno Map 2 
Shokleng (Xokleng, Aweikoma) 73, 
Maps 18, 22-23 
Shokó (Xokó) 73, Maps 18-19 
Sholio 326, Maps 96-97 
Shom Peng 178, 217 
Shona 329, Maps 74-75, 77 
Shona 302, 312, 326, Map 77 
Shor 234, 238, Maps 55-56 
Shortlands 108, Map 29 
Shoshone (Shoshoni) 8, 22, Maps 4-6, 
8-9 
see also Panamint 
Showa 326, Map 81 
Shqip see Albanian 
Shriana see Shiriana 
Shua 326, Maps 74, 77 
Shuara see Hivaro-Kawapanan, Hivaro 
Shugni-Roshani 237 
Shughni (Shugni) 211, 216, 234, 
236-7, Maps 53, 57 
Shui [Kam-Sui] see Sui 
Shui Yi (Shuitian) [Ngwi] 172-3 
Shukriyya 281, Map 72 
Shukurú (Xukurú) 73, Maps 18-19 
Shumashti 211, Map 53 
Shuswap (Secwepemc) 8, 22, Maps 6-9 
Shuwa 311, 326, 331, Maps 90, 99 
Siagha-Yenimu 114, Map 34 
Sialum 110, 112, Maps 32-33 
Siane 110, 112, Maps 32-33 
Siane 110, 112, Maps 32-33 
Siang 120, Map 41 
Siar 109, Map 30 
Siau 113, Map 33 
Siberian Tungusic 238 
Siberian Yupik 239 
Central 7, 12, Maps 7-8, 55 
Sibo (Sibe) see Xibo 
Siboma see Numbami 
Sibundoy see Kamsa 
Sicilian (Italian) 261, Map 67 
Sidaama see Sidamo 
Sidaba (N.W. Jiarong) 170 
Sidamo (Sidaama) 285, 287, Map 72 
Sie 108, Map 27 
Sierra Miwok see Miwok, Eastern 
Sierra Popoluca 51 
Sighau 326, Map 85 
Sigu 326, Maps 83, 88 
Siha [Bantu] 326, Map 86 
Sihan [Trans-New Guinea] 112, Map 33 
Sikaiana 107, 109, Maps 26, 29 
Sikiana see Shikuyana 
Sikka 119, Map 38 
Siksika see Blackfoot 
Sila [Daju dialect] 284 
Sila [Ngwi] 174-5, Map 47 
Silakau 120, Map 41 
Sile 326, Map 80 
Sileibi 111, Map 33 
Sileti see Sylheti 
Siliput 113, Map 33 
Silisili 111, Map 32 
Silopi 112, Map 33 
Silt'i (Salt'e) 285, 288, Map 72 
Silver Palaung 176 
Simalungan 121, Map 43 
Simbari 111, Map 32 
Simbo 109, Map 29 
Simeulue 121, Map 43 
Simi see Sema 
Simog 112, Map 33 
Simong see Shimong 
Simte 168 
Sinagoro-Keapara 110-11, Maps 31-32 
Sinagoro 110-11, Maps 31-32 
Sinai Map 70 
Sinauna 119, Map 40 
Sindhi Maps 53-54 
Sindhi 211-14, 217, Maps 53-54 
Sinesip 108, Map 27 
Singgie 120, Map 41 
Singpho 180, Map 51 
Singsali, Singsili see Phunoi 
Sinhalese (Sinhala) 214, 217, Map 52 
Sinji 326, Map 84 
Sinkyone see Eel River Athabaskan 
Sino-Tibetan 46, 159, 239, Maps 45-50 
Sinitic see Chinese (Sinitic) 
see also Tibeto-Burman 
Sinsali see Phunoi 
Sinsauru (Kow) 112, Map 33 
Sinufana (Cenufara) 63, Map 12 
Sinyar 283, 285, Maps 71-72 
Sinyar (Shinyar, Shemya) 285, 326, 
Maps 71-72, 89 
Sio 111, 113, Maps 32-33 
Siouan-Catawban 7, 10, 46, Maps 1-2, 
5-9 
Siouan 9, Maps 2, 5-9 
Sioux 9-10, 14, 22, 27, Maps 5, 8-9 
Sipacaperse see Sipakapenyo 
Sipakapenyo (Sipacaperse) 52, 
Maps 10-11 
Siraiki (Saraiki) 211-12, Maps 53-54 
Sirak 111, Map 32 
Sirasira 111, Map 32 
Siraya 182, 184 
Sirenik [Central Siberian Yupik dialect] 
12 
Sirenik (Old Sirenik) 7, 22 
Sirhan Map 70 
Siri 326, Map 97 
Siriana 68, Maps 12-13, 16 
Sirio 111-12, Maps 31-32 
Sirionó 74, Maps 17, 20-21 
Sirkai 114, Map 34 
Sirmauri Map 54 
Sirohi Map 54 
Siroi (Suroi) Map 33 
Sisaala 326, Maps 103-105 
Sisi' [Besisi dialect] 178 
Sisi-Bipi 109, Map 30 
Sisimin see Hewa 
Siska (Tonga) [Bantu: Manda] 326, 
Maps 74-76 
Sissano 113, Map 33 
Sistani 216, Map 53 
Siswati see Swazi (Swati) 
Sita (E. Jiarong) 170 
Siti 326, Maps 103-105 
Siuslaw 7-8, 22, Map 4 
Siwa (Siwa) [Berber] 282, Map 71 
Siwai [Papuan] 109, Map 29 
Siwu see Akpafu-Lolobi 
Siya see Avatime-Yangbo, Avatime 
Siyi (Seiyap) [Yue Chinese dialect] 160, 
162 
Siyin [Kuki-Chin] 168 
Siyu 326, Map 85 
Sizaki 326, Map 86 
Siziwang see Dorvon Hüühed 
Skåne see Scanian 
Ske 108, Map 27 
Skofro see Manem 
Skolt Saami (Skolt Lappish) Map 58 
Skou 99, 108, 111-12, 115, Maps 33-34 
Skou (Tumawo, Tumawoi) 112, 115, 
Maps 33-34 
Sküüxs see Tsimshianic, Maritime 
Tsimshian 
SKwXwu7mesh snichim see Squamish 
Slave (South Slavey) 8, 22-3, 
Maps 7-9 
Slavey 23 
North 8 
see also Bearlake; Hare; Mountain 
Slavey 
Slavic see Macedonian 
Slav(on)ic 259, Maps 45, 50, 55-62, 
64-67 
East 236, 259 
South 236, 259, Maps 61-62 
West 236, 259 
Slovak 232-4, 236, 259-60, Maps 58, 
61-62, 65, 67 
Slovene (Slovenian) 232-3, 236, 259-61, 
Maps 61, 65, 67 
Slit Map 70 
Sm'algyax see Tsimshianic, Maritime 
Tsimshian 
So (Soo) [Bantu: Maka-Njem] 326, 
Map 87 
So [Bantu: Poko-Kele] 326, Map 79 
So [Katuic] 178, Map 47 
So [Kuliak] 326, Map 86 
So'a 118, Map 38 
Sobei 115, Map 34 
Socotri see Soqotri 
Soda [Tabi dialect] 284 
Soddo (Kastaninna, Aymallal) 288, 
Map 72 
Sodochi (Sadochi) Map 54 
Sog Mongol 192 
Soga 312, 326, Map 86 
Sogap see Ngala 
Sogdian 187, 189, 216 
Sogoba see Sumariup 
Sohra [Khasi dialect] 178 
Sok see Oi 
Sokean 51, Maps 10-11 
Isthmus Sokean 51n, Map 10 
Soke (Zoque) 50-1, Maps 10-11 


Sokoro Maps 89-90 
Sokoro 326, Maps 89-90 
Sokte 168 
Solano 7, 9, 23, Map 3 
Soli 326, Maps 74, 77 
Solomon Pijin (Solomon Islands Pidgin 
English) 102, 106, Map 25 
Solomonic Chain 
Central 99, 107, Map 29 
North-West 108, Map 29 
see also New Ireland 
South-East 109, Map 29 
Solon see Evenki (Ewenke) 
Solongo 326, Map 84 
Solos 108, Map 29 
Som [Austronesian] see Wab 
Som [Finisterre-Huon] 110, 112, 
Maps 32-33 
Somahai 114, Map 34 
Somali (Soomaali) 285, 287, 310, 326, 
Maps 72, 85 
abroad 261, 302 
Common Map 85 
Northern 287, Map 72 
Some [West Kutubu] 111, Map 32 
Somrai-Miltu Maps 89-90 
Somrai 326, Map 90 
Somyewe 326, Map 95 
Son Wa 176 
Sona (Kanasi) 110, Map 31 
Sondi [Bantu: Central] 326, Maps 81, 
84 
Sondi [Nosu dialect] 172 
Song (Kasong) 177-8, Map 48 
Songhay (Songhai) 283, 306-7, 331-2, 
Maps 73, 102, 104 
Songhai 312, 326, Maps 73, 102, 104 
Songo 326, Map 80 
Songu 326, Map 84 
Songum 112, Map 33 
Songye 311, 329, Maps 78, 80-81 
Songye 326, Maps 78, 80 
North 324, Map 78 
South 327, Map 78 
Songye (Luba-Lulua) [Luba] 326, 
Map 81 
Soni 326, Map 80 
Sonia 110, Map 32 
Sonid [Shilingol subdialect] 191 
Soninke(-Samogo) Maps 104, 107-108 
Soninke 312, 326, Maps 104, 107-108 
Sonjo 326, Map 86 
Sonoran 53-4, Maps 8-9 
Sonsorolese 108, Map 28 
Soo see So [Bantu: Maka-Njem] 
Soomaali see Somali 
Soqotri (Socotri) 281, 287, Map 70 
Sora-Gorum Maps 51-52 
Sora see Savara 
Sorbian (Wendish) 231, 236, 259, 261, 
Map 65 
Sori-Harengan 109, Map 30 
Soro [Hadiyya dialect] 287 
Sorsogon (Masbate) 119, Map 40 
Sorsogon (Waray) 119, Map 40 
Sotang (Sotange) 165-6, Map 51 
Soteapaneko 50-1, Map 11 
Sotho-Tswana 329, Maps 74-75, 77, 84 
Sotho (Sesotho) 310, 312, 327, 332-3 
North 324, Maps 74-75 
South 327, 329, Map 74 
Sou see Su 
South Bird's Head see Bird's Head 
Southern Adelbert Range-Korak-Waskia 
99, 111, Map 33 
Southern Adelbert Range 111, Map 33 
Southwestern Caribbean Creole English 
26, Map 13 
Soviet Azerbaijanian [Aramaic subgroup] 
Map 70 
Sowa 108, Map 27 
Sowanda see Waina 
Soyod (Soyots) see Tsaatan 
Sozo 327, Map 80 
Spanish 49, 259, 262, Maps 68-69 
abroad 234, 261, 263 
in Africa 311, Map 73 
in Australia Map 36 
in North America 26-7, 27, 
Maps 8-9 
in South and Meso-America 26, 
Maps 11, 13, 17, 19, 21, 23 
creoles 26, 27, 103, 262, Maps 13, 25, 
40 
see also Chavacano 
Trade Spanish 27 
Spiti (Piti) 164, Map 54 
Spokane-Kalispel 8, 23, Maps 5-9 
Flathead (Montana Salish) 23, 
Maps 4-7 
Squamish (SKwXwu7mesh snichim) 8, 
23, Maps 6, 8-9 
Sranan [creole] 27, 262 
Sre [Koho dialect] 177 
Sri Lankan Creole Malay 214, Map 52 
Ssia see Sera 
St'at'imcets see Lillooet 
Standard Bikol 103, Map 40 
Stieng 177-8, Map 48 
Sto:lo see Halkomelem 
Stoney 9, 14, 23, Maps 5-9 
Straits Malay see Baba Malay 
Straits Salish see Klallam; Northern Straits 
Salish 
Strickland, East 99, 99, 110, 112, 
Maps 32-33 
Su (Sou, Suq) [Brao dialect] 177 
Su-ung (Saoch) 177 
Sua-Gola Map 108 
Sua Map 108 
Sua see Masoanka 
Suau 109, Map 31 
Suau 102, 105-6, 109, Map 31 
Suay see Kuy 
Suba 327, Map 86 
Subanen see Subanon 
Subanon 119, Map 40 
Central Subanon 119, Map 40 
Kolibugan Subanon 119, Map 40 
Lapuyan Subanun 119, Map 40 
Northern Subanon 119, Map 40 
Western Subanon 119, Map 40 
Subanun see Subanon 
Subi 327, Map 80 
Subinba 49 
Subiya Maps 74, 77 
Subiya 327, Maps 74, 77 
Subre-Kaberwa 327, Map 105 
Sudanese Arabic 279-80 
colloquial 282 
Sudanic 282, 283 
Central 283-4, 307, 309, 330-1, 
Maps 79-80, 82, 86, 90-91, 99 
East 283-4, 306-7, 330, Maps 71-72, 
76, 79, 85-86, 90 
Eastern 283-4 
Western 283, 284 
Sudest 110, Map 31 
Suena 111, Map 32 
Suerre 64 
Suga-Vute Map 93 
Suga (Nyamnyam) 327, Maps 91, 95 
Sugpiak, Sugpiaq see Alutiiq (Pacific 
Yupik) 
Sugurti 327, Map 99 
Sui (Shui) 179, 184, Map 45 
Suiá see Suya 
Suki [Trans-New Guinea] 111, Map 32 
Suku [Bantu: Yaka] 327, Maps 81, 84 
Sukuma-Nyamwezi 312, Maps 76-78, 80, 
86 
Sukuma 327, Map 86 
Sukurum 111, Map 32 
Sula [Austronesian] 118, Map 37 
Sulawesi, Central 120, Maps 37, 39 
East Map 39 
South 120, Map 39 
West Central 120, Map 39 
Sulaymaniyya 278, Map 70 
Sulaymi 281 
Sulka 99, 109, Map 30 
Suluk Map 41 
Sulung (Poh yut) 169, Maps 45, 51 
Suma 327, Map 90 
Suma-Jumano Map 3 
Sumambu-Tagal 121, Map 41 
Sumariup (Sogoba) 113, Map 33 
Sumatra Malay 121, Map 43 
Sumau (Garia) 112, Map 33 
Sumbanese, East 118, Map 38 
Sumbawa 118, Map 38 
Sumbe 327, Map 84 
Sumbwa (Mweri) 327, Maps 80, 86 
Sumu-Ulwa 61, 65, Maps 12-13 
Sundanese 97, 103, 121, Map 42 
Sundi 327, Maps 83-84 
Sunga 327, Map 86 
Sungor 284 
Sunwar (Koints, Sunuwar, Bahrathar) 
165, 214, Map 51 
Suomi see Finnish 
Suoy 177-8, Map 48 
Supyire 327, Maps 104-105, 107 
Suq see Su 
Sura 327, Maps 96-97 
Surbakhal 283 
Surel 165, Map 51 
Surgujiya Map 51 
Suri [Omotic] 289 
Surigaonon 119, Map 40 
Surma [Surmic] 283, 284-5, 289, 307 
Surma see Tirma 
Suroi see Siroi 
Sursurunga 109, Map 30 
Suruaha see Zuruaha 
Surubu 327, Map 97 
Surui do Jiparana [Tupi] 75 
Surui [Tupi, spoken in Para] 75, Maps 17, 
21 
Sus Map 73 
Susquehannock 9, 23, Map 1 
Susu-Yalunka Maps 106-108 
Susu 311, 327, 332, Maps 106-107 
Sutiaba 45, 49-50, 53, Map 10 
Sutiaba-Maribio 61 
Suwal see Kwalhioqua-Clatskanie 
Suwawa 120, Map 39 
Suya (Suiá) 72, Maps 18-19 
Svan 234-5, 238, Map 59 
Svea Swedish Map 64 
Swahili 285, 290, 301-2, 306, 309, 327, 
329-30, Maps 52, 72, 75-76, 85 
Kiswahili 302, 310 
as lingua franca 303, 310-12, 329 
Swaka 327, Map 77 
Swampy Cree 7, 13, 19, Maps 1, 7-9 
Eastern 14, 25, Maps 1, 7-8 
Western 25, Maps 7-8 
see also Moose Cree 
Swati see Swazi 
Swatow see Shantou 
Swazi (Swati) 312, 327, 329, 332-3, 
Maps 74-75 
Siswati 310 


Swedish 233-4, 259, 263, Maps 62, 64 
Eastern Map 64 
South Map 64 
Swoeng 177, Maps 47-48 
Sya see Bobo 
Sylheti (Sileti) 215, 259, 263, Map 51 
Synteng see Pnar 
Syriac, Classical 278 
Syrian dialects 
Central Map 70 
North Map 70 
Syro-Palestinian 280, Map 70 


T'in see Mal 
Ta'aysha Maps 71-72 
Ta-ang see Palaung-Wa, Palaung 
Ta-Maipurean 66 
Ta-oih 177-8, Maps 47-48 
Taa Map 74 
Taari 327, Map 91 
Tabar 109, Map 30 
Tabar 109, Map 30 
Tabasaran 234, 239, Map 59 
Tabi (Gaam, Ingessana) 284, Map 72 
Tableng see Konyak 
Taboan 327, Map 93 
Tabwa 327, Maps 77-78 
Tac Cui see Ruc 
Tacame see Takame 
Tacana see Takana 
Tachi [Yokuts dialect] 25 
Tachoni 327, Map 86 
Tadakshak 327, Map 104 
Tadashak Map 73 
Tadmur see Palmyra (Qaltu) 
Tadyawan 119, Map 40 
Tadzhik see Tajik 
Taensa Map 2 
Tafi 327, Map 103 
Tafire 327, Maps 104-105 
Tagabawa 119, Map 40 
Tagakaulu 119, Map 40 
Tagalog 103, 119, Map 40 
abroad 26-7, Map 36 
see also Filipino 
Tagbana Maps 104-105 
Tagbana 327, Map 105 
Tagen (Tagin) 169 
Tagish 8, 16, 23, Map 7 
Tagna Uriankhai see Tsaatan 
Tagoli Map 72 
Tagu 173, 175, Map 46 
Tagvi Samoyed see Nganasan 
Tahitian 102, 106-7, Maps 25-26 
Parau Tinito [pidgin] 102 
Tahltan 8, 23, Maps 7-9 
rTahu 164, 175, Map 45 
Tahue Map 3 
Tai-Kadai (Thai-Kadai) 97, 109, 121, 159, 
179, 184, 217, Maps 44-47, 49 
Kadai 181-2, Maps 45-47 
Kam-Tai Maps 44-47 
Tai (Thai) 49, 104, 121, 162, 174, 
179-81, 184, 195, Maps 44-48, 
51-57 
North-east see Lao (Isan) 
Northern see Yuan 
South-western see Dai 
Southern see Pak Tay 
Tai Daeng (Red Tai, Tai Neua) 
180, 184, Map 47 
Tai Dam (Black Tai) 179-80, 184, 
Maps 47-48 
Tai Khaw (White Tai, Tai Don) 
179-80, 184, Map 47 
Tai Mao see Dai, Dai Mao 
Tai Neua see Dai, Dai Neua 
Thai-Malay 121, 181 
Taiap see Gapun 
Taihu see Chinese (Sinitic), Central, Wu 
Taikat (Arso, Abrap) 112, 115, 
Maps 33-34 
Tain see Taraon ‘Mishmi’ 
Taino 66, Map 12 
Tairona 64, Map 12 
Tairora 110, Map 32 
Tairora 110, Map 32 
Taisun see Tashon 
Taita 304, Map 85 
Taivuan 182 
Taiwan Austronesian 161, 182, 184, 
Map 50 
Eastern 182, 184 
North-Western 182, 184 
Western Plains 182, 184 
Taiwan Mandarin (Guoyu) see Chinese 
(Sinitic), Northern 
Taiwanese (Taiyu) see Chinese (Sinitic), 
Southern, Hokkien 
Taizhou see Chinese (Sinitic), Central, Wu 
Taje see Petapar 
Tajik (Tajiki, Tadzhik) 211, 216, 236-7, 
Maps 45, 53, 57 
abroad 192-3, 235 
Tajio see Kasimbar 
Tajuos 327, Map 106 
Tak Bay (Chehe) [Tai] 181 
Takame-Jaruroan Maps 12-13 
Takame (Tacame) 68, Map 12 
Takanan 71, Maps 16-17, 20-21 
Takana (Tacana) 71, Maps 20-21 
Takelma(n) 7-8, 23, Map 4 
Takestani 211, 216, Map 53 
Takia [Austronesian: Vitiaz] 113, Map 33 
Takic [Uto-Aztecan] 8, Maps 3, 8-9 
Takua [Bahnaric] 177-8, Maps 47-48 
Takum 327, Map 95 


Takunyapé 74, Map 18 
Takuu 107, 109, Maps 26, 29 
Tal 327, Map 96 
Tala 327, Maps 97-98 
Talamanka 64 
Talaud 120, Map 39 
Tale [Bantu: Songye L20] 327, Map 80 
Tale [Gur-Adamawa: Mbum] 327, 
Map 90 
Taleshi (Talishi, Talyshi) 211, Maps 53, 59 
see also Harzani 
Taliabo 118, Map 37 
Talinge 327, Maps 79-80 
Talise 109, Map 29 
Talishi 234-5, 237 
Talni 327, Maps 102-104 
Talodi 304, Maps 71-72 
Talodi 282, Map 72 
Talu 173, 175, Map 46 
Taluy [Katu dialect] 177 
Talyshi see Taleshi 
Tama [Nilo-Saharan] 283, Maps 71-72 
Tama 283-4, Maps 71-72 
Tama [Trans-New Guinea subgroup] 113, 
Map 33 
Tamagario Map 34 
Tamambo 108, Map 27 
Taman [Sal: Luish] 167, 175, Map 49 
Tamang (Murmi, Ishang, Sain) 
[W. Bodish] 164-5, 214, Map 51 
Tamanghaset Map 73 
Tamarkhole Limbu 166 
Tamasheq [Berber] 312, 327, Maps 73, 
100 
Tamazight [Berber] 262, Map 73 
Tambago 327, Maps 79, 89-90 
Tambanua 121 
Tambas 327, Map 96 
Tambo 327, Maps 75-77 
Tambotalo 108, Map 27 
Tambuana Map 41 
Tamezret Maps 71, 73 
Tami 111, Map 32 
Tamil 162-3, 212-13, 214, 217, Map 52 
abroad 181, 195, 216, 263 
Tamlu see Phom 
Tamok 284 
Tamolan 113, Map 33 
Tampril [Katu dialect] 177 
Tampuan 177-8, Map 48 
Tampulma 327, Maps 102-104 
Tamu see Gurung 
Tamyen see Costanoan 
Tanacross 8, 23, Maps 7-9 
Tanah Merah 99, 99, 114-15, Map 34 
Tanaina (Dena'ina) 8, 23, Maps 7-9 
Tanana 8, 23, Maps 7-9 
Lower 8, 17, 23, Maps 7-9 
Upper 8, 23-4, Maps 7-9 
see also Kolchan (Upper Kuskokwim); 
Tanacross 
Tanchaynga 215 
Tanda (Tenda) [Atlantic] 332 
Tandia 115, Map 34 
Tangale 327, Map 98 
Tangam 169 
Tanger [Arabic dialect] Map 73 
Tangga 109, Map 30 
Tangjan see Wancho 
Tangjap see Nocte 
Tangkhul 168, 175, 213, Maps 49, 51 
Tangkic (Tangic) 100, 115-16, Map 35 
Tanglang 173, 175, Map 46 
Tanglapui 119, Map 38 
Tangoa 108, Map 27 
Tangsa (Rangpan) 167, 175, 213, 
Maps 49, 51 
Tangsarr see Dvngsar 
Tangu 113, Map 33 
Tanguat 113, Map 33 
Tangut see Xixia 
Tangwang [mixed language] 193, Map 45 
Tani (Miami) 112, Map 33 
Tanima 109, Map 29 
Tanimbili 109, Map 29 
Tanjong 120, Map 41 
Tanna 108 
North 108, Map 27 
South-West 108, Map 27 
Tannu Uriankhai (Malchin Tuva) 188 
Tano (Hano, Arizona Tewa) [Kiowa- 
Tanoan] 9, Maps 3, 5 
Tano [Kwa subgroup] 332, Maps 103-105 
Central Maps 102-103, 105 
West Map 105 
Tanima Map 70 
Tanzanian Nyanja see Nyasa 
Tao-Suamato 111 
Taogwe 115, Map 34 
Taokas 182, 184 
Taori-Kei 115, Map 34 
Taori-So 115, Map 34 
Taos 9, 23, Maps 5, 8-9 
Tapachulteko (Tapachula Mijean, 
Tapachulteco) 51, Map 10 
Tapang see Thakali 
Taparita 65, Map 12 
Tape see Marakhus 
Tapirapé 74, Maps 18-19 
Tapshin 327, Maps 96-97 
Taracahitic 9, Maps 3, 8-9 
Tarahumara (Taraumara) 54, Maps 3, 
8-9 
Tarairiu 73 
Tarali Kham see Kaike 
Taram 327, Map 95 
Tarao [Kuki-Chin] 168 


Taraon ‘Mishmi’ (Taruang, Tain, Taying, 
Darang Deng) 169, 175, Maps 45, 
49, 51 
Tarascan see Tarasko 
Tarasko (Purepecha) 50, 54, Maps 10-11 
Taraumara see Tarahumara 
Tareng [Katuic: Katu dialect] 177 
Tareng [N.W. Bahnaric] 177-8, 
Maps 47-48 
Tariana (Tariano) 66, Maps 12-13 
Tarifit [Berber] 262 
Tarof 114, Map 34 
Tarok(oid) 305, 306, Maps 96, 98 
Tarok 327, Maps 96, 98 
Tarpia 115, Map 34 
Taruang see Taraon ‘Mishmi’ 
Taruma 73, Map 16 
Tarunggare 115, Map 34 
Tashelhait Map 73 
Tashon (Taisun) 168 
Tasmanian 100, 118, Map 35 
Tasmate 108, Map 27 
Tat 234-5, 237, Map 59 
Tatalpepec Chatino 53 
Tatana 121, Map 41 
Tatar 186-7, 232, 234-8, Maps 45, 55-56, 
58 
Abakan (Khakas) 238 
Alabugat 234, 238 
Baraba 234, 238 
Chulym (Chulym Turkic) 234, 238 
Crimean 234, 236, 238, 260, Map 61 
Yurt(a) 234, 238 
see also Khakas 
Tataviam 23, Maps 3-4 
Tate (Raepa-Tati) [Trans-New Guinea] 
111, Map 32 
Tati [Indo-Iranian] 216, Map 53 
sTau see Ergong 
Tauade 111, Map 32 
Taulil 99, 109, Map 30 
Taungthu see Pa-O 
Taungyo 171, 175, Map 49 
Taupota 110, Map 31 
Taupota 110, Map 31 
Taurap 115, Map 34 
Tause 99 
Taushiro (Pinchi) 69, Map 15 
Tausug 119, Map 40 
as lingua franca 120, Map 40 
Tauya (Inafosa) 112, Map 33 
Tavara (Tawara) 327, Map 75 
Taveta 327, Map 85 
Tavoyan 171, 175, Map 48 
Tavringer see Romani 
Tawala 110, Map 31 
Tawana 327, Map 74 
Tawara see Tavara 
Tawasa see Timucua(n) 
Taworta 115, Map 34 
Tawr 168 
Tay (Tho) 179, 184, Map 47 
Tay Tum see Tum 
Tayari 327, Maps 102, 104 
Taying see Taraon ‘Mishmi’ 
Tayyt 279, Map 70 
Tazhi 173, 175, Map 46 
Tboli 120, Map 40 
Tchambuli see Chambri 
Te’ 327, Map 93 
Te:nek see Wasteko 
Teberan 99, 99, 111, Map 32 
Tebilung 121, Map 41 
Tecoxquin see Tekoxkin 
Tectiteco see Teko 
Tecual see Wichol (Huichol) 
Tecuexe see Tekwexe 
Teda-Daza 283, Map 99 
Teda-Daza 301, 327, 331, Map 100 
Tee 327, Map 83 
Teen Maps 103-105 
Teere 327, Map 91 
Tegali Maps 71-72 
Tegali Map 72 
Tegbo 327 [under Tafi], Map 103 
Tege 327, Map 83 
Tegesye 327, Maps 103-105 
Tagre see Tigre; Tigrinya 
Tehit 115 
Tehit 115, Map 34 
Tehnu [Nicobarese dialect] 178 
Tehueco see Teweko 
Teiwa 119, Map 38 
Teke 311, 327, Maps 81, 83, 88 
East Teke 316, Maps 81, 83 
West Teke 328, Maps 81, 83, 88 
Tekiraka-Kenichana 68, Maps 12-14 
Tekiraka (Tequiraca, Avishiri) 68, 
Maps 12, 14 
Teko (Tectiteco) 52, Maps 10-11 
Tekoxkin (Tecoxquin) 53-4, Map 10 
Teksispetekenyo 51, Map 11 
Tekwexe (Tecuexe) 53-4, Map 10 
Telefol 112, Map 33 
Telengic 240 
Teleut 234, 238 
Telugu 212-13, 214, 216-17, 233, 
Maps 51-52 
Tem see Kotokoli 
Tembé see Tenetehara 
Tembo [Bantu: Bira-Huku D30] 327, 
Map 80 
Tembo [Bantu: Luba L30] 327, 
Maps 77-78 
Teme 327, Map 98 
Temein 283-4, Maps 71-72 
Temein (Ronge) 284, Maps 71-72 


Temiar 121, 178, Map 44 
Temne 309, Maps 106-107 
Temne 312, 327, 332, Map 106 
West Temne 328, Map 106 
Temo’ (Temoq) 178 
Temuan 121, Map 44 
Ten see Rao [Kam-Sui] 
Ten'en see Maniq 
Tenda see Tanda 
Tende-Yens (Tende-Yans) Maps 81, 83 
Tenetehara (Guajajara, Tembé) 74, 
Maps 14, 16, 18-19 
Tengima 167 
Tengo (Mamvu) 327, Map 79 
Tengu Kan 327, Map 104 
Tenharim see Parintintin 
Tenis 109, Map 30 
Teochiu (Teochew) [Hokkien dialect] 
103, 104, 181 
see also Chaozhou; Chinese (Sinitic), 
Southern, Hokkien 
Teop 108, Map 29 
Teor-Kur 115, 118, Maps 34, 37 
Tepekano (Tepecano) 54 
Tepewa (Tepewan, Tepehuan) 51, 54, 
Maps 10-11 
Tepiman 9, 53-4, Maps 3-4, 8-9 
Tepecano Map 3 
Tepehuan Maps 3, 8-9 
Tepo 327, Map 105 
Tequesta Map 2 
Tequiraca see Tekiraka-Kenichana, 
Tekiraka 
Tequistlateco see Chontal 
Tera [Biu-Mandara subgroup] Maps 91, 
94, 98 
Tera 327, Map 98 
Tera [Chadic: Bura] Map 100 
Tereila 101, 117, Map 35 
Terena (Tereno) 62, 66, Maps 18-20, 22 
Teressa [Nicobarese dialect] 178, 217 
Teribe (Terraba, Naso) 64, Maps 12-13 
Terik 327, Map 86 
Ternate (Ternatese) [Papuan] 103-4, 118, 
Maps 25, 37 
Ternateño [Chavacano dialect] 103 
Terraba see Teribe 
Teso-Maa (Teso-Mafro-Asiatic) 284, 307, 
Map 86 
Teso-Turkana 284, 307, Maps 72, 86 
Teso 284, 312, 327, Maps 72, 86 
Tesuque see Tewa 
Tête de Boule (Attikamek) 7, 10-11 
Tetela 329, Maps 78, 80-81 
Tetela 327, Maps 78, 80-81 
Teteté 68, Map 13 
Teton (Lakota) 14, Map 5 
Tetouan [Arabic] Maps 71, 73 
Tetun (Tetum) 103-4, 119, Maps 25, 38 
Teueia see Makushi 
Teun 119, Map 38 
Teuso see Ik 
Teve 327, Maps 74-75 
Tewa (Tesuque) 9, 23, Maps 3, 8-9 
Arizona Tewa (Hano, Tano) 9, Maps 3, 5 
Rio Grande Tewa Map 5 
Teweko (Tehueco) 54 
Tewelche (Tehuelche) 71, Maps 22-23 
Teweya see Makushi 
Thado 168, 175, 213, Maps 49, 51 
Thadou 168, 175 
Thagicu 330, Maps 85-86 
see also Bantu, N.E. Interior 
Thai see Tai-Kadai, Kam-Tai, Tai (Thai) 
Thakali (Tapang) 164-5, 214, Map 51 
Thalanyji 100, 116, Map 35 
Thami (Thangmi) 165, 214, Map 51 
Thaminyi-Kod(h)in 284, Map 71 
Thangmi see Thami 
Thao 182, 184, Map 50 
Tharaka 327, Maps 72, 85-86 
Tharrgari 100, 116, Map 35 
Tharu see Rana Tharu 
Thavung 177-8, Map 47 
Thawa 102, 118, Map 35 
Thaypan 101, 117, Map 35 
Thebor see Kanauri 
Thembu 327, Map 74 
Thet see Sak 
Thiin 100, 116, Map 35 
Thiro Map 72 
Tho [Tai-Kam] see Tay 
Tho [Vietic] see Ruc 
Thompson 8, 23, Maps 6-9 
Three Rivers 118, Map 37 
Thukumi see Sangtam 
Thulung 165, 214, Map 51 
Thura-Yura 101, 117, Map 35 
Tianba 172 
Tiang 109, Map 30 
Tiba 327, Map 98 
Tibea 327, Map 93 
Tibeto-Burman 46, 159, 163, 213, 217, 
Maps 45-49, 51, 53-54 
Central Map 46 
Northeastern (Qiangic, Xifan, Hor) 
163, 169-70, Map 46 
Southeastern (Burmic) 163, Map 46 
Tibetan 163, 170, 175, 188-9, 191, 
213-14, Maps 45, 49, 51, 53 
Central 163-4, Map 45 
Eastern (Khams) 163-4, 175, 
Map 45 
liturgical 186 
Monpa Map 45 
North-Eastern (Amdo) 164, 175, 
Map 45 
Southern 163-4, 175, Map 45 
Western 163-4, 175, Maps 45, 53 
see also Bodic; Burmese-Ngwi; 
Himalayan; Karenic 
Tibor-Omosa 112, Map 33 
Tichurong 164, Map 51 
Tiddim Chin see Kamhau Chin 
Tidikelt Map 73 
Tidong 121, Map 41 
Tidong 121, Map 41 
Tidore (Tidorese) 103-4, 118, Maps 25, 
37 
Tiene 327, Maps 81, 83 
Tiepo Maps 104-105 
Tiepo (Tyepo) 327, Maps 104-105 
Tifal 112, Map 33 
Tigak 109, Map 30 
Tigania 327, Map 85 
Tige 327, Map 95 
Tigre 285-7, 290, Map 72 
Tigrinya 277-9, 285, 287, 302, Map 72 
Tigwa Manobo 119, Map 40 
Tihama [Yemeni dialects] 280, Map 70 
Tiharo 327, Map 74 
Tika [Kunama dialect] 290 
Tikar 305, 306, Maps 93, 95 
Tikar 327, Maps 93, 95 
Tikopia 107, 109, Maps 26, 29 
Tikrit (Qaltu) Map 70 
Tikuna (Ticuna) 68, Maps 15-17 
Tikuu see Bajun 
Tilapa Otomi 53, Map 10 
Tillamook 8, 23, Maps 4, 6 
Tilung 165-6, 214, Map 51 
Timasinin Map 73 
Timbaro (Qabena) 287, Map 72 
Timbe 110, 112, Maps 32-33 
Timbira (Timibira, Krikati-Timbira) 72, 
Maps 18-19 
Timbisha [Panamint] 21 
Timibira see Timbira 
Timor Area [Austronesian subgroup] 119, 
Map 38 
Timor and Islands 119, Map 38 
Timor [Papuan subgroup] 
East Timor 99, 119, Map 38 
West Timor-Alor-Pantar 99, 119, 
Map 38 
Timotéan 63, Map 12 
Timote-Kuika 63, Map 12 
Timputz 108, Map 29 
Timucua(n) 7, 9, 23, Map 2 
Timugon 121, Map 41 
Tina Sambal 119, Map 40 
Tinan (Gond(h)la) 165, Map 54 
Tindi 233-4, 239, Map 59 
Tingelin 327, Map 91 
Tingong 327, Map 93 
Tinigua see Tiniwa 
Tiniwan 65, Maps 12-13 
Tiniwa (Tinigua) 65, Maps 12-13 
Tintikia 166 
Tiou Map 2 
Tipai [Diegueño-Cocopa] 8, 14, Maps 4, 
8-9 
Tipaza Map 73 
Tipra (Tippera) see Noatia 
Tirahi 211, Map 53 
Tiri 108, Map 27 
Tiriko 327, Map 86 
Tirio 111 
Tirio [New Guinea] 99, 99, Map 32 
Tiriyó (Tirió) [Amazonia] 75, 
Maps 12-13, 17, 19 
Tirma (Surma) 284, 289 
Tiruray 120, Map 40 
Tiryometesem see Akuriyo 
Tit Map 73 
Titan 109, Map 30 
Tiverikoto (Tivericoto) 76 
Tivoid 306, Maps 91, 93, 95-96 
Tiv 312, 327, 331, Maps 95-96 
Tiwa [Bodo-Garo] see Lalung 
Tiwa [Kiowa-Tanoan] 9, Maps 3, 8-9 
Southern 23, Maps 8-9 
Isleta-Sandia 9, Map 5 
see also Taos 
Tiwi [Australia] 100, 116, Maps 35-36 
Tjalkatjarra 100, 116, Map 35 
Tjapanmay 101, 117, Map 35 
Tjokwai see Morehead-Upper Maro, 
Morehead, Upper 
Tjungundji 101, 117, Map 35 
Tlachichilco Tepewa 51 
Tlapaneco-Mangean 53 
Tlapaneko 45, 50, 53, Maps 10-11 
Tlemcen [Arabic dialect] Map 73 
Tlhaping 327, Map 74 
Tlingit 8, 24, Maps 6-9 
Toaripi 106, 111, Map 32 
Toaripi Hiri [trading language] Map 25 
Toba 72, Maps 19-23 
Toba Batak 121, Map 43 
Tobagonian Creole 26, 27 
Tobaru 118, Map 37 
Tobelo 118, Map 37 
Tobo 110, Map 32 
Toboso Map 3 
Tocenga 327, Map 93 
Tocharian 259 
Tocobaga Map 2 
Tod 164, Map 54 
Toda 212, 217, Map 52 
Todrah 177-8, Map 48 
Todzhi [Tuvan dialect] 240 
Tofa (Tofalar, Karagas) 234, 238, 
Maps 55-56 
Tofalar see Tofa 
Tofamna 114, Map 34 
Toga see Gizra 
Togba-Vara Maps 82, 89 
Togbo 327, Maps 82, 89 
Togo Kan 327, Map 104 
Togo Remnant (Central Togo) 305, 332 
Tohono O'odham (Papago) 27 
see also O'odham; Piman, Upper 
Tojolabal 52, Maps 10-11 
Tok Pisin [New Guinea Pidgin English] 
102, 104-5, Map 25 
Toka 327, Maps 74, 77 
Tokelau 107, Map 26 
Tokwasa see Morehead-Upper Maro, 
Morehead, Upper 
Tol 51-2, 61, Maps 10-11 
Jicaque 52n 
Tola [Koho dialect] 177 
Tolai (Kuanua) 102, 105, 109, Map 30 
Tolimeko Map 10 
Tolitoli (Totoli) 120, Map 39 
Tolkapaya (Yavapai) see Yuman, Upland 
Tolo (White Gelao) [Gelao variety] 182, 
184 
Tolomako 108, Map 27 
Tolowa 8, 24, Maps 4, 8-9 
Toltec 50 
Tombulu 120, Map 39 
Tomini 120, Map 39 
Tomini 120, Map 39 
Tommo So 327, Map 104 
Tomo Kan 327, Map 104 
Tomoip 109, Map 30 
Tompiro see Piro 
Tomu River 110, Map 32 
Tonda 111, 115 
Tonda 111, 115, Maps 32, 34 
Tondano 120, Map 39 
Tonga [Bantu: Chopi S60] 327, Map 75 
Tonga [Bantu: Lenje-Tonga M60] 327, 
Maps 74, 77 
as lingua franca 312, 329 
Tonga [Bantu: Manda] see Siska (Tonga) 
Tonga’ [Aslian] see Maniq 
Tongan [Polynesian] see Tongic 
Tongareva 107, Map 26 
Tongass [Tlingit dialect] 24 
Tongcengnya (Tunzunnya, Daingna, 
Dengnak, Doingnak) 166 
Tongic [Polynesian subgroup] 107 
Tongan 107, Map 26 
Tongren [Mongolian dialect] 191, 
Map 45 
Tongva see Gabrielino 
Tongwe Maps 76-78, 80 
Tongwe 327, Maps 78, 80 
Tonjo 327, Map 93 
Tonjo-Tukombe 327, Map 93 
Tonkawa 7, 9, 24, Maps 2-3 
Tonkin Lolo see Ngwi 
Tonsawang 120, Map 39 
Tonsea 120, Map 39 
Tontemboan 120, Map 39 
Tooro see Toro [Zinza] 
Toposa 284 
Tor 108, 114-15, Map 34 
Tora 65, Maps 16, 20 
Toram 327, Map 89 
Torau 108, Map 29 
Torgut (Torguud) 188, 191, Map 45 
Torguud see Torgut 
Tornedalen Finnish 263 
Toro [Central Nigerian] see Turkwam 
Toro [Nyoro-Ganda] 327, Maps 79-80 
Toro (Tooro) [Zinza] 327, Map 86 
Toro So 327, Map 104 
Toro Tegu 327, Map 104 
Toromona 71, Maps 16-17, 20-21 
Torres Strait (Islander) Creole 103, 
Map 36 
Torres Strait Pidgin English (Broken) 
107, Map 25 
Torricelli 99, 108, 111, 113, Map 33 
Lou (Torricelli) 113, Map 33 
Torwali 211, Map 53 
Tosk see Albanian 
Tosu see Ersu 
Totela 327, Maps 74, 77 
Toto 169, Map 51 
Totoli see Tolitoli 
Totonakan 51, Maps 10-11 
Totonako 50-1, Maps 10-11 
dialects 51 
Totorame 53 
Totoro [Niger-Kordofanian] Map 72 
Totoró [Paes-Barbakóan] 63, Map 12 
Touareg 282, Maps 71, 73, 100 
Touat Map 73 
Toura see Tura 
Towa see Jemez 
Towei 114, Map 34 
Trade Spanish 26 
Trans-Fly, Eastern 99, 108, 111, 115, 
Map 32 
Trans-New Guinea 98-100, 99, 108, 
110-12, 114-15, 119, Maps 31-34, 38 
West Trans-New Guinea Linkage 114, 
119 
Tregami (Gambiri) 211, 216, Map 53 
Tri [Katuic] 177-8, Map 47 
Tri-State 71, Maps 16-17 
Trike (Triki, Trique) 45, 52-3, 
Maps 10-11 
Triki see Trike 
Tring [Koho dialect] 177 
Trinidadian Creole 26, 27 
Trinitario 67, Map 21 
Trinkat [Nicobarese dialect] 178 
Tripoli [Arabic dialect] Map 71 
Tripuri see Kokborok 
Trique see Trike 
Triw 177-8, Map 47 
Trobiawan 182, 184 
Tromowa see Groma 
Trukic 108, 115, Maps 28, 34 
Central Trukic (Carolinian) 108 
Eastern Trukic 108 
Trukese 108, Map 28 
Western Trukic (Ulithic) 108 
Trumai 62, 65, Maps 18-19 
Trung see Dulong 
Ts'ao (Kua-Tsua) 327, Map 74 
Ts'ikha (Ts'ixa) 327, Map 74 
Tsaatan (Tagna Uriankhai, Soyod) 188 
Tsabe 327, Map 102 
Tsáchela see Tsáfiki 
Tsáfiki (Colorado, Tsáchela) 64, 
Maps 12-14 
Tsagaan see Monguor 
Tsaiwa see Atsi 
Tsakhar (Chahar, Chakhar) 189, 191 
Tsakhur 234-5, 239, Map 59 
Tsakonian 259-60 
Tsamako 287, 290 
Tsamba 327, Map 81 
rTsamdah 164, Map 45 
Tsamosan 8, 10, Maps 4, 6, 8-9 
gTsang 164, 175, Map 45 
Tsangi 327, Maps 83, 88 
Tsase see Piapoko 
Tsat 161, 182, 184, 193, Map 45 
Tseminyu 168 
Tsenap see Chenapian 
rTsethang 164 
Tsetsaut 8, 24, Maps 6-7 
Tsezic (Didoic) 239 
Tsez (Dido) 234, 239, Map 59 
Tshangla (Sharchop) 215 
see also Monpa 
mTshochen 164, Map 45 
Tshuosh see Sawos 
Tsiege 327, Map 83 
Tsilhqot'in see Chilcotin 
Tsilmamo see Zilmamu 
Tsimané see Chimane 
Tsimshianic 8, 24, Maps 7-9 
Maritime 8, 18, Maps 8-9 
Coast Tsimshian (Sm'algyax) 18, 24, 
Maps 6-7, 9 
Southern Tsimshian (Sküüxs) 18, 
Maps 6-7, 9 
Tsogo Maps 83, 87-88 
Tsogo 327, Maps 83, 88 
Tsokanian (Greek) Map 66 
Tsonga 312, 332-3, Maps 74-75 
Tsonga-Jonga (Shangaan) 326, 
Maps 74-75 
Tsonga-Ronga-Tshwa 312 
Tsongool Buryat 190, 239 
Tsotso 327, Map 86 
Tsouic 182, 184, Map 50 
Tsou 182, 184, Map 50 
Tsova Tush (Bats, Batsbi) 235, 239, 
Map 59 
Tsum [Tibetan] 164, Map 51 
Tsuut'ina (Tsutina) see Sarcee 
Tsuvan 327, Map 94 
Tswa 327, Map 75 
Tswana 311-12, 327, 329, 332-3, 
Maps 74, 84 
Setswana 310 
Ttao-Suamato Map 32 
Tu see Monguor 
Tualatin-Yamhill 16, Maps 4, 6, 9 
Tuareg see Touareg 
Tubalar 234, 238 
Tubar 54, Map 3 
Tubare 54 
Tubatulabal 8, 24, Maps 4, 8-9 
Tubetube 109, Map 31 
‘Tubu’ [Chadian Arabic dialect] Maps 71, 
73 
Tubu [Teda-Daza] 312 
Tucano see Tukanoan, Tukano 
Tugan Ful, South 327, Map 104 
Tuhujia see Monguor 
Tuic [Jieng dialect] 284 
Tujia 163, 169-70, 175, Map 45 
Northern Tujia (Biji) 170 
Southern Tujia (Mozi) 170, 175 
Tujia 169 
Tukanarikem Maps 16, 20 
Tukang-Besi 120, Map 39 
Tukanoan 68, Maps 12-17 
Tukano (Tucano) 68, Maps 12-13, 
15-17 
Tukele 327, Map 92 
Tuken 327, Maps 72, 86 
Tukhemmi see Khyamnyungan 
Tuki 327, Maps 92-93 
Tukombe 327, Map 93 
Tukudede 119, Map 38 
Tukudh see Gwich'in 
Tukun see Monguor 
Tukurina 67 
Tula Map 98 
Tula 327, Map 98 
Tulama 286 
Tule-Kaweah 25, Maps 4, 9 
Tuling 327, Maps 92-93 
Tulishi [Kamdang dialect] 285 
Tulishi [language] see Kamdang 
Tulu 213, 217, Map 52 
Tum (Ktum, Tay Tum) 177-8, Map 47 
Tumak 327, Maps 89-90 
Tumala Map 72 
Tumara see Muniwara 
Tumawoi see Skou 
Tumbatu 327, Maps 76, 85 
Tumbuka 329, Maps 75-77 
Tumbuka 312, 327, Maps 75-77 
Tumbwe 327, Maps 78, 80 
Tumd [Ju Ud subdialect] 191 
Tumi see Kitimi 
Tumleo 113, Map 33 
Tumma 285 
Tumnelia 327, Map 99 
Tümpisa Shoshone see Panamint 
Tumtum 285 
Tundra Yukaghir 239 
Tunebo 64, Maps 12-13 
Tunen Map 92 
Tunen 327, Maps 92-93 
Tungoro 327, Map 93 
Tungus see Ewenke 
Tungus(ic) 238 
Northern (North-western, Ewenki) 187, 
Map 55 
Southern 187, Map 55 
Central 187 
Tungus see Evenki 
Tunia 327, Maps 89-90 
Tunica 7, 9, 24, Map 2 
Tunis [Arabic dialect] Map 73 
Tunisia [Arabic subgroup] Map 71 
Tunjung 120, Map 41 
Tunzunnya see Tongcengnya 
Tuotampa 328, Map 93 
Tupari Maps 16-17, 20-21 
Tupian 62, 74-5, Maps 12-13, 16-23 
Tupi-Guarani 62, 74-5, Maps 16-23 
Tupi 62, 74, Maps 18, 22 
Southern 74 
Tupinamba 74 
Tupuri 328, Maps 90-91, 94 
Tura-Guro 
Tura (Wen, Toura) 301, 328, 
Maps 105-106 
see also Guro-Tura 
Turaka 110, Map 31 
Turama River Kiwai see Morigi 
Turama-Kikorian 99, 99, 111, Map 32 
Turama-Omatian (Mena) 111 
Turka (Tyurama) [Gur] 328, 
Maps 104-105 
Turkana [Nilotic] 283-5, 289, 328, 
Maps 72, 86 
see also Nyangatom 
Turkic (Tujue) 186, 216, 237-8, Maps 45, 
53, 55, 57-61, 66 
Central (Kipchak-Noghai, Aralo- 
Caspian) 238, Map 59 
Eastern (Karluk) 186-7, 238, Maps 45, 
53, 57 
Northern (Eastern Hunnish) 186-7, 
238, Maps 45, 55-57 
Southern 237-8, Map 53 
Turkish 233-7, 260-2, 277, Maps 53, 
59-61, 70 
abroad 236, 260, 262-3, 280 
in Australia Map 36 
Khorasani (Quchani) Map 53 
Turkmen 211, 216, 234, 236-8, 
Maps 53, 57-59 
Western (Kipchak) 186-7, 216, 238, 
Maps 45, 55, 57-59 
Turkish; Turkmen see Turkic 
Turks & Caicos Creole 26, 26, 27, Map 9 
Turkwam (Toro) 328, Map 96 
Turo (Duro) 286 
Turoyo (Turéyo) 278, Map 70 
Turu 115, Map 34 
Turuj [Kamdang dialect] 285 
Turung 180 
Tuscan (Italian) Map 67 
Tuscarora 9, 24, Maps 1-2, 8-9 
Tusha (Tuxa) 73, Maps 18-19 
Tushinawa (Tuxinaua) 71, Maps 14, 16, 20 
Tusyan Maps 104-105 
Tusya (Tusyang) 328, Maps 104-105 
Tutchone 8, Maps 7-8 
Northern 7-8, 20, Maps 7-8 
Southern 8, 23, Maps 7-8 
Tutelo 9, 24, Maps 1-2 
Tutoh Kenyah 120, Map 41 
Tutong 120-1, Map 41 
Tutrugba 327 [under Tafi], Map 103 
Tutsingo 328, Map 93 
Tutuba 108, Map 27 
Tututni 8, 24, Map 4 
Tuvalu 107, Map 26 
Tuva(n) (Urianhay, Uryankhai) 186, 188, 
190, 234, 238, Maps 45, 55-56 
Todzhi [dialect] 240 
Tuwari (Akiapmin) 113, Map 33 
Tuxa see Tusha 
Tuxinaua see Tushinawa 
Tuyoneri 65, Maps 20-21 
Tuzanteco see Muchu 
Tvluq see Zorwang 
Twa [Bantu: Bemba M40] 328, Map 77 
Twa [Bantu: Rwanda-Rundi] 328, Map 80 
Twafap 328, Map 93 
Twampa (Uduk) 285, Map 72 
Twana 8, 24, Map 6 
Twi 303, 331 
see also Akan 
Twumfu 328, Map 95 
Txikao see Chikaon 
Tyap 328, Map 97 
Tyapi 328, Map 107 
Tyenga (Shanga) 328, 331, Maps 100, 102 
Tyepo see Tiepo 
Tyeraity 116, Map 35 
Tyo 328, Map 83 
Tyurama see Turka 
Tyva see Tuvan 
Tzeltal (B'atz'il k'op) 45, 50, 52, 
Maps 10-11 
Tzeltalan 52, Maps 10-11 
Tzotzil (B'atz'i k'op) 45, 50, 52, 
Maps 10-11 
Tzova Tush see Tsova Tush 
Tzutujil 45, 52, Maps 10-11 


U see Lhasa 
U see Phuman 
Uab Meto 119 
Uaiuai see Waiwai 
Uamué see Wamoé 
Uara see Waura-Meinaku 
Uaripi 111, Map 32 
Ubaghara-Kohumono Map 101 
Ubaghala 328, Map 101 
Ubamer see Aari 
Ubang 328, Map 95 
Ubangi 305-6, 330-1, Maps 79, 82, 87, 
89-91 
Ubate Maps 3, 5 
Ubir 110, Map 31 
Ubu-Mbokomu-Moshi-Kirua-Kilema- 
Mwika 310, 328, Map 86 
Ubykh 238 
Uchiti (Huchiti) 15, 24, Map 3 
Udeghe [Tungus subgroup] 187 
Udeghe 187 
Udehe see Udihe 
Udi 234-5, 239, Map 59 
Udihe 238 
Udihe (Udehe) 233-4, 238, Map 55 
Udmurt (Votyak) 234, 236, Maps 56, 58 
Uduk see Twampa 
Udung see Wutung 
Ufia 328, Maps 95, 101 
Ufim 110, 112, Maps 32-33 
Ugare 328, Map 95 
Ughele 109, Map 29 
Ugong see Gong 
Ugric 238, 259-60, Maps 55-56, 58, 
61-62, 65, 67 
Uhami-Iyayu 328, Map 101 
Uhunduni 99, 99, 114, Map 34 
Uhuum 328, Map 95 
Uighur Uriankhai see Tsaatan 
Uighur (Uyghur) 186-7, 211, 234-8, 
Maps 45, 57 
Central [dialect] 186 
Uintah & Ouray Ute 24 
Ujimgin [Shilingol subdialect] 191 
Ujir 115, Map 34 
Ukaan 306, Map 101 
Ukaan 328, Map 101 
Ukele see Kukele 
Ukpe-Bayobiri 328, Map 95 
Ukpet-Ehom 328, Map 93 
Ukrainian 236, 259, Maps 58-59, 61-62 
abroad 231-7, 260, 262 
in Canada 26, 236 
Ukue-Ehwen 328, Map 101 
Ukuriguma 112, Map 33 
Ukwani-Aboh 328, Map 101 
Ukwo 328, Map 101 
Ulanqab [Inner Mongolian dialect] 191 
Ulau-Suain 113, Map 33 
Ulbarag (Urbarag) 288 
Ulcha (Olcha) 234, 238, Map 55 
Ulingan (Mawake) 112, Map 33 
Ulithic 108 
Ulithian 108, Map 28 
see also Trukic 
Ulta see Orok 
Ulu 328, Map 101 
Ulukwumi 328, Map 101 
Uma see Pipikoro 
Umairof see Hewa 
Umbale see Chaurasia 
Umbindhamu 101, 117, Map 35 
Umbrian (Italian) 261, Map 67 
Umbugarla-Ngurmbur 100, 116, Map 35 
Umbugarla 116, Map 35 
Umbule see Chaurasia 
Umbundu 330, Maps 74, 84 
Umbundu 302, 310, 313-14, 328, 
Maps 74, 84 
Umbuygamu 101, 117, Map 35 
Ume Saami (Umi Saami) 263, Map 64 
Umiray Dumaget see Dumagat 
Umon 328, Map 101 
Umotina see Umutina 
Umpila 101, 117, Map 35 
Umpqua, Upper 8, 24, Map 4 
‘Umtr Map 70 
Umutina (Umotina) 72, Map 20 
Umvang 328, Maps 87-88, 92 
Unalachtigo [Delaware dialect] 24 
Unami (Oklahoma Delaware, Lenape) 7, 
17, 19, 24, 26, Map 1 
Unangan (Aleut) 10 
Undro see Andro 
Undu [Berta dialect] 285, 290 
Uneme 328, Map 101 
Unga 328, Map 77 
Ungarinjin Map 35 
Unggumi 116, Map 35 
Unguja 328, Map 85 
Zanzibar Map 76 
Unquachog see Quiripi-Unquachog 
Unua 108, Map 27 
Upland Yuman see Yuman 
Upriver [Nez Perce dialect] 19 
Uptu (Chinbon) 168 
Ura [Austronesian] 108, Map 27 
Ura [Benue-Congo] 328, Maps 100-101 
Urad [Ulanqab subdialect] 191 
Uradhi 101, 117, Map 35 
dialects 101, Map 35 
Urak Lawoi' 121, 183-4, Maps 44, 48 
Urali Map 52 
Uralian (Kipchak-Bulgar) 238 
Uralic 231, 238, Maps 55, 61-62, 64-65, 
67 
Urarina (Shimaku) 69, Maps 14-15 
Urat 113, Map 33 
Urdu 1, 211-13, 214, 217, 259, Maps 51, 
53-54 
abroad 181, 216, 263, 280 
Dakhini Map 52 
Urdus see Ordos 
Urhobo 328, Map 101 
Uria Map 34 
Urianhay see Tuvan 
Uriankhai 188 
see also Tannu Uriankhai; Tsaatan; 
Tuvan 
Urigina 112, Map 33 
Urii 110, Map 32 
Urim [Torricelli: Palei] 113, Map 33 
Urimo [Torricelli: Marienberg] 113, 
Map 33 
Uripiv-Wala-Rano-Atchin 108, Map 27 
Urmi (Urmi) 278, Map 70 
Urninganggic 116 
Urningangg 116, Map 35 
Urrti 283 
Uru-Chipaya 70, Maps 20-21 
Uru 70, Maps 20-21 
Uru-eu-uau-uau see Uruewauwau 
Uruak see Awaké-Kaliana, Awaké 
Uruangnirin 115, Map 34 
Uruava 108, Map 29 
Urubú 74 
Uruewauwau (Uru-eu-uau-uau) 74, 
Maps 17, 21 
Urupá-Jarú 65, Maps 16, 20 
Uruwa 110, 112, Maps 32-33 
Uryankhai see Tuvan 
Usaghade 328, Map 93 
Useri 328, Map 86 
Ushojo Map 53 
Usila Chinanteko 53 
Usino 112, Map 33 
Usku 114, Map 34 
Uspanteko (Uspanteco) 52, Maps 10-11 
Usu 112, Map 33 
Utagwang 328, Map 95 
‘Utayba Map 70 
Ute-Chemehuevi 9, 24, Maps 3-5, 8-9 
Chemehuevi 24-5, Maps 3-5, 8-9 
Southern Paiute 24, Maps 3-5 
Ute 24, Maps 3-5, 8-9 
Southern Ute 24 
Uintah & Ouray Ute 24 
Ute Mountain Ute 24 
Utian 7-8, 10, Maps 4, 8-9 
Uto-Aztecan 7-8, 46, Maps 2-4, 6, 8-9 
see also Yuta-Nawan 
Utu 112, Map 33 
Utuli 328, Map 85 
Utupua 107, 109, Map 29 
Utyangikwathiya [Uradhi dialect] 101, 
117, Map 35 
Uvbie 328, Map 101 
Uvea 
East 107, Map 26 
West 107, Map 26 
Uvol 109, Map 30 
Uw Oykangand 101, 117, Map 35 
Uya see Sausi 
Uyghur see Uighur 
Uzbek 186-7, 211, 216, 234-8, Maps 45, 
53, 57 
Uzekwe 328, Map 95 
Uzemchin 188 


Vafsi 216, Map 53 
Vagdi Map 54 
Vaghua 108, Map 29 
Vagla 328, Maps 103-104 
Vai 303, 328, 332, Map 106 
Vaiphei 213 
Vale 328, Maps 89-90 
Valencian (Spanish) Map 68 
Valley Yokuts 25 
Valman (Koroko) 113, Map 33 
Vangunu 109, Map 29 
Vanikoro 107, 109, Map 29 
Vanimo 112, 115, Maps 33-34 
Vanimo (Manimo) 112, 115, 
Maps 33-34 
Vankieu see Bru 
Vano 109, Map 29 
Vanuatu 
Central 108 
North 108 
Southern 108 
Vao 108, Map 27 
Varama 328, Map 88 
Varisi 108, Map 29 
Varli Maps 52, 54 
Varofu see Warapu 
Vata 328, Map 105 
Vayu see Hayu 
Vedda(h) 214, Map 52 
Vehes 111, Map 32 
Venda Maps 74-75 
Venda 312, 328, 333, Maps 74-75 
Venetian (Italian) 261, Map 67 
Veng'o 328, Maps 93, 95 
Ventureño [Chumashan] 25 
Veps 234, 238 
Vera'a 108, Map 27 
Vere-Gimme Map 98 
Vere Map 91 
Vere 328, Maps 91, 98 
Vidunda 328, Map 76 
Viemo Maps 104-105 
Viemo 328, Maps 104-105 
Vietic [Mon-Khmer subgroup] 175-6, 
178, Maps 45, 47 
Vietnamese 176, 178, 194, 263, 
Maps 47-48 
abroad 234, Map 36 
Vilela 72, Maps 20, 22 
Vili 328, Maps 83, 88 
Vimtin (Vimtim) 328, Map 94 
Vincentian Creole 26 
Vinza 328, Maps 78, 80 
Vira 328, Map 80 
Virgin Islands Creole 26, 27 
Virginia Algonquian 7, 25, Map 1 
Vitiaz 109, 111, 113, Map 30 
Viya 328, Map 88 
Vlach Romani see Roma (Romani) 
Vogul see Mansi 
Voh-Kone 108, Map 27 
Voko-Dowayo Maps 91, 98 
Voko Map 91 
Volga see Mordvin 
Volgaic 231, 238 
Volta-Congo 305-6 
North (West) 305 
South (East) 305 
Vora 328, Maps 82, 89 
Votic 234, 238 
Votyak see Udmurt 
Vovo 108, Map 27 
Vuite 168 
Vuma 328, Map 86 
Vumba 328, Map 85 
Vunapu 108, Map 27 
Vungu 328, Maps 83, 88 
Vunjo see Kiwo-Naruma-Vunjo 
Vurés 108, Map 27 
Vute 328, 331, Maps 87, 91-93, 95 


Wa see Waic, Wa 
Waanyi 116, Map 35 
Waarnambool see Kuurn Kopan Noot 
Waata (Oromo-Orma) 286, 328, 330, 
Maps 72, 85 
Wab (Som) 113, Map 33 
Wabo 115, Map 34 
Wabuda 111, Map 32 
Wachi (Guachi) see Chapakura 
Wachipaéri (Huachipaeri, Harakmbut) 
65, Maps 17, 20-21 
Wada 328, Maps 79, 82, 89-90 
Wadaginam 111, Map 33 
Wadapi-Laut 115, Map 34 
Waddai Maps 71-72 
Wadikali 102, 117, Map 35 
Wadjabangayi 101, 117, Map 35 
Wadjalang 101, 117, Map 35 
Wadjigu 101, 117, Map 35 
Waerana 118, Map 38 
Waffa 110, Map 32 
Wagawaga 109, Map 31 
Wagiman (Wagoman) 116, Map 35 
Wahau Kayan 120, Map 41 
Wahau Kenyah 120, Map 41 
Wahgi 110, 112, Maps 32-33 
Wahgi 102, 110, 112, Maps 32-33 
Wahiro (Guajiro, Wayuu) 66, 
Maps 12-13 
Wahitpang 166 
Wahivoan 65, Maps 12-13 
Wahivo (Guajibo, Guahibo, Hiwi) 65, 
Maps 12-13 
Wai see Wainuma 
Waia 111, Map 32 
Waiampi see Wayampi 
Waiare see Waviare 
Waibuk (Piawi) Map 33 
Waibuk (Wiyaw) Map 33 
Waic (Wa) 175-6, 178, Map 46 
Wa (Paraok) 176, 178, Maps 47, 49 
Paraok Wa Map 46 
Waigali 211, 216, Map 53 
Waikena see Wanana-Pira 
Waikurian, Waikuri [N. America] see 
Guaicurian, Guaicuri 
Waikuruan [S. America] 72, Maps 18-21 
Waikurú 72, Maps 18-21 
Wailaki see Eel River Athabaskan 
Wailapa 108, Map 27 
Wailu (Houailou, Ajié) 102, 106 
see also Ajié 
Waima'a 119, Map 38 
Waima'a 119, Map 38 
Waimi 64 
Waina (Sowanda) 112, 115, Maps 33-34 
Wainuma (Wai) 65, Maps 16-17 
Waipuinavi see Puinave 
Waisara see Owena 
Waiwai (Uaiuai) 76, Maps 12-13 
Waja-Jen Maps 91, 94, 98 
Waja Maps 91, 98 
Waja-Awak Map 98 
Waja 328, Maps 91, 98 
Wajarri 100, 116, Map 35 
Wajumara 76, Map 12 
Waka 328, Map 98 
Waka-Kabic 102, 117-18, Map 35 
Wakka 102, 118, Map 35 
Wakka-Wakka 102, 118, Map 35 
Wakapunga 101, 117, Map 35 
Wakashan 7-9, 46, Maps 6-9 
Wakaya 101, 117, Map 35 
Wakhi 193, 211, 216, 236-7, Maps 53, 57 
Walane 288, Map 72 
Walangama 101, 117, Map 35 
Walap(a)i see Yuman, Upland 
Wali 328, Maps 103-104 
Walio 113, Map 33 
Walio 113, Map 33 
Wallachian see Romanian 
Wallagga (Mecha) Map 72 
Wallo [Amharic dialect] 288 
Walloon 262, Map 69 
Walmadjari see Walmajarri 
Walmajarri (Walmadjari) 100, 103, 107, 
116, Map 35 
Walser [German dialect] 261 
Wama-Tankamba 328, Maps 102, 104 
Wamaka (Guamaca, Malayo) 64, 
Maps 12-13 
Wamas 112, Map 33 
Wambayan 116 
Wambaya 116, Map 35 
Wambiano (Guambiano, Mogés) 63, 
Map 12 
Wambon 114, Map 34 
Wambule see Chaurasia 
Wamin 101, 117, Map 35 
Wamo-Chapakuran 65, Maps 12, 16-17, 
20-21 
Wamo (Guamo) 65, Map 12 
see also Chapakuran 
Wamoé (Uamué) 73, Maps 18-19 
Wampano [Munsee dialect] 19 
Wampanoag [Massachusett] 18 
Wampar 111, Map 32 
Wampur 111, Map 32 
Wan see Nwa 
Wanai see Mapoyo-Yavarana 
Wanambre 112, Map 33 
Wanana-Pira (Piratapuyo, Waikena) 68, 
Maps 12-13 
Wanang [Papuan subgroup] 111, Map 33 
Wanang [Tibeto-Burman] 166, Map 51 
Wancho 167, 175, 213, Maps 49, 51 
Wanda 328, Maps 75-76 
Wandabong 110, 112, Maps 32-33 
Wandala [Chadic subgroup] 331, 
Maps 94-95, 99 
Central Map 94 
East Map 94 
Wandala 328, Maps 94-95, 99 
West Map 94 
Wandala Daza [Saharan] 328, Map 99 
Wandamen 115, Map 34 
Waneci 216, Map 53 
Wanes 115, Map 34 
Wanga [Bantu] 328, Map 86 
Wangan [Pama-Nyungan] 101, 117, 
Map 35 
Wangaybuwan-Ngiyambaa 102, 117, 
Map 35 
Wanggom 114, Map 34 
Wangka-Yutjurru 101, 117, Map 35 
Wangkajunga 100, 116, Map 35 
Wangkamadla 101, 117, Map 35 
Wangkangurru 101, 117, Map 35 
Wangkayi 101, 117, Map 35 
Wangkumara 101, 117, Map 35 
Wanham see Wanyam 
Waniabu see Ama 
Wanji 328, Maps 76-77 
Wanka (Huanca) 70, Maps 20-21 
Wanninawa (Katukina) 70, Maps 14-17, 
20-21 
Wano 114, Map 34 
Wantoat 110, 112, Maps 32-33 
Wantoat 110, 112, Maps 32-33 
Wanukaka 118, Map 38 
Wanuma 112, Map 33 
Wanyam (Wanham) 65, Maps 16, 20 
Wanyubarlgu 102, 118, Map 35 
Waorani see Sabela 
Wapei, West 113, Map 33 
Wapei-Palei 113, Map 33 
Wapei 113, Map 33 
Wapi [Papuan: Piwai] 113, Map 33 
Wapi [Papuan: Wapei] see Olo 
Wapishana (Aruma, Wapixana) 66, 
Maps 12-13 
Wappo 8, 25, Map 4 
War [Mon-Khmer] 178, 217, Map 51 
Wara-Natioro Maps 104-105 
Waraiku 66, Maps 14, 16 
Warao (Guarao) 64, Maps 12-13 
Warapu (Varofu, Waropu) 112, Map 33 
Waray 116, Map 35 
see also Sorsogon 
Wardaman-Wagoman 100, 116, Map 35 
Wardaman 116, Map 35 
Warekena (Guarequena) 66, 
Maps 12-13 
Warenbori 115, Map 34 
Wargla Map 73 
Wari 65 
Wariapano 70 
Wariji(y)o (Guarijio) 54, Maps 3, 9 
Warikiana, Warikyana see Kashuyana- 
Warikyana 
Waris 112, 115, Maps 33-34 
Waris 112, Maps 33-34 
Wariva (Guariba) 68, Maps 12-13, 17 
Warji Map 97 
Warji 328, Map 97 
Warkay-Bipim 114, Map 34 
Warki-m-be see Dilling 
Warlmanpa 100, 116, Map 35 
Warlpiri 100, 103, 116, Maps 35-36 
Warluwarric (Warlu-Warric) 101, 108, 
115, 117, Map 35 
Warluwarra-Bularnu 101, 117, 
Map 35 
Warndarang 116 
Warndarang 116, Map 35 
Warnman 100, 116, Map 35 
Waropen 115, Map 34 
Waropu see Warapu 
Warpe 71 
Warrgamay 101, 117, Map 35 
Warriyangka 100, 116, Map 35 
Warrungu 117, Map 35 
Warsangeli see af-Warsangeli 
Warumungu 101, 117, Map 35 
Warup 112 
Waruwaru see Hoti 
Warwa 116, Map 35 
Wasa 328, Maps 79, 89 
Wasco see Kiksht 
Wase 328, Maps 96, 98 
Wasembo (Gusap) 111-12, Maps 32-33 
Wasembo 112, Maps 32-33 
Washa Map 2 
Washkuk see Kwoma 
Washo (Washoe, Washu) 7-8, 25, 51, 
Maps 4, 8-9 
Wasi 109, Map 30 
Waskia 112, Map 33 
Wastekan 52, Maps 10-11 
Wasteko (Huasteco) 45, 49-50, 52, 
Maps 10-11 
Wasulunka 328, Maps 105-107 
Watakataui 113, Map 33 
Wataluma 110, Map 31 
Watam 113, Map 33 
Watapor see Anggor 
Watawit see Berta 
Wateree Map 2 
Wathawurung 102, 118, Map 35 
Wati see Western Desert 
Watiwa see Dumpu 
Watjanmay 101, 117, Map 35 
Watsi 328, Map 81 
Watubela 115, 118, Maps 34, 37 
Watuso-Wétar (Guatuso, Guetar) 64, 
Maps 12-13 
Wauja see Waura-Meinaku 
Waunana see Noanama 
Waupe see Kwato 
Waura-Meinaku (Uara, Wauja) 66, 
Maps 18-19 
Wave (Wavi, Huave) 51, Maps 10-11 
Wavi see Wave 
Waviare (Guaviare, Waiare, Nukak 
Maku) 68, Maps 12-13 
Wawa 328, Map 91 
Waxiang see Chinese (Sinitic), Old South- 
western 
Wayampi (Waiampi) 62, 74, Map 13 
Wayana [Cariban] 76, Maps 12-13, 18 
Wayana (Guayana) [Je] 73, Map 22 
Wayavero (Guayabero, Hiw) 65, 
Maps 12-13 
Waykuri see Guaicuri 
Wayoró Maps 16-17, 20-21 
Wayuu see Wahiro 
We [Bantu] 328, Maps 74, 77 
Webo 328, Map 105 
Weda 118, Map 37 
Wedau 105, 110, Map 31 
Wee see Guere 
Wee [Kru] 311, Maps 105-106 
Weiguerh, Weiwu(erh) see Yugur 
Welamo see Wollaitta 
Welaung 168 
Weliki 110, 112, Maps 32-33 
Wellegga-Mact'a 286 
Welsh 259, 263, Maps 63, 69 
Wemale 118, Map 37 
Wemba Wemba 102, 118, Map 35 
Wembi see Manem 
Wemo [Kate dialect] 105 
Wen see Tura-Guro, Tura 
Wendat see Huron-Wyandotte 
Wendish see Sorbian 
Wenro Map 1 
Werchikwar 217-18, Map 53 
Weretai 115, Map 34 
Weri 111, Map 32 
Wesa see Barwe 
Weskos [pidgin English] 328, 330-1 
Western Canadian Inuktitut see Inuktitut 
Western Desert (Wati) [Australian] 100, 
103, Maps 35-36 
Western Eskimo see Yupik 
Wetamut 108, Map 27 
Wétar (Guetar) [Chibchan] 64 
Wetar [Timor] 119, Map 38 
Wetsuwet'en see Babine 
Wewaw 174 
Weyewa 118, Map 38 
White Gelao see Tolo 
White Mong see Hmong 
White Mongol see Monguor 
White Tai see Tai-Kadai, Tai Khaw 
Whitesands 108, Map 27 
Wi-Isu 328, Map 95 
Wiaki 113, Map 33 
Wichi see Matako 
Wichita 9, 25, Maps 2-3, 5, 8-9 
Wichol (Huichol, Tecual) 53-4, 
Maps 10-11 
Wide Grass see Grassfields 
Widekum see Moghamo 
Wider Bantu see Grassfields 
Wik Ayangenycha 101, 117, Map 35 
Wik Ep (Wik Me'anh) 101, 117, Map 35 
Wik Iyanh 101, 117, Map 35 
Wik Keyangan 101, 117, Map 35 
Wik Me'anh 101, 117, Map 35 
see also Wik Ep 
Wik Mungkan (Wik Munkan) 101, 107, 
117, Map 35 
Wik Ngathan 101, 117, Map 35 
Wik Ngatharra 101, 117, Map 35 
Wik Ompon (Wik-Ompom) 101, 117, 
Map 35 
Wik Paach 101, 117, Map 35 
Wikchamni see Wukchumne 
Wilawila 116, Map 35 
Wild Sulayman 279, Map 70 
Wild ‘Ali 279, Map 70 
Wile 328, Maps 103-105 
Willapa see Kwalhioqua-Clatskanie 
Willaumez 109, Map 30 
Wilyagali 102, 118, Map 35 
Windward Creole 26, 27 
Winnebago (Ho-Chunk, Hochank) 9, 25, 
Maps 1, 5, 8-9 
Wintuan 7-8, 10, Maps 8-9 
Wintu-Nomlaki 8, 25, Maps 4, 8-9 
Winyaw Map 2 
Wipsi see Poka-Kori-Wipsi-Geeri 
Wipukpaya see Yuman, Upland 
Wiradhuri 102, 117, Map 35 
Wirangu 101, 117, Map 35 
Wiri 101, 117, Map 35 
Wirina 66, Map 12 
Wirrtjarakantja 100, 117, Map 35 
Wirtimay 100, 116, Map 35 
Wiru 99, 99, 110, Map 32 
Wishram see Kiksht 
Wissel Lakes-Kemandoga 114, Map 34 
Wissel Lakes 99 
Witotoan 69 
Wiyaw see Waibuk 
Wiyot 7, 25, Map 4 
Wobe 328, Map 105 
Woccon 9, 25, Maps 1-2 
Wodani 114, Map 34 
Wogamusin 113 
Wogamusin 113, Map 33 
Wogeo 113, Map 33 
Wogu see Bahinemo 
Woi 115, Map 34 
Woj (Enekan) [Mande] 328, 
Maps 105-106 
Wokam-Tarangan 115, Map 34 
Wokha [Lhota dialect] 167 
Wolaitta, Wolayta see Wollaitta 
Woleaian 108, Map 28 
Wolio 120, Map 39 
Wollaitta (Wolayta, Wolamo, Welamo) 
285, 287-9, Map 72 
Wollo 286, Map 72 
Wolof 302, 311-12, 328, 332, Maps 73, 
107-108 
Wolyamidi 116, Map 35 
Wom [Leko-Nimbari] 328, Maps 91, 98 
Wom (Wam) [Torricelli] 113, Map 33 
Womsak see Kwanga 
Wongo 328, Map 81 
Woods Cree 7, 25, Maps 7-9 
Wor (Dugukan) 328, Map 105 
Woro see Orlo 
Worrorran 100 
Wororic 116 
Worora 116, Map 35 
Wotapuri 211, Map 53 
Wotu 120, Map 39 
Woyo 328, Map 84 
Wu see Chinese (Sinitic), Central, Wu 
Wudhadhi [Uradhi dialect] 101, 117, 
Map 35 
Wudjari 101, 117, Map 35 
Wuhua [Yue Chinese dialect] 160 
Wukari 328, 331, Maps 95-96 
Wukchumne (Wikchamni) [Yokuts 
dialect] 25 
Wulgur 101, 117, Map 35 
Wuli-Wuli 102, 118, Map 35 
Wulna 116, Map 35 
Wumboko 328, Maps 92-93 
Wumbu [Bantu: Yaka] 328, Maps 81, 83 
Wumbvu (Wumvu) [ Bantu: Kele-Mbete] 
328, Maps 83, 88 
Wumeng see Dai, Dai Sa 
Wuming [Buyi dialect] 179 
Wumu [Bantu: Teke] 328, Maps 81, 83 
Wumvu see Wumbvu 
Wunambalic 116 
Wunambal 116, Map 35 
Wungu 328, Maps 76-77 
Wunumara 101, 117, Map 35 
Wurbo Maps 95-96, 98 
Wurkum see Piya 
Wushi 328, Maps 93, 95 
Wusi 108, Map 27 
Wutun [mixed language] 193, Map 45 
Wutung (Udung) 112, 115, Maps 33-34 
Wuumu see Mvumbu 
Wuvulu-Aua 109, Map 30 
Wuywurrung 102, 118, Map 35 
Wuzhou see Chinese (Sinitic), Central, 
Wu 
Wuzlam 328, Map 94 
Wyandotte see Huron-Wyandotte 


Xakriaba see Shakriaba 
Xambioa see Karaja-Shambioa 
Xamtana see Khamtanga 
Xaninaua see Shaninawa 
Xapho see Laghuu 
Xârâcùù 108, Map 27 
Xaragiire 108, Map 27 
Xarua 109, Map 30 
Xasonke 328, Map 107 
ǁX'au-ǁ'e see ǁKh'au-ǁ'e 
Xavante see Shavante 
Xerente, Xeres see Sherente 
Xetá see Sheta 
Xhosa 308, 312, 314, 328-9, 332-3, 
Map 74 
Xiang see Chinese (Sinitic), Central, Xiang 
Xianghua see Chinese (Sinitic), Waxiang 
Xiangtang see Lolopo 
Xianjiang [Mandarin dialect] see Chinese 
(Sinitic), Northern 
Xibo (Xibe, Sibo, Sibe) 187, Map 45 
Xide [Shengza Nosu dialect] 172 
Xifan see Tibeto-Burman 
Xinhmul (Phong, Puoc) [Khmuic] 176, 
178, Map 47 
Xinjiang [Daur dialect] 192 
Xinkan 50, 61 
Xinka see Shinkan 
Xipaia see Juruna 
Xipibo see Shipibo 
Xipinawa see Shipinawa 
Xiriana see Shriana 
Xishuangbanna Dai see Dai (S.W. Tai), 
Dai Lue 
Xixia (Tangut) 169 
Xixime Map 3 
Xokleng see Shokleng 
Xokó see Shokó 
Xuan(g)zhou see Chinese (Sinitic), 
Central, Wu 
Xukurú see Shukurú 


Yaa 328, Map 83 
Yaaku 286, Map 72 
Yaaku (Mogododo) 285, 287, Map 72 
Yabaána 66, Maps 12-13 
Yabasi 328, Maps 92-93 
Yabém 105, 111, Map 32 
Yabim 102 
Yaben 112, Map 33 
Yabim see Yabém 
Yabio 113, Map 33 
Yabiyufa (Yawiyuha) 110, Map 32 
Yabong 112, Map 33 
Yabula-Yabula 102, 118, Map 35 
Yabuti see Jabuti 
Yachumi [Yimchungrü dialect] 167 
Yadina see Yedina-Kotoko, Yadina 
Yaeh 177-8, Maps 47-48 
Yafi 112, 114, Maps 33-34 
Yagalingu 101, 117, Map 35 
Yaganon 112, Map 33 
Yagara 102, 118, Map 35 
Yagawak 110, Map 32 
Yagba 328, Map 101 
Yaghnobi 236-7, Map 57 
Yagima 328, Map 81 
Yagomi 112, Map 33 
Yagua see Yawan, Yawa 
Yagwoia 111, Map 32 
Yahadian 114, Map 34 
Yahahi 71, Map 16 
Yahang 113, Map 33 
Yahuna see Jaina 
Yaitepec Chatino 53 
Yaka [Bantu subgroup] Maps 81, 83-84 
Yaka 328, Maps 81, 83-84 
Yaka [Lake Bantu] see Hanga 
Yakan 119, Map 40 
Yaki see Yaqui 
Yakkha 165-6, 214, Map 51 
Yaknge 110, Map 32 
Yakoma 328, Maps 82, 89 
Yakpa 328, Maps 79, 82, 89-90 
Yakur see Loka 
‘Yakut’ see Evenki 
Yakut (Sakha) 234, 238, Maps 55-56 
Yala [Idomoid] 328, Maps 93, 95 
Yala [Ngwi] see Aluo 
Yalanjic 101, 117, Map 35 
Yalarnnga 101, 117, Map 35 
Yalayu 108, Map 27 
Yalé 99 
Yalu 111, Map 32 
Yalunka 328, Maps 106-108 
Yamalele 110, Map 31 
Yamamadi see Jamamadi 
Yámana 71, Map 22 
Yamasee Maps 1-2 
Yamba 328, Map 95 
Yambasa Maps 92-93 
Yambasa 328, Maps 92-93 
Yambes 113, Map 33 
Yambiang Yapeke 328, Map 92 
Yambina 101, 117, Map 35 
Yamdena 119, Map 38 
Yaméo 69, Map 14 
Yamhill see Kalapuyan 
Yami 179, 182, 184, Map 50 
Yaminawa (Yaminahua) 71, Maps 14-17, 
20-21 
Yamphe [Bodic: Kiranti] 166, 214, 
Map 51 
Yamphu [Bodic: Kiranti] 165-6, 214, 
Map 51 
Yana 7-8, 25, Map 4 
Yanam-Ninam (Ninam) 77, Maps 12-13 
Yanda 101, 117, Map 35 
Yanda Dom 328, Map 104 
Yandang 328, Map 98 
Yandjibara 101 
Yandruwandha 101, 117, Map 35 
Yanesha see Amoesha 
Yang Sek see Rianglang 
Yangavok 328, Map 93 
Yangbo see Nyangbo 
Yangere 328, Maps 87, 90 
Yangga 101, 117, Map 35 
Yanghuang see Rao [Kam-Sui] 
Yanglam see Rianglang 
Yangman Map 35 
Yangulam 112, Map 33 
Yankton-Yanktonai 14, Map 5 
Yankunjatjara 100, 116, Map 35 
Yanomaman 77, Maps 12-13 
Yanomam(a) (Ianomami) 77, 
Maps 12-13 
Yans (Yansi) 328, Maps 81, 83 
Yanyuwa 101, 117, Map 35 
Yao [Bantu] 302, 312, 328, Maps 75-76 
Yao [Káriban] 76, Map 12 
Yao [Miao-Yao] 181, 183, Maps 45-47 
Baiku Yao (Baonao) 183 
Yaohu'er see Yugur 
Yapa 100, 116, Map 35 
Yapese 108, Map 28 
Yapoma 328, Map 92 
Yapunda 113, Map 33 
Yaga (Huaylla) 70 
Yaqay [Papuan] 114
Yaqay 114, Map 34 
Yaqui (Cahita, Yoeme) [Uto-Aztecan] 9, 
25, Maps 3, 8-9 
Yaqui (Yaki) [Sonoran: Cahita dialect] 54 
Yaquina see Alsea 
Yarawata 112, Map 33 
Yaraytyana [Uradhi dialect] 101, 117, 
Map 35 
Yardliyawara 102, 117, Map 35
Yareban 99, 110-11, Maps 31-32 
Yareba 110-11, Maps 31-32 
Yarli 102, 117, Map 35 
Yarluyandi 101, 117, Map 35 
Yaruma 76, Map 18 
Yarumarra 101, 117, Map 35 
Yaruro see Jaruro 
Yasa 328, Maps 88, 92 
Yasem 328, Map 93 
Yashi-Eggon Maps 96-97 
Yashi 328, Map 96 
Yasi see Mayo 
Yasing [Mbum-Day: N. Mbum] 328, 
Maps 90-91, 94 
Yasung (Yasug) [Bantu: Basaa] 328, 
Map 92 
Yasyin see Mayo 
Yatasi Map 2 
Yatye 329, Map 95 
Yau 110, 112-13, Maps 32-33 
Yaul 113, Map 33 
Yauma 329, Map 84 
Yaunde see Ewondo 
Yaur 115, Map 34 
Yaure [Mande] 329, Map 105 
Yauyos Kechua (Kichua, Quechua) 70, 
Maps 20-21 
Yava 99, 108, 114-15, Map 34 
Yavapai, Yavepe see Yuman, Upland 
Yavitero 66, Map 13 
Yaw 171, 175, Map 49
Yawalapiti (Iaualapiti) 66, Maps 18-19 
Yawan 69, Maps 14-17 
Yawa (Yagua) 69, Maps 12, 14-17 
Yawaperi (Atruahi) 76, Maps 12-13, 
16-17 
Yawarrawarrka 101, 117, Map 35 
Yawelmani see Yowlumne 
Yawiyuha see Yabiyufa 
Yawuru 116, Map 35 
Yay (Nhang, Giay) 179, 184, Map 47 
Yaygir 102, 118, Map 35 
Yazgulami 236-7, Map 57 
Yazi 329, Maps 78, 80 
Yazoo Map 2 
YEAI [Yoruboid-Edoid-Akokoid-Igboid] 
306, Maps 95-96, 101 
Yebokolo 329, Map 87 
Yebu see Awak 
Yedina-Kotoko Maps 90, 94, 99 
Yadina 328, Map 99 
Yega 110-11, Maps 31-32 
Yei 329, Maps 74, 77 
Yeji see Chumburung 
Yeke 329, Map 77 


Yekhee see Etsako 
Yekora 111, Map 32 
Yela [Niger-Congo] 329, Maps 80-81 
Yela [Papuan] see Yele-West New Britain, 
Yele 
Yele-West New Britain 99, 109-10, 
Maps 30-31 
Yele (Yela, Yeletnye) 99, 108-10, 
Maps 30-31 
Yeletnye see Yele-West New Britain, Yele 
Yellow River 113, Map 33 
Yellow Uighur see Yugur, Western 
Yelmek-Maklew 99, 115, Map 34 
Yelmek 115, Map 34 
Yemba (Dschang) [Grassfields] 329, 
Map 93 
Yembe [Bantu: Kongo] 329, Maps 81, 84 
Yeméan (Komerkrudo) 51 
Yemeni [Arabic subgroup] 279-80, 
Map 70 
Eastern dialects 280, Map 70 
Southern Plateau dialects 280, 
Map 70 
Yemsa 289, Map 72 
Yemsa (Yem, Janjero, Zinjero) 289, 
Map 72 
Yendang Map 98 
Yenisei see Yenisey 
Yeniseian 159 
Yenisey Ostyak see Ket 
Yenisey Samoyed see Enets 
Yenisey Tatar see Khakas 
Yerakai 113, Map 33 
Yerani see Gorovu 
Yeretuar 115, Map 34 
Yerong 181-2, 184, Map 45 
Yeskwa 329, Maps 96-97 
Yessan-Mayo see Mayo 
Yetfa see Biksi 
Yey 111, 115, Maps 32, 34
Yeye Maps 74, 77 
Yezed [Kurdish variety] 235 
Yholmo 214 
Yi (Yipho) [N. Ngwi] 172 
Black see Nasu, Central 
Central 172 
Eastern see Nasu 
Northern see Nosu 
Red see Naisu, Western 
South-Eastern 172 
Southern see Nisu 
Western 172 
Yiddish 27, 232-6, 260 
Yidena see Yedina-Kotoko, Yadina 
Yidgha 211, 216, Map 53
Yidinyic 101, 117, Map 35 
Yidiny 101, 117, Map 35 
Yil 113, Map 33 
Yilba 101, 117, Map 35 
Yiltawangka 100, 116, Map 35 
Yiman 101, 117, Map 35
Yimas 113, Map 33 
Yimas Pidgin 105, Map 25 
Yimchungrü (Yimchungre) 167, 175,
213, Maps 49, 51 
Yinbaw (Ka-ngan) 174 
Yindjibarndi 100, 116, Map 35 
Yingkarta 100, 116, Map 35 
Yinhawangka 100, 116, Map 35 
Yinibu see Iteri; Rocky Peak 
Yiningayi 101, 117, Map 35
Yinjilanji 101, 117, Map 35 
Yintale 174-5, Map 49 
Yinuo 172 
Yinwum 101, 117, Map 35 
Yir see Ong 
Yir-Thangedl 101, 117, Map 35 
Yir-Yoront 101, 117, Map 35 
Yira 329, Map 79 
Yirandhali 101, 117, Map 35 
Yis 113, Map 33 
Yitha-Yitha 102, 118, Map 35 
Yiwom see Gerka 
Yo (Nyo) [Tai-Kadai] 180, 184,
Maps 47-48 
Yobi Map 31 
Yobin see Lisoid, Lisu 
Yoeme see Yaqui 
Yogad 119, Map 40 
Yogoma 329, Map 80 
Yoidik 112, Map 33 
Yokot'an, Yokotan see Chontal [Mayan] 
Yokuts 7-8, 25, Maps 4, 9 
Valley Yokuts 25, Maps 4, 9 
Yolngu 100, 116, Map 35 
Yom-Nawdm Maps 102, 104 
Yom-Tanla 329, Maps 102, 104 
Yombe [Bantu: Kongo] 329, Maps 83-84, 
88 
Yombo [Bantu: Bushong] 329, Map 81 
Yoncalla 16, Maps 4, 9 
Yonggom 110, 112, 114, Maps 32-34 
Yongren Dai 179-80, 184, Map 46 
Yongxun [Yue dialect] 160 
Yoni 329, Map 106 
Yorta-Yorta 102, 118, Map 35 
Yorta-Yorta 102, 118, Map 35 


Yoruboid 302, 305, 306, 331-2, Maps 96, 
101-102 
East Map 102 
West 
Central Yoruba 315, Maps 101-102 
North-East Yoruba 324, 
Maps 101-102 
South-East Yoruba 327, 
Maps 101-102 
South-West Yoruba 327, Map 101 
West Yoruba 328, Map 102 
Yoruba as lingua franca 303, 310, 
311-12 
Yoruma 329, Map 99 
Yotafa-Sarmi 108, 115, Map 34 
Yotafa 115 
Yotafa 115, Map 34 
Younuo 183-4, Map 45 
Yowlumne (Yawelmani) [Yokuts dialect] 
25 
Yoy [Yay dialect] 179 
Yuaga 108, Map 27 
Yuan (Nyuan, Kam Myang) 180-1, 184, 
Map 47 
Yuapin see Jaruro 
Yuat 99, 113, Map 33 
Yucateco see Yukateko 
Yuchi (Euchee) 7, 9, 25, Maps 1-2, 8-9
Yucuna see Jukuna 
Yue see Chinese (Sinitic), Southern 
Yué (Kotoname) 51 
Yugambal 102, 118, Map 35 
Yugh 233-4, 239 
Yugul 116, Map 35 
Yugulda 116, Map 35 
Yugum 329, Map 93 
Yugur (Yaohu'er, Weiwu(erh), Weiguerh) 
192 
Eastern 186, 191-2, Map 45 
Western (Saryg Yugur) 186-7, 192, 
Map 45 
Yuin-Kuric 102, 118, Map 35 
Yuin 102, 118, Map 35 
Yukaghir 231, 233-4, 239, Map 55 
Yukatekan/Maya 50, 52, Maps 10-11
Yukateko (Yucateco) 49-50, 
Maps 10-11 
Yukian 7-8, 10, Maps 4, 8-9 
Yuki 25, Map 4 
Yukpa-Japreria 75, Maps 12-13 
Yukpa 75 
Yukuben Map 95 
Yukuben 329, Map 95 
Yukuna see Jukuna 
Yulparija 100, 116, Map 35 
Yulu 284, 329, Map 89 
Yulu-Binga 283, 284, Map 72
Yulu-Binga dialects Maps 72, 79 
Yuman 8, 46, Maps 3-4 
River Maps 3-5, 8-9 
Upland 8, 24, Maps 3-5, 8-9 
Yuma see Quechan 
Yuma-Cochimi 51 
Yumbri see Mlabri 
Yumu 100, 116, Map 35 
Yumukuntyi [Uradhi dialect] 101, 117, 
Map 35 
Yunggor 116, Map 35 
Yungur Maps 91, 94, 98 
Yungur 329, Maps 91, 98 
Yungqay 70, Map 14 
Yupik (Western Eskimo) 7, Maps 7-8 
see also Alaskan Yup'ik, Central; 
Alutiiq; East Cape Yupik; Siberian 
Yupik, Central 
Yupiltepegue 54 
Yupina Map 32 
Yupna 110, 112, Map 33 
Isan (Yupna) 110, 112, Maps 32-33 
Yupngayth 101, 117, Map 35 
Yupua-Durina 68, Maps 12, 16 
Yurak Samoyed see Nenets 
Yurakare (Yuracaré) 70, Maps 20-21 
Yuri [Amazonian] see Juri-Tikuna, Juri 
Yuri [Papuan] 113, 115, Maps 33-34 
Yurimangi 63 
Yurlayurlanya 101, 117, Map 35 
Yurok 8, 25, Maps 4, 8-9
Yurta Tatar see Tatar 
Yuru 101, 117, Map 35 
Yuta-Nawan 51-2, 54, Maps 10-11 
see also Uto-Aztecan 
Yuwana see Hoti 
Yuwibara 101, 117, Map 35 
Yuyu 102, 118, Map 35 


Zaar Maps 96-98 

Zaba 170, 175, Map 45 

Zacapulteco see Sakapulteko 

Zacatec Map 3 

Zaghawa 283, Maps 71, 73 

Zahao see Laizo 

Zaiwa 169, 175, Map 51
see also Atsi; Zha 

Zakan 329, Map 93 

Zakhchin 188, Map 45 

Zakho 278, Map 70 


Zalage 329, Map 87 
Zaman 329, Maps 87-88 
Zamba 329, Maps 82, 87 
Zamboangueño [Chavacano dialect] 103
Zamuco see Samukoan 
Zan [Kartvelian] 238 
Zan Gula 329, Map 89 
Zanaki 329, Map 86 
Zande 332, Maps 79, 82, 89 
Zande 282, 329, Maps 79, 82, 89
as lingua franca 303, 330 
Zangskari 164, Map 54 
Zangwal 329, Maps 97-98 
Zanniat 168 
Zanzibar Unguja see Unguja 
Zanzibari [Swahili variety] 303 
Zaomin [Yao variety] 183 
Zaozou 169, 173, 175 
Zaparo see Sáparo-Yawan, Saparo-
Konambo
Zapotecan see Sapotekan 
Zapoteko see Sapoteko 
Zaramo 306, 309-10, 329, Maps 76, 85 
Zarek see Izarek 
Zari 329, Maps 96-97 
Zarma Kabi Ful [Atlantic] Map 102 
Zarma [Nilo-Saharan] 329, Maps 73, 102, 
104 
Zay see Zway 
Zayein (Latha) 174 
Zayse-Zergulla 288, 289, Map 72 
Zayse 289 
Zaza [Kurdish variety] 233, 237 
Zazao 109, Map 29 
Zeem 329, Maps 96-97 
Zela [Luba] 329, Maps 77-78 
dialect 329 
Zelgwa-'Dugwor (B) [S. Mafa-Matakam 
subgroup] Map 94 
Zelgwa 329, Map 94 
Zeliang 167-8, 213 
Zeliangrong 167-8, Map 51 
Zeme (Naga C2) 168 
Zeme (Empeo) 167 
Zemi 213 
Zenaga 282, Map 73 
Zenap see Chenapian 
Zenzontepec Chatino 53 
Zergulla see Zayse-Zergulla 
Zezuru 329, Maps 74-75 
Zha (Zaiwa) 169, Maps 45-46 
Zhangzhung 165 
gZhiskartse (Shigatse) 164 
Zhu [Khoisan] see Ju 
Zhu-!Hoa (Zhu-|Hoasi) 329, Maps 74, 
84 
Zhuang 183, Maps 45-46
Northern (Buyi) 179, 184, Maps 45-46 
Southern (Nung) 169, 179, 184, 
Maps 45-47 
Zi see Atsi 
Zia [Keresan] see Rio Grande Keresan 
Zia [Trans-New Guinea] 110-11, 
Maps 31-32 
Ziba 329, Map 86 
Zigua 329, Maps 76, 85-86 
Zilmamu-Olam 289 
Olam (Alam) 289-90 
Zilmamu (Mashi, Bale, Tsilmamo) 
284, 289-90 
Zimakani 111, Map 32 
Zime (Herde) 329, Maps 90-91, 94 
Zinjero see Yemsa 
Zinza Map 86 
Zinza (Jinja) 318, Maps 80, 86 
Ziziliveken 329, Map 94 
Zo'é see Jo'é 
Zoe Map 3 
Zolamnai 168 
Zombo Maps 81, 83-84 
Zomi 168 
Zongbi 329, Map 91 
Zophei 168 
Zoque see Soke 
Zorwang (Jerwang, Tvluq) 169, 175, 
Map 49 
Zost (Josotu) [Inner Mongolian dialect] 
191 
Zou 213 
Zua 329, Map 93 
Zubaki 329, Map 85 
Zubaydiyya 281 
Zufar 280, Map 70 
Zula [Bantu: Luba] 329, Maps 78, 
80 
Zulu [Bantu: Nguni] 302, 312, 329, 
332-3, Maps 74-75 
Zumaya 329, Map 94 
Zumbun see Jimbin 
Zuni 7,9, 26, Maps 3, 5, 8-9 
Zürchid see Khamnigan
Zuruaha (Suruaha) 67 
Zuwara Maps 71, 73 
Zway (Zay) 288, Map 72 
Zyryan see Komi 


Sassaorta [Saho dialect] 286 
Aasen, Ivar 263

Aba Prefecture 162, 170 

Abkhazia 231, 235, 238 

Aboriginal languages 115-118 

Addis Ababa 288, 296 

Aden 280, 294 

Admiralty Islands 109, 123, 136 

Adygeya 233 

affix-transferring 115 

Afghanistan 211, 216, 237, 238, 227 

agglutinating languages 108, 109 

Agvaandorj 189 

Aksu Prefecture 186 

Alabama 10, 23, 36, 43 

Alaska 10, 12, 14, 15, 16, 21, 23, 26, 40, 
41, 42 

Alaungsithu 171 

Alava 262 

Albania 259-60, 271 

Alberta 11, 12, 14, 18, 20, 21, 22, 23, 40, 
41, 42 

Albuquerque 10, 16, 21, 23, 37 

Aleppo 277, 279, 280 

Aleutian Islands 10, 41, 42 

Alexander of Rhodes 176 

Alexandria 277, 281 

Algeria 277, 281, 282, 298 

Algiers 277, 298 

Ali Prefecture 164 

Almora 165 

Alor 99, 100, 119, 125 

alphabet 187, 188, 189, 190, 232-9 
passim, 260, 262, 274, 282, 285, 286, 
287, 303 

Altay Republic 233, 240 

Amarro mountains 287 

Amazon 68, 86, 88 

Ambon 161 

Amoy 160, 201 

Anambas Islands 152 

Anatolia 277 

Anawrahta 171 

Anchorite Islands see Kaniet Islands 

Andaman Islands 97, 126, 174, 212, 225 

Andes 62, 69, 80 

Andhra Pradesh 213, 217, 224-5 

Andorra 262, 273 

Anglican Mission 105 

Angola 301, 302, 307, 308, 310, 311, 333, 
350 

Anguilla 27 

Anning County 172 

Antigua and Barbuda 27 

Apalachee Bay 10 

Arabia 287 

Arabian Peninsula 287, 294 

Aragon 261, 262 

Arakan 171 

Aral Sea 2, 248 

Argentina 27, 69, 71, 72, 74, 92, 93

Arizona 13, 18, 19, 20, 21, 23, 24, 25, 37, 
38, 39, 43 

Arkansas 11, 36, 43 

Armenia 235, 237, 252-3, 278 

Arnhem Land 103, 107, 124 

Aruba 27 

Arunachal Pradesh 165, 166, 167, 169, 
180, 213, 217, 223

Asia Pacific Christian Mission 105 

aspect (grammatical) 110, 111, 114, 236, 
278, 282 

Assam 166, 180, 213, 217, 223 

Asturias 262 

Aswan 280 

Atlas mountains 282 

Austral Islands 102, 106 

Australasia 159, 166-7, 169, 180, 129, 130 

Australia 1, 97, 100, 102-3, 104, 107, 111,
115-18, 129, 130, 144-5, 146, 160, 161, 
176, 177, 183, 186, 260 

Austria 2, 260, 261, 262, 270 

Azerbaijan 216, 235, 237, 238, 239, 252-3, 
278 

Azores 262, 273 


Badulla District 214, 225 
Baffin Island 14, 15 
Baghdad 277, 279, 294 
Bahamas 27, 43, 82 
Bahrayn 279, 280, 294 


Baja California 13, 14, 15, 16, 21, 22, 37, 
38 

Balfour Declaration 278 

Bali 109, 121, 153 

Baltistan 164 

Bangalore 214 

Bangkok 159, 180, 183, 184, 206 

Bangladesh 2, 166, 167, 168, 171, 178,
211, 293, 214, 215, 217, 223

Banks Islands 102, 106, 133 

banner (kojuun) 191 

Barbados 27, 82 

Barisal 215 

Bas DRC 330, 349 

Basilan 103 

Basra 279 

Bass Strait 107 

Batavia 103 

Batticaloa District 215, 225 

Bavaria 261 

Becher Bay Reserve 11 

bedouin 277-81 

Beijing 159, 160, 161, 187, 189, 193, 201 

Beirut 277, 280 

Belarus 234, 235, 236, 251 

Belize 27, 49, 51, 52, 56, 57, 61, 66, 83 

Ben Yahuda, Eliazer 278 

Bender, Lionel 306, 307 

Benedict, Paul K. 163, 165, 169, 171, 
179 

Benelux 262 

Benin 282, 311, 314, 331, 332, 333, 368 

Bering Island 13 

Bering Strait 12, 16, 245 

Bermuda 27 

Bhamo 157, 171 

Bhutan 163, 164, 165, 166, 168, 211, 215,
217, 223

Big Cypress Reservation 18 

Bihar 213, 217, 222 

Binchuan County 172, 173 

Bismarck Archipelago 98 

Blackfeet Reservation 11 

Blood Reserve 11 

Blue Nile 282, 284 

Boas, Franz 11, 12, 24 

Bolivia 27, 62-8 passim, 70-4 passim, 90,
91, 184

Bombay (Mumbai) 213, 214, 224 

Borneo 103, 104, 120-1, 130, 152 

Bornu 281 

Bosnia-Herzegovina 231, 231, 233, 236,
255

Botswana 311, 329, 333, 340 

Bougainville 108, 109, 135 

Brahmaputra valley 166, 169 

Brahmi script 217 

Brazil 27, 61, 63, 65-77 passim, 82-9, 92,
93

Brighton Reservation 13, 18 

British Columbia 11-20 passim, 22, 23,
24, 40, 41

British Indian Ocean Territory 216 

British Isles 263, 268 

British Virgin Islands 27 

Broman Lake 11 

Brunei 152, 182-3 

Buena Vista Lake 11 

Bulgaria 232-3, 236, 237, 255 

Bulkley River 11 

Burkina Faso 305, 311, 312, 327, 332, 370 

Burma (Myanmar) 97, 160, 163, 164, 166, 
167, 168, 169, 171-2, 173, 174, 176, 177, 
180, 181, 182, 183, 194, 195, 206, 207,
217 

Burns Lake 11 

Burundi 310, 311, 312, 330, 333, 346 

Buryatia 233 

Buryat Republic 239 


Cachar Hills 166, 168 

Caddo County 11, 16, 25 

Cahto Valley 16 

Cairo 277, 280, 281, 295 

Calabria 261 

Calcutta (Kolkata) 213, 214, 215, 223 

Calgary 22, 23, 41, 42 

California 9-26 passim, 37, 38, 43 

Cambodia 160, 176, 177, 180, 181, 182, 
194, 195, 206 


General index 


Cameroon 281, 301, 303, 306, 308, 309, 
311, 312, 330, 334, 353, 353, 356, 357,
358, 359, 360 

Camp Verde Reservation 25 

Canada 7-27 passim, 35, 40-3, 159, 160,
162, 176, 177, 260

Canaries 262, 273 

Cape Breton Island 18 

Cape Cod 16, 18 

Cape Mendocino 18 

Cape Mudge 17 

Cape Verde Islands 262, 273 

Cape York Peninsula 103, 105, 115 

Carmel Valley 14 

Caroline Islands 97, 102 

Carpentaria, Gulf of 100 

Carry the Kettle Reserve 10 

Casablanca 277, 298 

case 184, 185, 186, 187, 191 

Caspian Sea 216, 251, 253 

Castile 262 

Catalonia 262 

Caucasus 231, 234, 237, 238, 239, 252, 
278 

Cavite Province 103 

Cayman Islands 27 

Cenderawasih Bay 114, 115 

census statistics 162, 181, 188, 190, 191, 
212, 213, 214, 215, 216, 231-6, 260 

Central African Republic 282, 284, 311, 
312, 330, 333, 353, 395, 396

Central Asia 235, 237-8, 240 

Ceylon see Sri Lanka 

Chad 281-5 passim, 295, 301, 307, 308, 
312, 330, 331, 335, 355, 356, 360, 365

Chagos Islands 216 

Champa kingdom 182 

Chandra River 165 

Changsha 160, 161

Channel Islands 263, 268

Chatham Island 132

Chaozhou 160, 161, 162, 181

Chechnya 233, 239

Chehalis Reservation 24

Chennai see Madras

Chesapeake Bay 19

Cheyenne River Reservation 12, 22

Chhattisgarh 213, 222, 224

Chile 27, 90, 91, 92, 93

China 97, 159-63 passim, 165, 167-76
passim, 185-9 passim, 191, 192, 193,
194, 200-1, 202-3, 237, 238, 239

Chinab River 165

Chinese characters 159, 162

Chin State 168

Chittagong 215, 223

Chittagong Hill Tracts 166, 171

Choiseul 102, 104, 105

Chukchi Autonomous District 14, 22

Chukotka Peninsula 14 

Chuvashia 234 

classifier 160, 171, 184, 185 

cluster 46, 62 

Cocos Islands 211, 216 

Colombia 27, 61, 62-6 passim, 68, 69, 75,
82, 83, 84, 85, 86, 87 

colonial languages 63 

Colorado 13, 24, 39, 43 

Columbia River 11, 13, 16, 17, 21, 22, 26 

Colville Reservation 13, 19, 21 

Comoro Islands (Comoros) 211, 216,
225, 311, 312, 336, 329

Congo 303, 330 

Congo, Democratic Republic of 303, 309, 
311, 312, 333, 344-9 

Connecticut 17, 18, 19, 21, 35, 43 

Constantine 281 

contact languages 25 

Cook Islands 107, 130 

Copenhagen 15, 269 

Copper River 10, 14 

Corsica 261-2, 272 

Costa Rica 27, 49, 53, 56, 57, 64, 83 

Cote d'Ivoire 311, 312 

Cowlitz River 13 

creole 26, 27, 103, 104, 106, 107, 120, 159, 
162, 193, 214, 215, 216, 262, 263, 308, 
309, 321, 328, 330, 332, 333

Crete 271 

Crimea 234, 238, 251 



Croatia 233, 236, 237, 255, 260 

Crow Creek Reservation 22 

Crow Reservation 13 

Cuba 27, 66, 43 

Cyprus 233, 237, 260, 263, 271, 280 
Cyrillic 179, 186-90 passim, 192 
Czech Republic 231, 232, 236, 237, 256 


Dadra & Nagar Haveli 213, 224 

Daghestan 234, 239 

Damascus 277, 278, 279, 280 

Dante Alighieri 261 

Dar es Salaam 303, 309, 342 

Darfur 281, 282, 283, 284 

Darjeeling 163, 164, 166, 168 

Darwin 145, 146, 161 

Delaware 24, 35, 43 

Delaware River 19 

Delhi 113, 114, 228 

Déline (Fort Franklin) 11, 14 

Denmark 14, 261, 263, 264, 269 

D’Entrecasteaux Islands 102, 105, 109, 137

Devils Lake Reservation 22 

Dhaka 215 

Dhofar 279, 280, 281, 287 

dialect 1, 2, 10-25 passim, 50, 51, 52, 53,
54, 61, 62, 63, 64, 66, 67, 68, 69, 73, 74, 
76, 97, 101-6 passim, 160-74 passim, 
176-92 passim, 194, 211, 212, 214, 215, 
216, 217, 231, 236-40 passim, 259-64 
passim, 277-90 passim, 301, 302, 303, 
304, 308, 314, 329, 330, 331, 332 

dialect chain 45, 277-81 passim, 303 

Diego Garcia 216 

diglossia 159, 164, 171, 184 

Diomede Islands 14, 16 

Dixie Valley 11 

Dixon, Roland B. 9 

Djibouti see Jibuti 

Dominica 27, 43, 83 


Easter Island 98, 107, 132 

Eastern Cape 332 

Eastern Sepik Province 105 

East Malaysia 152, 181, 182, 183, 194 

East Timor 98, 99, 104 

Ecuador 27, 62, 63, 64, 68, 69, 70, 82, 83,
84, 85

Eel River 14, 25 

Eglise Protestante 106 

Egypt 277-82 passim, 295, 384, 385 

Ehret, Christopher 306, 307 

El Salvador 27, 49, 50, 54, 56, 57, 65 

emergent language 10-25 passim, 45, 46 

England 263, 268 

Equatorial Guinea 301, 302, 309, 311, 
326, 330, 332, 333, 354

ergative 115 

Erhai Lake 170 

Eritrea 281, 284, 285, 286, 287, 289, 290,
296-7

Eshan 172, 173 

Espiritu Santo 107 

Estonia 232, 238, 250 

Ethiopia 277, 278, 279, 282-90 passim,
296-7, 301, 304, 308

Euphrates 277, 278, 279, 294 


Falkland Islands 27, 92, 93 

family 7-10, 11-25 passim, 49, 51-54, 62,
65, 67-77 passim, 97-102, 105, 110,
111, 114, 115, 118, 159, 161, 162, 163,
178, 179, 183, 184, 187, 212, 213,
214-16, 231, 236-9, 259, 260, 277, 278,
279, 281-8 passim, 304-8 passim, 329,
330, 331

Faroe Islands 263, 264, 269 

Fiji 106, 130, 131, 132 

Finland 238, 263, 269 

Flathead Reservation 17, 23 

Flinders Island 101, 107, 117 

Florida 10, 11, 13, 18, 22, 23, 36, 43 

Fly Delta 105 

Fly River 105 

Foley, William A. 98 

Forbidden City 187 

Fort Apache Reservation 25 

Fort Belknap Reservation 10, 15 

Fort Berthold Reservation 10, 15, 18 

Fort Franklin see Déline 


Fort Good Hope Reservation 15 

Fort Hall Reservation 20, 22 

Fort McDermitt 20 

Fort McDowell Reservation 25 

Fort McPherson Reservation 55 

Fort Norman see Tulita 

Fort Peck Reservation 10, 22 

Fort Wrigley 19, 23 

France 261, 262, 274 

Fraser River 12, 15, 17, 22, 23 

Free State 332 

French Guiana 27, 66, 74, 75, 76, 82, 83, 
183 

Friesland 262 

Fugong County 169, 170, 173 

Fujian Province 160, 161, 162, 182 

Fumin County 172, 173 

Fuzhou 160, 161 


Gabon 303, 309, 311, 330, 353, 354 

Galapagos Islands 85 

Galicia 262 

Galveston 10, 16 

Gambia 302, 311, 312, 302, 303, 374 

Gansu 160, 163, 169, 170, 186, 191, 192, 
193 

Ganzi Prefecture 170 

Garo Hills 166 

Gauteng 332 

gender (grammatical) 108, 111, 118 

Genghis Khan 188, 189, 239 

Georgia (Europe) 216, 235, 237, 238, 239, 
252-3, 278 

Georgia (USA) 23, 26, 34, 36, 43 

Georgian Bay 14, 15, 18 

Germany 2, 186, 236, 237, 260, 261, 270 

Ghana 310, 311, 315, 331, 332, 333, 369 

ghost languages 49 

Gibraltar 262, 273 

Gila Bend Reservation 20 

Gila River Reservation 18 

Goa 213, 224 

Gogodala 99, 102, 105 

Gozo 281, 298 

Grand Ronde Reservation 14, 16, 18, 23, 
24 

Great Bear Lake 14, 41, 42 

Greater Bombay 214, 224 

Great Lakes 12, 34 

Great Slave Lake 14 

Greece 233, 236, 237, 259-60, 271 

Greenberg, Joseph H. 9, 10, 61, 306 

Greenland 15, 16, 41, 42, 263 

Grenada 27 

Grierson, G.A. 165, 212, 216 

Guadalcanal 102, 104, 109 

Guadeloupe 27 

Guangdong 160, 161, 162, 179, 181, 183 

Guangxi 160, 161, 162, 169, 172, 174, 
176, 179, 182, 183, 193

Guangzhou 160, 161, 162, 201 

Guatemala 27, 49, 50, 51, 52, 54, 56, 57, 
61, 66 

Guernsey 263 

Guinea 301, 302, 309, 311, 312, 314, 317, 
332, 333, 375

Guinea Bissau 309, 312, 315, 322, 325, 
333, 374 

Guipúzcoa 262

Guizhou 161, 172, 179, 183, 192, 203 

Gujarat 213, 224, 228 

Gulf of Mexico 11, 56, 57 

Guthrie, Malcolm 306, 

Guyana 27, 64, 66, 73, 75, 82, 83 


Haas, Mary R. 9, 19, 24 

Hainan Island 97, 160, 161, 179, 182, 183, 
193, 201 

Hainan Province 160, 161, 179 

Haiti 27, 43, 83 

Halmahera 97, 98, 103, 104 

Hamadan 278 

Harappa 217 

Harrington, John P. 12, 13, 14, 17, 21 

Haryana 228 

Haute Zaire see Orientale 

Hawaii 102, 106, 131 

Hawaiian Islands 102, 106, 130, 131, 132 

Hebei 160, 192 

Heilongjiang Province 186, 187, 192 
Heqing County 172, 173

Hermit Islands 136 

‘high language’ 121 

High Plains 10, 11, 12, 15, 19

Himachal Pradesh 165, 213, 228 

Hoijer, Harry 24 

Hokkaido 185-6, 208 

Holland 329 

Hollywood Reservation 18 

Honduras 27, 49, 50, 52, 53, 54, 56, 57, 
61, 64, 65, 82, 83 

Hong Kong 159, 160, 161-2, 176, 177, 
183, 193, 194, 195, 201 

Hongta 172 

honorific forms 184-5, 194

Honshu 185, 208 

Hoopa Valley Reservation 15 

Horn of Africa 2, 277, 285, 296 

Hudson Bay 14, 41, 42 

Hudson River 17, 19, 35 

Huidong 173, 183 

Hunan 160-1, 170, 183 

Hungary 233, 236, 238, 255, 260 

Hunza valley 217 

Huon Gulf 111, 113 

Huon Peninsula 98, 105 

Hyderabad, India 214, 224 


Iceland 263-4, 269 

Idaho 13, 17, 19, 20, 22, 23, 38, 39, 43

Illinois 18, 34, 36, 39, 43 

Immokalee Reservation 18 

India 2, 161-169 passim, 171, 172, 174, 
178, 180, 181, 211, 212-214, 215, 216, 
222-5, 228 

Indiana 18, 34, 36, 43 

Indian Island 14 

Indian Ocean 211, 215, 216, 225 

Indonesia 97, 98, 102, 103, 114, 120, 131, 
132, 147, 148, 149, 153, 154, 159, 160, 
182, 183 

Indus valley 217 

Ingushetia 234, 239 

Inner Mongolia 187, 188, 189, 191, 192, 
193 

inscription 159, 165, 167, 168, 171, 177, 
181, 188 

invasion zones 62 

Iowa 22, 39, 43 

Ipoh 160 

Iran 211, 216, 237, 238, 239, 277, 278,
226-7 

Iraq 237, 239, 277, 278, 279, 280, 294 

Ireland, Republic of 263, 268 

Irian Jaya 98, 103, 104, 105, 114, 142-3 

Isabella Reservation 20 

Islamabad 212, 227 

Isle of Man 263, 268 

isolates 7, 9-17 passim, 22, 24, 25, 50, 51, 
53, 61, 62, 64, 73, 76, 99, 114, 185, 211, 
217, 231, 239, 259, 262, 285, 306, 307,
331 

Israel 260, 277, 278, 279, 280, 294 

Istanbul 233, 237, 254 

Istria 232 

Istrian peninsula 260 

Italy 2, 260, 261, 262, 272 

Ivory Coast 301, 302, 332, 333, 371 


Jaffna District 215, 225 

Jakarta 103, 121, 153 

Jalpaiguri District 166, 169 

Jamaica 27, 66, 82, 263 

James Bay 14, 19 

Jammu & Kashmir 212, 213, 228 

Jamtsarano 189, 239 

Japan 2, 159, 162, 179, 184-5, 186, 194, 
208 

Java 103, 104, 105, 121, 130, 153, 161 

Jehovah’s Witnesses 105 

Jersey 263 

Jesuit missions 13, 14, 15 

Jharkhand 213, 217, 222 

Jianghuai 160, 161 

Jiangxi 160, 161, 162 

Jibuti 285, 286, 297 

Jidda 280 

Jin dynasty 187 

Jinning 172 

Jochi 239 

Jordan 278, 280, 294 


Kabardino-Balkaria 234 

Kachin State 171 

Kalimantan 152, 160, 161 

Kalispel Reservation 23 

Kalmyk Republic 239, 240 

Kalmykia 234 

Kamchatka 13, 26 

Kaniet (Anchorite) Islands 136 

kanji 184, 194 

Kansas 12, 13, 15, 16, 18, 19, 21, 22, 24, 
36, 39, 43 

Karabagh 231, 235, 237 

Karachay-Cherkassia 234, 235, 238 

Karakalpakstan 236 

Karbi Hills 168 

Karelia 234 

Karnataka 213, 224-5 

Kashmir 163, 164, 212, 213, 228 

Katanga 329 

Kathmandu valley 163, 164, 165 

Kawawachikamach Reserve 25 

Kazakhstan 159, 186, 200, 235-9 passim, 
247, 248-9, 251 

Kelantan 181 

Kenadiid, Cismaan 285 


Kenai Peninsula 21, 23 

Kentucky 34, 36, 43 

Kenya 282, 284, 285, 286, 287, 289, 296-7, 
301, 303, 306, 310, 311, 312, 316, 330,
333, 351, 352

Kerala 213, 225 

Khakasia 234 

Khulna 215 

Kilinochchi District 215 

Kinnaur 164, 165 

Kiribati 130, 132, 134 

Kivu 330 

Klamath River 16, 22 

Kleinschmidt, Samuel 264 

Kola peninsula 263 

Kolkata see Calcutta 

Komi Republic 234 

Kordofan 281, 283-4 

Korea 2, 159, 184, 185, 194 

Korea, Democratic People’s Republic of 
(North Korea) 185, 194, 201, 208 

Korea, Republic of (South Korea) 185, 
194, 201, 208 

Kosovo/Kosova 231, 233, 237, 259, 260 

Kowloon 161 

Kroeber, A.L. 9, 19 

Kuala Lumpur 103, 155 

Kublai Khan 189 

Kuching 161 

Kunming 170, 172, 173 

Kuomintang 160 

Kurdistan 277, 278 

Kuria Muria islands 281 

Kuwait 278-80, 294 

Kwazulu-Natal 332 

Kyanzittha 171 

Kyoto 185, 208 

Kyrgystan 200, 216, 236, 238, 239, 250 

Kyrgystan Republic 240 

Kyushu 185, 208 


Labrador 13, 14, 16, 19, 41, 42 

Labuan 162, 194 

Laccadive Islands see Lakshadweep 

Ladakh 163, 164 

Lahul 164, 165 

Lake Abaya 187, 189 

Lake Babine 11 

Lake Chad 330, 331, 365 

Lake Erie 22, 34 

Lake Huron 14, 15, 20, 34 

Lake Malawi 329, 341, 342 

Lake Tanganyika 329, 330, 346 

Lake Winnipeg 22 

Lakshadweep 225 

language and dialect 1, 45 

language and genetic group 45 

language area 45 

language complex 45 

language policies 159, 193-4 

Lanping County 169, 170, 173 

Laos 172, 173, 174, 176, 177, 179, 180, 
181, 183, 192, 204-5 

Latvia 232, 238, 250 

Lavongai 105 

league (aimag) 191 

Lebanon 277, 280, 294 

Lesotho 340 

Lesser Sunda Islands 118-19, 148 

Levant 277 

Liangshan 172 

Liberia 303, 309, 311, 320, 322, 332-3, 
372 

Libya 277, 279, 280, 281, 282, 295, 331 

Liechtenstein 2, 261, 270 

Lihir 105 

Lijiang 170, 173 

lingua franca 11, 14, 15, 26, 103, 104, 105, 
106, 107, 116, 161, 162, 164, 165, 166, 
167, 173, 181, 182, 186, 212, 214, 215, 
216, 233, 239, 278, 282, 301, 302, 303, 
309, 310, 329-34 passim 

literacy 50, 162, 278, 287, 303 

Lithuania 232, 236, 238, 250-1 

loan word 10, 216, 217, 240, 279, 280, 329 

logographic writing system 159, 170, 172,
184 

London, England 159, 195, 216, 268, 309

London, Ontario 20, 34 

London Missionary Society 105, 106 

Long Island 18, 19, 22 

Lorraine 262 

Los Angeles 14, 27, 43 

Louisiana 10, 11, 12, 17, 19, 20, 22, 24, 27, 
36, 43 

Loukotka, Cestmir 61 

Loyalty Islands 102, 106, 108, 132 

Lugu Lake 170 

Lutheran Mission 105 

Luxembourg 262, 274 


Macao (Macau) 159, 160, 161, 162, 193, 
201 

Macedonia 232, 233, 236, 237, 255, 260 

Madagascar 97, 98, 120, 130, 152, 211, 
216 

Madang 105 

Madeira 273 

Madhya Pradesh 213, 217, 222, 224, 228 

Madras (Chennai) 213, 214, 225 

Madurai District 217, 225 

Maharashtra 213, 217, 224, 228 

Maine 14, 17, 18, 35, 43 

Malacca (Melaka) 104, 104, 155 

Malacca, Straits of 154, 162 

Malaita 102, 106, 109 

Malaku see Moluccas 


Malawi 312, 329, 333, 341 

Malaysia 97, 102, 121, 131, 132, 162, 178, 
181, 182, 183, 193, 194 

Maldives 211, 215, 216, 225 

Malekula 107 

Mali 282, 285, 298, 302, 312, 314, 316, 
325, 327, 342,999,375 

Malta 261, 272, 281, 298 

Mandalay 171, 207 

Manipur 167, 168, 213, 217, 223 

Manitoba 12, 13, 18, 20, 22, 25, 40, 41 

Mannar District 215, 225 

Manouane/Manuane Reserve 11 

Marquesas Islands 108, 130, 132 

Marshall Islands 130, 132 

Martha’s Vineyard 18 

Martinique 27 

Maryland 19, 23, 34-5, 43 

Massachusetts 16, 17, 18, 35, 43 

Maty Islands see Wuvulu Islands 

Mauritania 277, 278, 281, 282, 298 

Mauritius 161, 211, 216, 225 

Mayotte 216, 225 

Mecca 279, 280, 294 

Mednyj Island 26 

Meghalaya 166, 178, 213, 217, 223 

Mekhong valley 170, 174, 176, 180, 194, 
206 

Melaka see Malacca 

Melanesia 97, 99, 101, 102, 104, 130

Melanesian Mission 106 

Melbourne 145, 146, 159

Mesopotamia 280 

Methodist Mission 105 

Mexico 9, 11, 13, 16, 20, 27, 36, 37, 38, 43, 
49, 50, 52, 53, 54, 56, 57, 61, 160 

Michigan 12, 14, 19, 20, 21, 34, 39, 43 

Micronesia 97, 102, 107, 108, 134 

Middle East 211, 238-9, 278, 280, 294 

Milne Bay Province 102, 105 

Minnesota 12, 14, 22, 39, 43 

missionary lingua francas 105 

Mississippi 10, 11, 19, 24, 36, 43 

Mississippi River 12, 19, 20, 22, 36 

Missouri 20, 39, 43 

Missouri River 10, 15, 18, 39 

mixed language 7, 13, 18, 26, 27, 159, 160, 
168, 180, 193 

Mizoram 157, 158, 213, 223 

Mohawk River 18 

Moldova 231, 234, 235, 237, 238, 239, 
251, 255, 260 

Moluccas (Malaku) 118, 147 

Monaco 262, 274 

Mongol-Buryat Republic 239 

Mongolia 186-194 passim, 200-1, 238, 
239, 240, 247 

Mongolia Autonomous Region 191 

Mongolian People’s Republic 187 

Montana 10-15 passim, 17, 23, 39, 41, 43 

Montenegro 231, 232, 236, 237, 255 

Monterey 13, 14, 22 

Montreal 14, 18, 41, 42 

Montserrat 27 

Mordovia 234 

Morobe Province 102 

Morocco 277, 281-2, 298 

Morongo Reservation 22 

Mosquito-Grizzly Bear’s Head Reserve 
10 

Mosul 278, 279 

Mota Island 106 

Mount Currie 17 

Mozambique 312, 317, 329, 333, 341 

Mpumalanga 332 

Mullaitivu District 215, 225 

multilingualism 302 

Mumbai see Bombay 

Mustang 164 

Myanmar see Burma 


Nagaland 166, 167, 212, 213, 217, 223 

Nagapatnam 215 

Nager valley 217 

Nagoya 185 

Nairobi 303, 352 

Namibia 301, 307, 312, 329, 333, 340 

Namtha Province 174 

Nanjing 160, 201 

Natchez 19 

national language 26, 27, 62, 102, 103,
159, 161, 162, 163, 165, 171, 176, 180,
182, 183, 184, 186, 188, 193, 194, 195,
211, 212, 214, 215, 218, 261, 263, 282,
285, 286, 287, 290, 303, 310, 311, 312

national orthography 285 

Nauru 102, 132, 134 

Navajo Reservation 24 

Navarra 262 

Nebraska 12, 20, 21, 22, 25, 39 

Negev 282 

Nepal 2, 161, 163, 164, 165, 166, 169, 211,
214, 216, 217, 218, 222

Netherlands 262, 274 

Netherlands Antilles 27, 82, 83 

Nevada 20, 21, 22, 24, 25, 38, 39, 43 

New Britain 97-9, 102, 104, 105, 109, 136 

New Brunswick 17, 18, 35, 41, 42 

New Caledonia 104, 106, 107, 130, 131, 
132, 133 

Newfoundland 11, 35, 41, 42 

New Georgia 102, 106, 109, 135 

New Guinea 97, 98-100, 102-6 passim, 
108-14, 115, 118, 138-41 

New Hampshire 25, 35, 43 

New Hebrides see Vanuatu 

New Ireland 97, 98, 99, 105, 109, 136 
New Jersey 19, 35, 43

New Mexico 10, 16, 17, 18, 19, 21, 23, 26, 
37, 39, 43 

New York city 19, 27, 159, 185 

New York state 17, 18, 19, 20, 22, 24, 26, 
34-5, 43 

New Zealand 107, 130, 131, 132, 160 

Nicaragua 27, 49, 50, 52, 53, 56, 57, 64, 
65, 67, 82 

Nicobar Islands 175, 178, 208, 217, 225 

Niger 282, 298, 312, 326, 331, 365, 366, 
368 

Nigeria 281, 301, 308, 309, 310, 312, 320, 
331, 333, 357, 361-8 

Nile 277, 280, 281, 282, 295, 296 

Nile delta 277, 280, 295 

Nile Valley 280-5 passim, 295, 296 

Ninglang County 170, 172-3 

Ningxia 160, 191, 192, 193 

Ninigo Islands 136 

Norfolk Island 107, 131 

North Africa 277, 279, 282, 295, 298 

North Carolina 11, 12, 24, 25, 34, 43 

North Dakota 10, 12, 13, 15, 18, 22, 39 

Northern Cape 332 

Northern Cheyenne Reservation 12 

Northern Ireland 263, 268 

Northern Mariana Islands 134 

Northern Province (South Africa) 333 

Northern Territory (Australia) 100, 103, 
146 

North Korea see Korea, Democratic
People’s Republic of

North Ossetia-Alania 234 

North-West Province (South Africa) 332 

Northwest Territories (Canada) 12, 14, 
15, 16, 19, 22, 26, 41, 42 

Norway 263, 264, 269 

Nova Scotia 18, 35, 41, 42 

Nuba Hills 284 

Nunavut 16, 26, 41 

Nuristan 216 


Obedjiwan/Obidjewan Reserve 11 

Oceania 97, 98, 100, 104 

Odanak Reserve 25 

official language 12, 26, 27, 212, 214, 216, 
303, 310, 311, 312

Ogédei 189 

Ohio 16, 20, 22, 34, 43 

Okinawa 185, 208 

Oklahoma 10, 11, 12, 13, 15, 16, 18-22 
passim, 24, 25, 27, 36, 43 

Olympic Peninsula 12, 13, 17, 21, 24

Oman 277, 279, 280, 294 

Omdurman 281 

Ontario 12, 14, 15, 18-22 passim, 24, 34, 
41, 42 

Oregon 10, 12, 15, 16, 17, 18-24 passim, 
38, 43 

Orientale, DRC 330, 345 

Orissa 213, 217, 222, 224 

orthography see writing system 

Osaka 185, 208 

OSV 180 

Ottawa 22, 34, 41, 42 

Ottawa Valley 20, 23, 34-5 

Outer Hebrides 263, 268 

Outer Mongolia 187, 191 

OV 305, 306, 307, 332 


Pakistan 163, 193, 211-12, 216, 217, 218, 
227, 237 

Palestine 278, 279, 280, 294 

Pamlico Sound 11 

Panama 27, 61, 63, 64, 82, 83 

Pantar 97, 98, 99, 100 

Pantelleria 281 

Papago Reservation 20 

Papua 97, 98, 99, 102, 103, 104, 105, 
114-15 

Papua New Guinea 98, 99, 102-6 passim, 
108-14, 131, 135, 137, 138-9, 140-1 

Papuan Gulf 105, 106 

Paraguay 27, 62, 66, 88, 89, 90, 91, 92, 93 

Peigan Reserve 11 

pejorative names 62, 68, 71, 163, 166, 
167, 169, 174, 176, 178, 183, 302 

Peloponnese 260 

Pemba 306, 325 

Penang 103, 155 

Pennsylvania 23, 24, 27, 34-5, 43 

Persia 279 

Persian Gulf 277, 294 

Peru 27, 62, 65-71, 74, 84, 85, 90, 91, 184 

Phags-pa Lama Lodoijantsan 189 

Philippines 97, 98, 102, 103, 108, 119-20, 
131, 132, 150-1, 160, 162, 182 

phylum 9, 46, 62, 100, 111, 231, 237, 238, 
239, 260, 277, 282, 283, 285, 288, 289, 
290, 304, 306, 307, 329 

pidgin 26, 102-7 passim, 109, 217, 277, 
281, 282, 303, 309, 311, 316, 321, 326, 
328, 330, 331, 332, 333 

Pit River 10, 11 

Pitcairn Island 107, 131 

Poland 231-2, 236, 237, 240, 256 

Polonnaruwa District 214, 225 

Polynesia 97, 102, 106, 107, 132 

Pomerania 231 

Port Keats 107 

Port Moresby 102, 105, 139 

Portugal 262, 273 

Postpositions 159, 160, 193 

Powell, John Wesley 9 

prefixing languages 111, 115, 118, 120, 
121 


Pribilof Islands 10 

Prince Edward Island 18, 35, 42 
Prince of Wales Island 15, 40, 41, 42 
Prince William Sound 20, 21 
Puerto Rico 27, 66, 82 

Puget Sound 17 

Puglia 261 

Punjab 211, 212, 213, 228 

Puxian 160, 161 
Pyrénées-Atlantiques 262 


Qatar 294 

Qing dynasty 187, 189, 194 

Qing Empire 240 

Qinghai 163, 189, 191, 192, 193 

Quebec 11, 13, 14, 15, 17, 18, 19, 20, 23, 
25, 26, 34-5, 41, 42 

Queen Charlotte Islands 15, 40, 41, 42 

Queen Elizabeth Islands 41, 42 

Queensland 103, 106, 107, 146 

Quinault Reservation 17, 21 


Rajasthan 213, 228 

Rajshahi 215 

Rajya Sabha 212 

Rangoon 174, 207 

ranking of categories 46 

Rapa 110, 132 

Red River 10, 15, 36 

Red Sea 277, 280, 281, 285, 294, 296-7 

Reunion 211, 216, 225 

Rhode Island 18, 35, 43 

Rift Valley 288, 289 

Rio Grande 13, 21, 23, 37 

River Desert Reserve 23 

Rocky Mountains 13, 23, 32 

Roman Catholic Church 49, 106 

Romania 233, 251, 260 

Ross, M.D. 98, 99 

Rossell Island 136 

Russia 13, 14, 22, 185-193 passim, 216, 
278 

Russian Far East 237 

Russian Federation 41, 42, 233, 244-5, 
246-7, 250-1, 252-3 

Rwanda 312, 330, 333, 346 

Ryukyu Islands 184-5, 208 


Sabah 160, 194, 152 

Sabine River 10 

Sacramento 19, 21, 38 

Sacramento Valley 17, 19, 20, 21, 25, 38 

Sahara 277, 281, 331, 298 

Sahel 281, 282, 285 

St Kitts and Nevis 27 

St Lawrence Island 12, 23, 42 

St Lawrence River 17, 18, 20, 385 

St Lucia 27 

St Vincent 66 

St Vincent and the Grenadines 27 

Sakha/Yakutia 234 

Salinas Valley 13 

Salmon River 17, 19 

Salt River Reservation 18, 20 

Salween river 176 

Samarkand 216 

Samneua Province 180 

Samoa 104, 130, 132, 161 

San Carlos Reservation 25 

Sandaun Province 99 

sandhi 159, 160, 171, 179 

San Diego County 14 

San Fernando Valley 14 

San Francisco 13, 18, 21, 23, 27, 38, 43, 
159 

San Juan Islands 20 

San Manuel Reservation 22 

San Marino 261 

Santa Barbara 11, 12, 16 

Santa Cruz 13, 16 

Santa Cruz Islands 121, 129, 131, 135 

Santa Fe 16, 21 

Santa Isabel 106 

Santa Ynez 15 

San Xavier Reservation 20 

Sao Tomé and Principe 354 

Sapir, Edward 9, 23 

Sapuahfik 102 

Sarawak 103, 152 

Sardinia 261, 272 

Saskatchewan 10, 12, 14, 18, 21, 22, 25, 
41, 42 

Saskatchewan Reserve 10 

Saudi Arabia 277, 279, 280, 294 

Savo 106 

Scandinavia 238, 269 

Scheduled languages (India) 168, 212, 
217 

Scotland 263, 268 

script 18, 21, 163-8 passim, 170, 172, 173, 
178, 180, 181, 182, 183, 186-90 passim, 
192, 194, 216, 217, 278, 281, 282, 285, 
287 

Senegal 301, 312, 332, 333, 374 

Sequoyah (George Guest) 12 

Serbia 231, 232, 233, 236, 237, 238, 255, 
260 

Seventh Day Adventists 106 

Seward Peninsula 16 

Seychelles 211, 215, 225 

Shaanxi 160 

Shafer, Robert 163, 165, 166, 169 

Shandong 160 

Shanghai 160, 161, 201 

Shan State 160, 171, 174, 176, 180 

Shigatse Prefecture 164 

Shigiken-qutuqu 189 
Shikoku Island 185, 208 

Shilin County 172 

Shoalwater Bay Reservation 17 

Siberia 14, 187, 188, 189, 190, 231, 237, 
238, 239, 240, 244-5, 246-7 

Sichuan 161, 163, 164, 170, 172, 173, 180, 
182, 183, 192, 202 

Sicily 260, 261, 272, 281 

Siebert, Frank T. 14 

Sierra Leone 312, 332, 333, 372 

Sierra Nevada 14, 17, 18, 19, 21, 24, 25 

Sikkim 163, 164, 166, 168, 213, 222 

Siletz Reservation 10, 14, 23, 24 

Simsang River 166 

Sinai 277, 280 

Sind 211 

Singapore 18, 21, 97, 103, 121, 130, 154, 
155, 159, 160, 161, 162-3, 183, 194, 
195 

Six Nations Reserve 11, 18, 19, 20, 24 

Skokomish Reservation 24 

Slovakia 231-2, 236, 238, 256, 260 

Slovenia 232-3, 236, 255, 261, 262 

Smith Sound 17 

Snake River 19, 21 

Soboba Reservation 22 

Society Islands 106, 130, 132 

Solomon Islands 97, 98, 99, 100, 106, 108, 
109, 132, 132, 135 

Somalia 277, 285, 289, 296-7 

Somali Republic 285, 288 

Sonora 13, 25, 37 

Soqotra 281, 294 

South Africa, Republic of 309, 310, 312, 
329, 332-3, 340 

South Australia 103, 146 

South Carolina 11, 27, 34, 43 

South Dakota 10, 14, 22, 39 

South Georgia 93 

South Korea see Korea, Republic of 

South Ossetia 231, 235 

SOV 114, 118, 163, 166, 167, 169, 179, 
180, 184, 185, 187, 217, 218, 237, 283, 
288, 305, 306 

Spain 262, 273 

Spiti valley 164 

Spokane Reservation 23 

Sri Lanka 211, 214-15, 216, 225 

standardization 303 

Standing Rock Reservation 22 

stock 9, 11, 12, 13, 14, 46, 51, 52, 53, 54, 
61-72 passim, 74, 75, 99, 114, 191, 277, 
282-7 passim, 304 


Strait of Georgia 13, 23, 24 

Sudan 2, 277, 279, 281-7 passim, 289, 
290, 296, 301, 312, 314, 332 

Sula Islands 147 

Sulawesi 120, 130, 149 

Sumatra 121, 130, 154, 160-1, 176, 182-3 

Summer Institute of Linguistics 10, 49, 
50, 61, 177 

superstock 62, 64 

Suriname 27, 64, 66, 73, 75, 82, 83 

Svalbard 263-4, 269 

SVO 114, 163, 170, 174, 179, 180, 181, 
183, 236, 238, 278, 283, 308 

Swadesh, Morris 12, 61 

Swanton, John R. 13 

Swaziland 312, 340 

Sweden 263, 269 

Switzerland 2, 261, 262, 270 

Sydney 145, 146, 159 

Sylhet 215 

syllabary 285, 287, 288 

Syria 277, 278, 279, 280, 294 


Tabar 105 

Tabriz 278 

Tahiti 102, 106, 132, 161 

Taiwan (Formosa) 2, 107, 108, 159, 160, 
161, 162, 179, 182, 201, 208 

Tajikistan 200, 216, 235-239 passim, 249 

Takla Lake 11 

Tallinn 232 

Tamil Nadu 213, 217, 225 

Tampa Reservation 18 

Tanzania 282, 284, 285, 287, 289, 303, 
306, 308, 309, 310, 312, 320-5 passim, 
329, 330, 333, 342, 352 

Tasmania 100, 107, 118, 145, 146 

Tatarstan 234 

Tatatungga 189 

Teheran 278, 226 

Tennessee 20, 25, 34, 43 

Terai plains 165, 169 

Texas 10, 11, 13, 16, 17, 23, 24, 25, 27, 36, 
37, 43 

Thailand 159, 160, 161, 162, 171, 172, 
173, 174, 176-83 passim, 204-5, 206 

Tibet 163, 164, 165, 168, 169, 189, 191, 
192, 193, 200, 217 

Tiddim 168 

Tierra del Fuego 71, 92, 93 

Tigris 288, 289, 294 

time depth 51, 63 

Timor 118, 119, 148 


Timor Leste 161 

Tod valley 164 

Togo 312, 317, 318, 319, 320, 327, 331, 
352, 399, 300 

Tokyo 184, 185, 208 

tone / tonal languages 107, 111, 114, 121, 
159, 160, 162, 163, 167, 171, 172, 174, 
176, 179, 180, 181, 193 

tone sandhi 159, 160, 171, 179 

Tonga 132 

Torres Strait 103, 107 

trade languages 26, 103 

Transdnestria 231 

Trincomalee District 215, 225 

Trinidad 67, 76 

Trinidad and Tobago 27, 82, 83 

Trinity River 12, 25 

Tripoli 281, 295 

Tripura 166, 168, 171, 213, 223 

Trobriand Islands 105, 137 

Tuamotu Archipelago 102, 106, 131, 132 

Tubuai Islands 132 

Tucson 20, 25, 37 

Tulalip Reservation 17 

Tule River Reservation 25 

Tulita 11, 19 

Tunisia 281-2, 298 

Turkey 214, 216, 231, 233, 237, 238, 239, 
254, 277, 278, 279, 280 

Turkmenistan 216, 235, 236, 237, 238, 248 

Turks and Caicos Islands 27 

Turtle Mountain Reservation 18 

Tuva, Republic of 140 

Tuvalu 130, 132 

Tuvan ASSR 140 

Tyendinaga Reserve 18 

Tyva see Tuva 


Udmurtia 234 

Uganda 282, 284, 285, 289, 352 

Ukraine 234, 236, 237, 238, 251 
Umatilla Reservation 19, 21 
undocumented groups 52 
Unevangelized Fields Mission 105 
unidentified languages 62 

United Arab Emirates 294 

United Church 105 

United Kingdom 263, 268 

United States of America 7-27 passim, 
34-43, 159, 160, 173, 176, 177, 181, 185, 
278 

Upper Volta 282 

Uruguay 27, 72, 92, 93 


USSR 189, 190, 231, 234, 235, 236, 237, 
239, 240 

Utah 22, 24, 38, 39, 43 

Uttaranchal (Uttarkhand) 165, 228 

Uttar Pradesh 213, 222, 228 

Uzbekistan 200, 248-9, 216, 235-9 
passim, 280 


Vagindra 189 

Valencia 262 

Vancouver 15, 23, 40, 41, 42, 159, 162 

Vancouver Island 13, 15, 17, 19, 21, 40, 
41, 42 

Vanuatu 97, 106-8, 130, 131, 132, 133 

Vatican City 261 

Vavuniya District 215, 225 

Venezuela 27, 62-6, 68, 75-7, 82, 83 

Vermont 25, 35, 43 

Vernon River Reserve 20 

Vietnam 159, 160, 167, 169, 172, 173, 
174, 176, 177, 179, 180, 182, 183, 185, 
194, 204-5, 206 

Vilnius 232 

Virginia 20, 24, 25, 34, 36, 43 

virtual language 45 

Vizcaya 262 

VO 306, 307 

Voegelin, C.F. 9, 46 

Voegelin, Florence M. 9 

Volga 237-40 

Volta River 332, 369 

vowel harmony 237, 238, 239 

VSO 179, 278, 308 


Wales 263, 268 

Walpole Island Reserve 21 

Warm Springs Reservation 16, 21 

Washington state 12, 13, 16, 17, 19, 20, 
21, 24, 38, 39, 43, 41, 43 

Wesleyan Mission 105 

Westbank Reserve 20 

West Bengal 166, 168, 169, 213, 215, 217, 
222-3 

Western Australia 100, 101, 103, 106, 146 

Western Cape 333 

Western Desert (Australia) 103, 107 

Western Desert (North Africa) 279, 280 

West Indies 27 

West Irian 161 

West Malaysia 103, 121, 155, 175, 183, 
193, 194, 195 

West Timor 99, 105, 119, 148 

West Virginia 34, 36, 43 


Weymontachingue/Wemontachie 
Reserve 11 

Whitebear Reserve 10 

White Nile 282, 296 

Wikwemikong Reserve 20 

Wind River Reservation 10 

Wisconsin 12, 17-21, 25, 34, 39, 43 

Wood Mountain Reserve 22 

word order 160, 236, 237, 238, 278, 283, 
288, 305, 307, 308, 332 (see also OV, 
SOV, SVO, VSO, VO) 

writing system 10, 11, 159, 160, 162, 
170, 172, 179, 180, 184, 194, 212, 217, 
277, 303 (see also alphabet, Cyrillic, 
kanji, logographic writing system, 
script) 

Wurm, Stephen A. 46, 98 

Wuvulu (Maty) Islands 136 

Wycliffe Bible Translators 49 

Wyoming 10, 13, 22, 39, 43 


Xiamen 160 

Xianggang see Hong Kong 

Xinjiang 186, 187, 188, 189, 191, 192, 193 
Xishan District 170, 172 


Yakama Reservation 16, 21 

Yangtse 172, 173, 200-1 

Yankton Reservation 22 

Yasin valley 217 

Yemen 279, 280, 287, 294 

Yimen 172 

Yongren County 173, 180 

Yongshen County 170, 173, 180 

Yugoslavia 231-7 passim 

Yukon River 15, 17, 23, 41, 42 

Yukon Territory 15, 16, 19, 20, 23, 24, 41, 
42 

Yunnan 2, 160, 163, 164, 168, 170, 172, 
173, 174, 176, 179-83 passim, 191, 192, 
202-3 

Yuxi 173, 180 


Zaire 284 

Zambia 302, 312, 327, 329, 330, 333, 343 
Zamboanga 103 

Zanabazar 189 

Zangskar valley 163 

Zanzibar 306 

Zaya Pandita Namkhaijamts 189 
Zhanjiang 160 

Zhejiang 160 

Zimbabwe 312, 322, 326, 329, 333, 340 